AI Roundup 05/30 -> 06/06/2024
Mamba-2 sets speed records in language modeling, Stability’s new open audio model, Google's LLMs surpass adults in mental state reasoning, and an agent that completes tasks on phones autonomously
Welcome to the weekly edition of AI Tidbits, where I curate the firehose of AI research papers and tools every week so you won’t have to.
Overview
✨ Highlights (6 entries)
Language Models (6 entries)
Vision (10 entries)
Audio (1 entry)
Open-source Packages (4 entries)
Recent Deep Dives
✨ Highlights
ElevenLabs releases Sound Effects, turning text into rich sounds (Company blog)
Stability AI releases Stable Audio Open - an open-source model that generates high-quality audio samples from text descriptions (Stability AI)
Language Models
Vision
Audio
Open-source Packages
MusicGPT - generate music using natural language on your local machine
MusePose - turn an image of a human and a pose into an animated video
Ragapp - build deployable RAG-powered applications using a simple interface
Plus >70 more open-source packages for AI engineers
Last week’s AI Tidbits roundup
Reach AI builders, researchers, and entrepreneurs by partnering with AI Tidbits
If you find AI Tidbits valuable, share it with a friend and consider showing your support.