AI Roundup 07/04 -> 07/11/2024
A new text-to-video startup to disrupt Hollywood, better tools from Anthropic for AI developers, a Math Olympiad winning LLM, and novel models for document extraction
Welcome to the weekly edition of AI Tidbits, where I curate the firehose of AI research papers and tools every week so you won’t have to.
📩 Published a new breakthrough paper? Just released an open-source package? Submit it here to ensure we don’t miss it and that it gets featured in next week’s post.
Overview
✨ Highlights (4 entries)
Language Models (11 entries)
Multimodal (3 entries)
Vision (9 entries)
Audio (1 entry)
AI Tools (3 entries)
Open-source Packages (3 entries)
Recent Deep Dives
✨ Highlights
Language Models
Multimodal
Vision
Audio
AI Tools
Open-source Packages
Pipecat - an open-source framework for voice and multimodal conversational AI
Micro Agent - an AI agent that writes code and tests for you
Plus >70 more open-source packages for AI engineers
Last week’s AI Tidbits roundup
Reach AI builders, researchers, and entrepreneurs by partnering with AI Tidbits
If you find AI Tidbits valuable, share it with a friend and consider showing your support.





![temp.mov [optimize output image] temp.mov [optimize output image]](https://substackcdn.com/image/fetch/$s_!41Un!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17bdd26c-d751-4d25-89bc-92c8ece0cbd5_600x338.gif)

![temp.mov [video-to-gif output image] temp.mov [video-to-gif output image]](https://substackcdn.com/image/fetch/$s_!Hczq!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b3cb911-fb11-402a-936a-742dda23403d_600x338.gif)
![rfDPaVdW8154XXumfyBfdZUQm9M.mp4 [optimize output image] rfDPaVdW8154XXumfyBfdZUQm9M.mp4 [optimize output image]](https://substackcdn.com/image/fetch/$s_!AS8Y!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe6afc310-a6ce-4173-8e98-6d2c43d72758_600x414.gif)






![ssstwitter.com_1720615479859.mp4 [video-to-gif output image] ssstwitter.com_1720615479859.mp4 [video-to-gif output image]](https://substackcdn.com/image/fetch/$s_!YP6V!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65ef6546-5d0c-4b77-beb9-00be6b1c8248_600x400.gif)









![אקצפ.mov [optimize output image] אקצפ.mov [optimize output image]](https://substackcdn.com/image/fetch/$s_!Vgfo!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d5e5073-82d7-4886-bb9e-69b8140c9624_600x326.gif)


![s9--d0_concat.mp4 [optimize output image] s9--d0_concat.mp4 [optimize output image]](https://substackcdn.com/image/fetch/$s_!B0Bu!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fafa9d473-b264-4b60-bff4-08616ea07e5f_600x200.gif)


Claude updates its features quite quickly. Now in the field of programming, Claude ranks first, and GPT-4o ranks second. It is very worthwhile to use for development.