<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[AI Tidbits: Monthly's]]></title><description><![CDATA[The absolute must-know each month for those pressed for time and can only catch one AI Tidbits edition a month.

]]></description><link>https://www.aitidbits.ai/s/monthlys</link><image><url>https://substackcdn.com/image/fetch/$s_!-amS!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png</url><title>AI Tidbits: Monthly&apos;s</title><link>https://www.aitidbits.ai/s/monthlys</link></image><generator>Substack</generator><lastBuildDate>Mon, 04 May 2026 20:47:41 GMT</lastBuildDate><atom:link href="https://www.aitidbits.ai/feed" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><webMaster><![CDATA[aitidbits@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[aitidbits@substack.com]]></itunes:email><itunes:name><![CDATA[Sahar Mor]]></itunes:name></itunes:owner><itunes:author><![CDATA[Sahar Mor]]></itunes:author><googleplay:owner><![CDATA[aitidbits@substack.com]]></googleplay:owner><googleplay:email><![CDATA[aitidbits@substack.com]]></googleplay:email><googleplay:author><![CDATA[Sahar Mor]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[LinkedIn Highlights, Apr 2025]]></title><description><![CDATA[Control your computer with an open-source library, a structured multi-agent framework, a better agent for coding tasks, Karpathy's LLM tips, and financial hallucination research]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-apt-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-apt-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 04 May 2025 15:02:23 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/e93575ba-0a2c-4cfd-a15e-b917881360b6_800x451.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong>five top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes seven posts instead of five&#8212;there were just too many good ones to leave out!</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h2>Computer-Use Agent</h2><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;83c1aebc-3b45-4ded-9dfe-95839ba9a127&quot;,&quot;duration&quot;:null}"></div><p>Running OpenAI&#8217;s computer-use model just got a whole lot easier. A new open-source package called Computer-Use Agent (Cua) lets you run OpenAI&#8217;s computer-use-preview model inside a full-featured macOS virtual machine.</p><p>Until now, using OpenAI&#8217;s computer-use model meant working with limited APIs or browser-based sandboxes (like ChatGPT Operator). With Cua, you can:</p><ol><li><p>Interact with native apps like Finder, Terminal, and Final Cut Pro</p></li><li><p>Automate real desktop workflows, not just web tasks</p></li><li><p>Run everything locally for better privacy and control</p></li><li><p>Avoid the pain of wiring screenshots, actions, and VM interfaces manually</p></li></ol><p>Under the hood, Cua uses Apple&#8217;s Virtualization framework to launch macOS VMs on Apple Silicon and runs an event loop that handles clicks, typing, scrolling, and more &#8212; all based on OpenAI&#8217;s structured responses.<br><br>With Cua, you can build an AI agent that files your expenses across desktop and browser apps &#8212; moving between Excel, Chrome, and system dialogs, or launch a self-healing QA bot that installs your macOS app, navigates its UI, and reports bugs automatically.<br><br>If you're experimenting with OS-level agents, GUI automation, or reinforcement learning on real UIs, Cua provides the missing infrastructure.<br><br>GitHub repo <a href="https://github.com/trycua/cua">https://github.com/trycua/cua</a></p><div><hr></div><h2>Portia</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ExgD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ExgD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ExgD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;diagram&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="diagram" title="diagram" srcset="https://substackcdn.com/image/fetch/$s_!ExgD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Portia just open-sourced a powerful agent framework that solves the three biggest challenges of production AI agents.</p><p>The new open-source library addresses the core problems that plague most agentic frameworks with a refreshingly practical approach: separating agent planning from execution, allowing clear human oversight and structured collaboration at every step.</p><p>Package highlights:</p><ol><li><p>Multi-agent planning - the SDK uses few-shot prompting to teach your agents what successful plans look like, significantly boosting reliability.</p></li><li><p>Stateful execution - agents track their own progress and proactively request human input whenever necessary. Think: authentication requests, missing data, or asking for missing context when task execution hits an unexpected scenario.</p></li><li><p>Streamlined security - just-in-time authentication handovers ensure your agents can securely interact with popular tools like Google Calendar, Zendesk, and Hubspot without compromising credentials</p></li></ol><p>This architecture solves persistent roadblocks like unpredictable behavior, lack of human oversight, and cumbersome authentication processes, making production-ready agent deployment realistic and scalable.<br><br>It's open-source, production-ready, and works out-of-the-box with major LLM providers including OpenAI, Anthropic, Mistral, Gemini, and Azure.<br><br>GitHub repo <a href="https://github.com/portiaAI/portia-sdk-python">https://github.com/portiaAI/portia-sdk-python</a></p><div><hr></div><p>My recent post on coding with AI</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6eabfe08-1000-431f-b4f0-fe356facdc47&quot;,&quot;caption&quot;:&quot;Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Sahar&#8217;s Coding with AI guide&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-04-27T15:02:21.055Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/sahar-ai-coding&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:162210580,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:52,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>Goose - LLM-powered Agents</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!STeu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!STeu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 424w, https://substackcdn.com/image/fetch/$s_!STeu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 848w, https://substackcdn.com/image/fetch/$s_!STeu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!STeu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!STeu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg" width="1456" height="815" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:815,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;graphical user interface, text, application&quot;,&quot;title&quot;:&quot;graphical user interface, text, application&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="graphical user interface, text, application" title="graphical user interface, text, application" srcset="https://substackcdn.com/image/fetch/$s_!STeu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 424w, https://substackcdn.com/image/fetch/$s_!STeu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 848w, https://substackcdn.com/image/fetch/$s_!STeu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!STeu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Block recently launched Goose, an open-source framework for developers building AI agents, and I got to tinker with it the last few days.</p><p>Goose is a free open framework for LLM-powered agents, from DeepSeek to proprietary models from OpenAI, Google, or Anthropic.</p><p>Unlike other agent frameworks, Goose is designed for software development tasks. The framework has already proven valuable for tasks like:</p><ol><li><p>Conducting code migrations - from Ember to React or Ruby to Kotlin</p></li><li><p>Navigating new projects in unfamiliar languages - eliminating steep learning curves</p></li><li><p>Generating unit tests - quickly increasing code coverage above specific thresholds</p></li></ol><p>The neat thing about Goose is that it's extremely easy to extend its capabilities by leveraging MCP servers like Figma, Google Drive, and Asana.</p><p>Other notable frameworks for coding tasks include OpenHand and Cline.</p><p>GitHub repo <a href="https://github.com/block/goose">https://github.com/block/goose</a></p><div><hr></div><h2>Andrej Karpathy leverging LLMs</h2><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;8f1b0f54-8555-4134-96c7-2fea575fb13a&quot;,&quot;duration&quot;:null}"></div>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-highlights-apt-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Mar 2025]]></title><description><![CDATA[Document AI breakthroughs from GOT-OCR, Maestro, and Mistral, Vellum's agent autonomy framework, Skyvern's visual web automation, plus performance tips for ChatGPT and reasoning models]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-mar-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-mar-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 06 Apr 2025 15:03:10 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong><s>five</s> seven top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes seven posts instead of five&#8212;there were just too many good ones to leave out!</p><p>This post covers groundbreaking developments in AI agents and document processing, from Anthropic's foundational patterns for building effective agents to LlamaIndex's new Agentic Document Workflows. You'll learn about DeepSeek's surprising findings about prompting reasoning models, cutting-edge tools for PDF processing and web automation, and explore how LLMs handle structured table data. </p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h2>1. GOT-OCR 2.0</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8XzN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8XzN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8XzN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg" width="1456" height="814" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:814,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!8XzN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I finally had the chance to explore a new document extraction technique introduced in a paper last September. Bonus: the code and model are free to use (Apache 2.0).</p><p>This new approach, called General OCR Theory (GOT-OCR2.0), suggests a unified end-to-end model that handles tasks traditional OCR systems struggle with.</p><p>Unlike legacy OCR, which relies on complex multi-modular pipelines, GOT uses a simple encoder-decoder architecture with only 580M parameters that outperforms models 10-100&#215; larger.</p><p>Paper highlights:</p><ol><li><p>Unified architecture - a high-compression encoder paired with a long-context decoder that handles everything from scene text to complex formulas</p></li><li><p>Stunning performance - delivers nearly perfect text accuracy on documents, surpassing Qwen-VL-Max (&gt;72B) and other leading models</p></li><li><p>Versatility beyond text - processes math formulas, molecular structures, and even geometric shapes</p></li><li><p>Interactive capabilities - supports region-level recognition guided by coordinates or colors</p></li></ol><p>I just tried it out and was blown away by how it handles complex documents with mixed content types. The ability to convert math formulas from Arxiv PDFs to Mathpix format alone is worth exploring this model.</p><p>What strikes me most about GOT is how it challenges the notion that only billion-parameter LLMs can tackle complex visual tasks. <br><br>Paper + code + model can be found in their GitHub repo <a href="https://github.com/Ucas-HaoranWei/GOT-OCR2.0">https://github.com/Ucas-HaoranWei/GOT-OCR2.0</a></p><div><hr></div><p><strong>Last month&#8217;s LinkedIn Highlights</strong></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;d83b0b93-1e05-4bf0-8654-bb770edc200b&quot;,&quot;caption&quot;:&quot;Welcome to LinkedIn Highlights!&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;LinkedIn Highlights, Feb 2025&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-03-02T16:02:11.079Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/linkedin-february-2025&quot;,&quot;section_name&quot;:&quot;Monthly's&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:158212237,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:17,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>2. Six Levels of Agenic Behavior</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ggQB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ggQB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ggQB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg" width="1456" height="859" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:859,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!ggQB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I came across a new framework that brings clarity to the messy world of AI agents with a 6-level autonomy hierarchy.<br><br>While most definitions of AI agents are binary (it either is or isn't), a new framework from <a href="https://www.linkedin.com/company/vellumai/">Vellum</a> introduces a spectrum of agency that makes far more sense for the current AI landscape.<br><br>The six levels of agentic behavior provide a clear path from basic to advanced:<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 0 - &#119825;&#119854;&#119845;&#119838;-&#119809;&#119834;&#119852;&#119838;&#119837; &#119830;&#119848;&#119851;&#119844;&#119839;&#119845;&#119848;&#119856; (&#119813;&#119848;&#119845;&#119845;&#119848;&#119856;&#119838;&#119851;)<br>No intelligence&#8212;just if-this-then-that logic with no decision-making or adaptation. Examples include Zapier workflows, pipeline schedulers, and scripted bots&#8212;useful but rigid systems that break when conditions change.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 1 - &#119809;&#119834;&#119852;&#119842;&#119836; &#119825;&#119838;&#119852;&#119849;&#119848;&#119847;&#119837;&#119838;&#119851; (&#119812;&#119857;&#119838;&#119836;&#119854;&#119853;&#119848;&#119851;)<br>Shows minimal autonomy&#8212;processing inputs, retrieving data, and generating responses based on patterns. The key limitation: no control loop, memory, or iterative reasoning. It's purely reactive, like basic implementations of ChatGPT or Claude.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 2 - &#119828;&#119852;&#119838; &#119848;&#119839; &#119827;&#119848;&#119848;&#119845;&#119852; (&#119808;&#119836;&#119853;&#119848;&#119851;)<br>Not just responding but executing&#8212;capable of deciding to call external tools, fetch data, and incorporate results. This is where most current AI applications live, including ChatGPT with plugins or Claude with Function Calling. Still fundamentally reactive without self-correction.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 3 - &#119822;&#119835;&#119852;&#119838;&#119851;&#119855;&#119838;, &#119823;&#119845;&#119834;&#119847;, &#119808;&#119836;&#119853; (&#119822;&#119849;&#119838;&#119851;&#119834;&#119853;&#119848;&#119851;)<br>Managing execution by mapping steps, evaluating outputs, and adjusting before moving forward. These systems detect state changes, plan multi-step workflows, and run internal evaluations. Examples like AutoGPT or LangChain agents attempt this, though they still shut down after task completion.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 4 - &#119813;&#119854;&#119845;&#119845;&#119858; &#119808;&#119854;&#119853;&#119848;&#119847;&#119848;&#119846;&#119848;&#119854;&#119852; (&#119812;&#119857;&#119849;&#119845;&#119848;&#119851;&#119838;&#119851;)<br>Behaving like stateful systems that maintain state, trigger actions autonomously, and refine execution in real-time. These agents "watch" multiple streams and execute without constant human intervention. Cognition Labs' Devin and Anthropic's Claude Code aspire to this level, but we're still in the early days, with reliable persistence being the key challenge.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 5 - &#119813;&#119854;&#119845;&#119845;&#119858; &#119810;&#119851;&#119838;&#119834;&#119853;&#119842;&#119855;&#119838; (&#119816;&#119847;&#119855;&#119838;&#119847;&#119853;&#119848;&#119851;)<br>Creating its own logic, building tools on the fly, and dynamically composing functions to solve novel problems. We're nowhere near this yet&#8212;even the most powerful models (o1, o3, Deepseek R1) still overfit and follow hardcoded heuristics rather than demonstrating true creativity.<br><br>The framework shows where we are now: production-grade solutions up to Level 2, with most innovation happening at Levels 2-3. This taxonomy helps builders understand what kind of agent they're creating and what capabilities correspond to each level.<br><br>Full report <a href="https://www.vellum.ai/blog/levels-of-agentic-behavior">https://www.vellum.ai/blog/levels-of-agentic-behavior</a></p><div><hr></div><pre><code><code>Become a premium member to access the LLM Builders series, $1k in free credits for leading AI tools and APIs, and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>3. Skyraven</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HxMO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HxMO!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 424w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 848w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 1272w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HxMO!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif" width="600" height="427" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:427,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!HxMO!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 424w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 848w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 1272w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Traditional web automation is dying as developers waste countless hours maintaining brittle XPath selectors. Skyvern, a new open-source package, revolutionizes browser automation by combining LLMs with computer vision.</p><p>Unlike traditional automation tools that break when websites change, Skyvern uses visual understanding and natural language processing to dynamically interpret and interact with web interfaces. This enables developers to:<br><br>&#8594;  Build website-agnostic automations - create workflows that work across multiple sites without custom code</p><p>&#8594; Handle complex inference tasks - automatically reason through form responses like eligibility questions</p><p>&#8594; Execute multi-step sequences - coordinate multiple agents for tasks like authentication, navigation, and data extraction</p><p>Packages like Skyvern signal the emergence of truly adaptable web agents. Instead of hard-coded rules, we see AI systems that can understand and navigate the web like humans do - reading content, making decisions, and handling edge cases autonomously. I wrote more about it in my latest <a href="https://www.aitidbits.ai/s/ai-agents">AI Agents blog series</a>.<br><br>GitHub repo <a href="https://github.com/Skyvern-AI/skyvern">https://github.com/Skyvern-AI/skyvern</a></p><div><hr></div><h2>4. Maestro</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FhbD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FhbD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FhbD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg" width="1456" height="816" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:816,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!FhbD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-highlights-mar-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Feb 2025]]></title><description><![CDATA[Anthropic's tip for long context prompts, a curated list of agents for computer use, an app to chat with multiple LLMs, and tips to improve the performance of GPT-4.5 and o3]]></description><link>https://www.aitidbits.ai/p/linkedin-february-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-february-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 02 Mar 2025 16:02:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong><s>five</s> six top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes six posts instead of five&#8212;there were just too many good ones to leave out!</p><p>This post covers groundbreaking developments in AI agents and document processing, from Anthropic's foundational patterns for building effective agents to LlamaIndex's new Agentic Document Workflows. You'll learn about DeepSeek's surprising findings about prompting reasoning models, cutting-edge tools for PDF processing and web automation, and explore how LLMs handle structured table data. </p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h2>1. Long Context Prompting Tips</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xJU0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xJU0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xJU0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg" width="800" height="447" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:447,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!xJU0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Anthropic shared a counterintuitive yet powerful tip that boosts Claude's accuracy by 30% when working with long documents/prompts.</p><p>The secret? Place your lengthy documents (~20K+ tokens) at the TOP of your prompt before your actual query. While this might seem counterintuitive, internal tests show this simple change significantly improves response quality across all Claude models.</p><p>This becomes crucial when dealing with multiple documents. For optimal results:</p><ol><li><p>Documents first - place all your data inputs at the beginning</p></li><li><p>Structured organization - use XML tags to separate documents and metadata</p></li><li><p>Specific query - end with a clear, focused question</p></li></ol><p>As language models' context window grows in size and companies increasingly rely on LLMs to process complex datasets, reports, and documentation, this technique ensures more reliable and accurate results.</p><p>P.S. For those working with multi-document analysis, I highly recommend structuring your content with XML tags - it provides additional clarity and helps the model better understand document relationships.</p><div><hr></div><h2>2. Agents for Computer Use Repository</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!509N!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!509N!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 424w, https://substackcdn.com/image/fetch/$s_!509N!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 848w, https://substackcdn.com/image/fetch/$s_!509N!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!509N!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!509N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg" width="800" height="448" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:448,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!509N!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 424w, https://substackcdn.com/image/fetch/$s_!509N!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 848w, https://substackcdn.com/image/fetch/$s_!509N!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!509N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>While everyone's talking about AI agents, most developers and researchers are missing out on the most comprehensive collection of computer use frameworks, papers, and tools ever assembled.</p><p>Awesome Agents for Computer Use is a curated repository documenting the recent rapid progress of AI agents that can autonomously control computers through clicks, keystrokes, and API calls. From Anthropic's Claude Computer Use to Microsoft's OmniParser and Self-Operating Computer framework, it covers the entire landscape of computer control agents.</p><p>It features:</p><ul><li><p>Research papers - featuring 30+ recent publications on GUI agents, from foundational models to safety considerations</p></li><li><p>Open-source frameworks - documenting practical implementations like AutoGen, Browser Use, and OpenInterpreter</p></li><li><p>Commercial solutions - tracking industry developments from major players like Anthropic and emerging startups</p></li></ul><p>The rise of computer-controlling AI agents marks a pivotal shift in human-computer interaction. As these systems mature, we're moving towards a future where AI assistants won't just give advice - they'll directly help us accomplish complex tasks across applications and platforms (I wrote more about this topic <a href="https://www.aitidbits.ai/p/agent-responsive-design">here</a>).</p><p>Repo <a href="https://github.com/francedot/acu">https://github.com/francedot/acu</a></p><div><hr></div><pre><code><code>Become a premium member to access the LLM Builders series, $1k in free credits for leading AI tools and APIs, and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>3. GPT-4.5 Pro Tip</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3A5R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3A5R!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 424w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 848w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 1272w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3A5R!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif" width="800" height="507" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:507,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!3A5R!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 424w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 848w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 1272w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Pro tip from the research team @ OpenAI on how to improve GPT-4.5 performance: add the following text to the start of your system message.</p><pre><code><em>You are a highly capable, thoughtful, and precise assistant. Your goal is to deeply understand the user's intent, ask clarifying questions when needed, think step-by-step through complex problems, provide clear and accurate answers, and proactively anticipate helpful follow-up information. Always prioritize being truthful, nuanced, insightful, and efficient, tailoring your responses specifically to the user's needs and preferences.</em></code></pre><p>OpenAI internal evals show it results in better performance.</p><p>Try it out in the OpenAI Playground <a href="https://platform.openai.com/playground/chat?preset=7CywXwBqWRC5quhkU9LEFv6A">https://platform.openai.com/playground/chat?preset=7CywXwBqWRC5quhkU9LEFv6A</a></p><div><hr></div><h2>4. Chorus - <strong>Chat with Multiple AIs on Your Desktop</strong></h2>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-february-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Jan 2025]]></title><description><![CDATA[From Anthropic's agent patterns to DeepSeek's reasoning breakthroughs, plus innovative tools for document workflows, PDF processing, table understanding, SQL generation, and web automation]]></description><link>https://www.aitidbits.ai/p/january-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/january-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 09 Feb 2025 16:01:08 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong><s>five</s> eight top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes eight posts instead of five&#8212;there were just too many good ones to leave out!</p><p>This post covers groundbreaking developments in AI agents and document processing, from Anthropic's foundational patterns for building effective agents to LlamaIndex's new Agentic Document Workflows. You'll learn about DeepSeek's surprising findings about prompting reasoning models, cutting-edge tools for PDF processing and web automation, and explore how LLMs handle structured table data. </p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h1>1. Building Effective AI Agents by Anthropic</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3u7j!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3u7j!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3u7j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!3u7j!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Anthropic clarifies the fuzzy definition of AI Agents by introducing a critical architectural distinction: workflows are systems with predefined code paths, while agents dynamically direct their own processes.<br><br>After building agents for a year, they identified five fundamental patterns that drive successful agentic implementations:</p><ol><li><p>Prompt chaining - breaking tasks into sequential steps, useful for complex operations like content generation and translation</p></li><li><p>Routing - directing inputs to specialized handlers, perfect for customer service and model optimization</p></li><li><p>Parallelization - running subtasks simultaneously through sectioning or voting, ideal for code review and content moderation</p></li><li><p>Orchestrator-workers - using a central LLM to coordinate task delegation, essential for complex coding projects</p></li><li><p>Evaluator-optimizer - implementing feedback loops for iterative refinement, perfect for improving search results</p></li></ol><p>Success isn't about building the most sophisticated system - it's about choosing the right pattern for your specific needs. Start simple, measure performance, and only add complexity when simpler solutions fall short.<br><br>Anthropic&#8217;s post (highly recommend reading it) <a href="https://www.anthropic.com/research/building-effective-agents">https://www.anthropic.com/research/building-effective-agents</a></p><div><hr></div><h1>2. Agentic Document Workflows</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jlGR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jlGR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jlGR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!jlGR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>LlamaIndex just unveiled a new approach involving AI agents for reliable document processing, from processing invoices to insurance claims and contract reviews.<br><br><a href="https://www.linkedin.com/company/llamaindex/">LlamaIndex</a>&#8217;s new architecture, Agentic Document Workflows (ADW), goes beyond basic retrieval and extraction to orchestrate end-to-end document processing and decision-making. Imagine a contract review workflow: you don't just parse terms, you identify potential risks, cross-reference regulations, and recommend compliance actions.<br><br>This level of coordination requires an agentic framework that maintains context, applies business rules, and interacts with multiple system components.<br><br>Here&#8217;s how ADW works at a high level:</p><ol><li><p>Document parsing and structuring &#8211; using robust tools like LlamaParse to extract relevant fields from contracts, invoices, or medical records.</p></li><li><p>Stateful agents &#8211; coordinating each step of the process, maintaining context across multiple documents, and applying logic to generate actionable outputs.</p></li><li><p>Retrieval and reference &#8211; tapping into knowledge bases via LlamaCloud to cross-check policies, regulations, or best practices in real-time.</p></li><li><p>Actionable recommendations &#8211; delivering insights that help professionals make informed decisions rather than just handing over raw text.</p></li></ol><p>ADW provides a path to building truly &#8220;intelligent&#8221; document systems that augment rather than replace human expertise. From legal contract reviews to patient case summaries, invoice processing, and insurance claims management&#8212;ADW supports human decision-making with context-rich workflows rather than one-off extractions.<br><br>Ready to use notebooks <a href="https://github.com/run-llama/llamacloud-demo/tree/main/examples/document_workflows">https://github.com/run-llama/llamacloud-demo/tree/main/examples/document_workflows</a><br><br>More open-source tools for AI agent developers in my recent blog post <a href="https://www.aitidbits.ai/p/open-source-agents">https://www.aitidbits.ai/p/open-source-agents</a></p><div><hr></div><pre><code><code>Become a premium member to access the LLM Builders series, $1k in free credits for leading AI tools and APIs, and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h1>3. Vanna</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dwQC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dwQC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dwQC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg" width="1456" height="841" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:841,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!dwQC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Turning questions into SQL statements has been an area of research and development for over a decade. An open-source package called Vanna uses a retrieval augmented generation approach to crack the code.<br><br>Vanna is a Python-based, MIT-licensed framework that allows non-data folks to interact and ask questions about their SQL databases.<br><br>At its core, Vanna employs a RAG model that leverages a large corpus of data, including a diverse range of SQL queries and their natural language descriptions. When a query is received, Vanna searches this corpus to find similar queries and their corresponding SQL translations. This step enables Vanna to understand the context and structure of the query better.<br><br>Using the insights gained from the retrieved examples, Vanna generates the SQL query that matches the user's natural language request. This involves structuring the select statements, where clauses, joins, and other SQL components are based on the intent and requirements identified in the initial query.<br><br>Pro tip: Vanna often hallucinates when it doesn't know the content of your table's columns. I therefore recommend providing a few examples through the train() method.<br><br>The GitHub repo already includes ready-to-use templates to deploy Vanna in Slack, Streamlit, or a Flask endpoint <a href="https://github.com/vanna-ai/vanna">https://github.com/vanna-ai/vanna</a></p><div><hr></div><h1>4. DeepSeek &amp; How to prompt reasoning models</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LK9x!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LK9x!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 424w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 848w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LK9x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg" width="691" height="537.9373996789727" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1246,&quot;resizeWidth&quot;:691,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!LK9x!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 424w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 848w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>DeepSeek's new model doesn't just beat GPT-4o - it proves &#119856;&#119838;'&#119855;&#119838; &#119835;&#119838;&#119838;&#119847; &#119849;&#119851;&#119848;&#119846;&#119849;&#119853;&#119842;&#119847;&#119840; &#119851;&#119838;&#119834;&#119852;&#119848;&#119847;&#119842;&#119847;&#119840; &#119846;&#119848;&#119837;&#119838;&#119845;&#119852; &#119856;&#119851;&#119848;&#119847;&#119840; &#119834;&#119845;&#119845; &#119834;&#119845;&#119848;&#119847;&#119840;.</p>
      <p>
          <a href="https://www.aitidbits.ai/p/january-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Dec 2024]]></title><description><![CDATA[Claude&#8217;s new PDF API, a playground to build with the new Gemini Realtime Multimodal API, open multimodal vision models from Meta, an open-source Perplexity alternative, and easy LLM fine-tuning]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-dec-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-dec-2024</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Thu, 02 Jan 2025 16:00:45 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b46c580b-68d6-4b07-ae11-27a0193832f3_800x500.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong>five top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry.</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h1>1. MindSearch</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!D0dY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!D0dY!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 424w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 848w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 1272w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!D0dY!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif" width="656" height="370.64" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:339,&quot;width&quot;:600,&quot;resizeWidth&quot;:656,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!D0dY!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 424w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 848w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 1272w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>An open-source search engine is rivaling top-tier AI products like <a href="http://perplexity.ai/">Perplexity.ai</a> Pro and ChatGPT web search.</p><p>MindSearch is an innovative AI search engine framework that combines LLMs and a multi-agent system to tackle three critical issues that often limit LLM-powered search engines:</p><ol><li><p>LLMs struggle to decompose complex queries into simpler, actionable requests</p></li><li><p>Search results often contain too much noise, making it hard to filter and extract relevant information</p></li><li><p>Iterative searches can quickly overload the LLM&#8217;s input length capacity</p></li></ol><p>MindSearch utilizes two main components:</p><ul><li><p>WebPlanner - decomposes complex queries into sub-tasks and creates a dynamic graph structure for problem-solving</p></li><li><p>WebSearcher - conducts fine-grained searches and delivers summarized information back to WebPlanner for further refinement</p></li></ul><p>This approach allows MindSearch to handle massive web content (e.g., more than 300 pages) effectively, surpassing limitations faced by traditional LLM-based search systems.</p><p>Code <a href="https://github.com/InternLM/MindSearch">https://github.com/InternLM/MindSearch</a></p><div><hr></div><h1>2. Gemini Multimodal Playground</h1><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;71791ce7-ddf9-425a-90dd-2ea0ccb32d9a&quot;,&quot;duration&quot;:null}"></div><p>Holiday coding project: Build voice agents that can see with Google's new Gemini 2.0 model and my new real-time Multimodal Playground repo.</p><p>The playground implements voice and video-based interactions with the new Gemini model, allowing natural conversations in real-time while solving the critical background noise challenge using Voice Activity Detection (VAD).</p><p>In the last few days, I added a full-stack web app to interact with Gemini (see video below) along with a standalone script for those eager to quickly dive into building real-time voice agents.</p><p>Google&#8217;s real-time Gemini model is a game-changer, enabling you to independently create production-ready voice agents for industries like customer service, education, and healthcare in a matter of days.</p><p>Happy holidays. Go build! <a href="https://github.com/saharmor/gemini-multimodal-playground">https://github.com/saharmor/gemini-multimodal-playground</a></p><div><hr></div><h1>3. Meta Apollo</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!H-EB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!H-EB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 424w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 848w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!H-EB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg" width="1456" height="817" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:817,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!H-EB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 424w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 848w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Video understanding has been lagging behind text, image, and audio modalities&#8212;until now.</p><p>Meta and Stanford researchers unveiled Apollo, a new family of state-of-the-art video-centric large multimodal models (video-LMMs) designed to close this gap. Unlike prior efforts, Apollo sets a new standard by efficiently analyzing hour-long videos and achieving breakthrough results on multiple benchmarks.</p><p>Paper highlights:</p><ol><li><p>Scaling Consistency - design decisions made with smaller models transfer reliably to larger ones, drastically cutting computational costs</p></li><li><p>Advanced video sampling techniques - Apollo uses FPS sampling, outperforming traditional uniform sampling methods</p></li><li><p>Streamlined evaluation - the new ApolloBench benchmark evaluating video-LMMs efficiently, reducing evaluation time by 41x while maintaining accuracy</p></li></ol><p>Apollo&#8217;s superior video comprehension capabilities pave the way for breakthroughs like real-time video summarization for content creators, better temporal reasoning for medical diagnostics, and enhanced video analytics for autonomous driving.</p><p>With Apollo, video understanding might finally catch up to its multimodal counterparts.<br><br>Project page <a href="https://apollo-lmms.github.io/">https://apollo-lmms.github.io</a></p><div><hr></div><h1>4. Claude PDF API</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZlDd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZlDd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg" width="1456" height="1064" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1064,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!ZlDd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Anthropic has introduced a powerful new PDF-processing feature in its Claude API, surpassing basic text extraction, and it has largely flown under the radar.</p><p>Historically, many LLMs stumble when documents include complex elements like images, charts, and LaTeX formulas. But Anthropic&#8217;s latest upgrade manages to parse both textual and visual content within a PDF&#8212;no extra coding wizardry needed.</p><p>Key capabilities include:</p><ol><li><p>Automatically parsing PDF text, images, and tables for further analysis, from answering questions about the attached PDF to turning unstructured data into formatted JSONs</p></li><li><p>Providing insight on charts and diagrams by evaluating visual context, not just textual tags</p></li><li><p>Extracting and interpreting LaTeX for scientific or technical documentation</p></li></ol><p>It works by splitting each PDF into two components: the text is extracted as normal, and the entire page is converted into an image. Claude then merges text and visual context for a more holistic understanding. It&#8217;s essentially combining LLM intelligence with basic computer vision techniques.</p><p>The API supports up to 32MB or 100 pages of PDF content and pricing is similar to the LLM pricing so there&#8217;s no premium cost for PDF analysis.</p><p>This API could dramatically streamline how we handle financial reports, legal docs, or any PDF requiring detailed interpretation.<br></p><p>Ready-to-run notebook analyzing Anthropic's constitutional AI paper here <a href="https://github.com/anthropics/anthropic-cookbook/blob/main/misc/pdf_upload_summarization.ipynb">https://github.com/anthropics/anthropic-cookbook/blob/main/misc/pdf_upload_summarization.ipynb</a></p><div><hr></div><h1>5. LLaMa-Factory</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hzZb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hzZb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hzZb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg" width="1456" height="816" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:816,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!hzZb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When is it better to fine-tune a language model than using prompt engineering or RAG? Here&#8217;s a clear framework you can apply along with an open-source library I use for fine-tuning.</p><p>Good reasons to fine-tune:</p><ol><li><p>Emphasizing knowledge that already exists in the model - for instance, in a text-to-SQL task, fine-tuning can be used to emphasize specific SQL dialects or to avoid error-prone edge cases, utilizing the comprehensive understanding of SQL syntax, dialects, and database functionality that the model already possesses.</p></li><li><p>Customizing the structure or tone of responses - fine-tuning can modify the structure or tone of a model's output, such as making the model output valid JSON, which is beneficial for programmatic interactions where handling invalid JSON could lead to many downstream error cases. This includes fine-tuning a model to your company&#8217;s writing style.</p></li><li><p>Teaching a model very complex instructions - fine-tuning allows for showing the model many more examples than can be included in a model's context window, which is helpful for complex instructions. This leads to cheaper and faster inference.</p></li></ol><p>Wrong reasons to fine-tune:</p><ol><li><p>Adding new knowledge to the base model - the knowledge in a large language model is established during the pre-training runs. New knowledge can't effectively be introduced during the limited scope of fine-tuning. RAG is better suited in such cases.</p></li><li><p>Quickly iterating on a new use-case - fine-tuning involves a slower feedback loop and requires substantial investment in creating the dataset and other aspects of the fine-tuning process. Therefore, it's not suitable for rapid iteration of new use cases.</p></li></ol><p>My preferred tool for fine-tuning open language models is LLaMA-Factory. It features 100+ different large language models, including Meta&#8217;s Llama-2, Google&#8217;s Gemma, and Mistral&#8217;s Mixtral. It also supports advanced algorithms like LoRA, QLoRA, and GaLore for optimized performance.<br><br>GitHub repo <a href="https://github.com/hiyouga/LLaMA-Factory">https://github.com/hiyouga/LLaMA-Factory</a></p><div><hr></div><p><strong>Last month&#8217;s LinkedIn Highlights</strong></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;caa2247c-7336-41c5-a815-99a63469778b&quot;,&quot;caption&quot;:&quot;Something different today: Rather than our usual Thursday roundup, I'll take a slight detour to share some in-depth insights about AI Agents that have occupied my mind lately. For the next two weeks, expect more of Sahar's 2&#162; pieces.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;LinkedIn Highlights, Oct 2024&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-07T15:30:18.165Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/linkedin-october-2024&quot;,&quot;section_name&quot;:&quot;Monthly's&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:150965620,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:17,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p></p>]]></content:encoded></item><item><title><![CDATA[October 2024 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[New open-source powerful video generation models, agentic frameworks and tools to control computers and smartphones from Apple and Anthropic, and open-source NotebookLM]]></description><link>https://www.aitidbits.ai/p/october-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/october-2024</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 10 Nov 2024 16:01:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!lOTV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where we curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>October marked a pivotal moment for agentic AI, as interest from industry and academia reached new heights, alongside groundbreaking video generation releases that democratized previously exclusive capabilities.</p><p>Genmo's release of Mochi 1 and Rhymes AI's Allegro led the charge, bringing commercial-grade text-to-video generation into the open-source domain. Meanwhile, industry giants made significant moves - Anthropic released Claude 3.5 Sonnet and Haiku with unprecedented computer use capabilities, GitHub expanded Copilot with Claude 3.5 and Gemini 1.5 integration, and OpenAI introduced Canvas for real-time writing and coding assistance.</p><p>The open-source community continued its remarkable momentum, with Nvidia's Llama-3.1-nemotron surpassing GPT-4 and Claude 3.5 on key benchmarks, while Apple made waves by open-sourcing Depth Pro and Ferret-UI 2, pushing the boundaries of on-device AI capabilities.</p><p>Multimodal AI saw further developments with Rhymes AI's Aria, the first open-source mixture-of-experts multimodal model, and Meta's innovative Spirit LM, combining text and speech capabilities.</p><p>These breakthroughs and numerous advances in autonomous agents, audio generation, and AI tools paint a picture of rapid democratization across AI domains.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>&#10024; <strong>Special Feature</strong>: Open Video Generation</p></li><li><p>Industry announcements (10 entries)</p></li><li><p>Large Language Models</p><ul><li><p>Open-source (9 entries)</p></li><li><p>Research (8 entries)</p></li></ul></li><li><p>Autonomous Agents (8 entries)</p></li><li><p>Multimodal (7 entries)</p></li><li><p>Image and Video (9 entries)</p></li><li><p>Audio (3 entries)</p></li><li><p>AI Tools (3 entries)</p></li><li><p>Open-source Packages (7 entries)</p><div><hr></div></li></ul><h2><strong>&#10024; Special Feature: Open-source Video Generation</strong></h2><ol><li><p><a href="https://www.genmo.ai/blog?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Genmo openly releases Mochi 1 - a text-to-video model delivering smooth 30fps videos with precise motion and accurate prompt adherence, with downloadable weights on Hugging Face and a commercially permissive license</a></p></li><li><p><a href="https://rhymes.ai/blog-details/allegro-advanced-video-generation-model?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Rhymes AI releases Allegro - a 2.8B open-source model capable of generating cinematic 6-second videos from text prompts at 15 FPS and 720p resolution</a></p></li><li><p><a href="https://ai.meta.com/research/movie-gen/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Meta AI presents MovieGen - a next-gen model family that generates HD personalized videos and synchronized audio from text prompts, enabling users to create and edit videos featuring their own faces</a></p></li><li><p><a href="https://pyramid-flow.github.io/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Peking University develops a new method for efficient video generation, producing smooth, high-quality 10-second videos in 768p resolution at 24 FPS</a></p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lOTV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lOTV!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!lOTV!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!lOTV!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!lOTV!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lOTV!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif" width="600" height="338" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0993cf71-6782-4e24-979d-0be87b003204_600x338.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:338,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!lOTV!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!lOTV!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!lOTV!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!lOTV!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0993cf71-6782-4e24-979d-0be87b003204_600x338.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Mochi is the leading open-source text2video model, <a href="https://x.com/ArtificialAnlys/status/1850987133563216118">outperforming Runway, Pika, and Luma Labs</a></figcaption></figure></div><h2><strong><br>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;e8f337dc-5f71-4386-a10c-547f735f3ca1&quot;,&quot;caption&quot;:&quot;I&#8217;m excited to share a new Deep Dive after a short hiatus.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Great AI Consolidation&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-09-29T15:01:08.322Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/35bffcff-1d4f-4670-a10e-7af96da00945_2020x1406.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/the-great-ai-consolidation&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:149185115,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:37,&quot;comment_count&quot;:5,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;2b51ee44-4c84-415d-b205-d86f88f36d3d&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;12 techniques to reduce your LLM API bill and launch blazingly fast products&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-01-13T15:30:11.977Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/28ffae47-5e98-4f7d-998d-e8f5f9841f69_2202x1416.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/reduce-llm-latency-and-cost&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140635380,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:73,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;473872eb-ba6c-481b-ae8d-6a0c4b79ab87&quot;,&quot;caption&quot;:&quot;This is a re-post of my guest post in Artificial Intelligence Made Simple https://www.aitidbits.ai/cp/141205235 &#8212; I started my career in the cybersecurity space. Dancing the endless dance of deploying defense mechanisms only to be hijacked by a more brilliant attacker a few months later. Hacking language models and language-powered applications are no dif&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;[cross-post] 7 methods to secure LLM apps from prompt injections and jailbreaks&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-09T19:28:11.316Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b82f8cc-62e9-4032-9fb5-5b643a6624ee_2256x1260.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/mitigate-prompt-attacks&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141512513,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><h2><strong><br>Industry announcements</strong></h2><ol><li><p><a href="https://www.anthropic.com/news/3-5-models-and-computer-use?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic releases new versions of its LLMs Claude 3.5 Sonnet and Haiku, boasting top-tier performance in coding and problem-solving, surpassing OpenAI o1-preview, along with new human-like software interaction capabilities, enabling the model to click, type, and automate tasks directly through GUIs</a></p></li><li><p><a href="https://openai.com/index/introducing-canvas/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI introduces Canvas - a visual interface that simplifies real-time edits on writing and coding tasks within ChatGPT</a></p></li><li><p><a href="https://x.com/OpenAIDevs/status/1846972985170972923?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI releases Chat Completions API with support for text and audio, enabling both asynchronous audio experiences and real-time interactions</a></p></li><li><p><a href="https://blog.google/technology/ai/notebooklm-update-october-2024/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Google elevates its podcast-generating NotebookLM, introducing customizable Audio Overviews and giving users the ability to fine-tune AI summaries with specific instructions</a>&nbsp;</p></li><li><p><a href="https://github.blog/news-insights/product-news/bringing-developer-choice-to-copilot?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">GitHub unveils multi-model Copilot in its annual developer conference, adding Claude 3.5 and Gemini 1.5, as well as launches GitHub Spark, an AI-native tool that builds micro web apps entirely through natural language with no coding required</a></p></li><li><p><a href="https://runwayml.com/research/introducing-act-one?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Runway releases Act-One - a cutting-edge tool for transforming simple video and voice inputs into expressive character performances</a></p></li><li><p><a href="https://x.com/elevenlabsio/status/1849083718838657186?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">ElevenLabs unveils Voice Design, allowing users to create unique voices from a text prompt alone</a></p></li><li><p><a href="https://about.ideogram.ai/canvas?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Ideogram releases Canvas - an AI-powered image editor offering inpainting and outpainting capabilities, outperforming competitors like Midjourney</a></p></li><li><p><a href="https://www.sequoiacap.com/article/generative-ais-act-o1?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Sequoia Capital publishes a report on the evolution of generative AI, highlighting the shift from fast, pattern-based responses ("System 1 thinking") to deliberate reasoning at inference time ("System 2 thinking")</a></p></li><li><p><a href="https://darioamodei.com/machines-of-loving-grace?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic's CEO presents a hopeful vision for AI, predicting breakthroughs in health, economics, and governance if AI&#8217;s potential is harnessed correctly</a></p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1XsJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1XsJ!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!1XsJ!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!1XsJ!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!1XsJ!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1XsJ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif" width="674" height="379.68666666666667" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:338,&quot;width&quot;:600,&quot;resizeWidth&quot;:674,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!1XsJ!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!1XsJ!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!1XsJ!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!1XsJ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc505177f-b3ec-4668-b65e-6d365fd9723d_600x338.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Claude Computer Use scheduling a meeting autonomously</figcaption></figure></div><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs like Claude, Replicate, and Hugging Face. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h2><strong><br>Large Language Models</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Nvidia releases Llama-3.1-nemotron-70b - a language model that outperforms GPT-4o and Claude 3.5 Sonnet on instruction following benchmarks like AlpacaEval and MT-Bench, allowing commercial use</a>&nbsp;</p></li><li><p><a href="https://mistral.ai/news/ministraux/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral releases Ministral 3B and 8B models for edge computing, pushing new limits in reasoning and function-calling within the sub-10B range, outperforming Llama 3 8B and Mistral 7B on instruction-following benchmarks</a>&nbsp;</p></li><li><p><a href="https://ai.meta.com/blog/meta-llama-quantized-lightweight-models/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Meta releases quantized Llama 3.2 models, delivering faster on-device AI processing with reduced size and memory use for mobile deployment</a></p></li><li><p><a href="https://github.com/meta-llama/llama-recipes/tree/main/recipes/quickstart/NotebookLlama?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Meta releases an open-source replica of Google&#8217;s NotebookLM called NotebookLlama, offering an open-source framework using Llama models and text-to-speech tools to generate podcast-style audio from PDFs</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/october-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Oct 2024]]></title><description><![CDATA[Turning unstructured docs to structured data with language models, a platform for generating Windows-operating agents, open-source text-to-speech, answering questions over databases, and LLM security]]></description><link>https://www.aitidbits.ai/p/linkedin-october-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-october-2024</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Thu, 07 Nov 2024 15:30:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Something different today: Rather than our usual Thursday roundup, I'll take a slight detour to share some in-depth insights about AI Agents that have occupied my mind lately. For the next two weeks, expect more of <a href="https://www.aitidbits.ai/s/deep-dives">Sahar's 2&#162;</a> pieces.</p><p>This deep dive series will probably launch next week, so I'm fast-tracking the <em>LinkedIn Highlights</em> post.</p><div><hr></div><p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong>five top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry.</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h1>1. Sparrow</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GZ2g!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GZ2g!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg 424w, https://substackcdn.com/image/fetch/$s_!GZ2g!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg 848w, https://substackcdn.com/image/fetch/$s_!GZ2g!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!GZ2g!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GZ2g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg" width="1456" height="839" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:839,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!GZ2g!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg 424w, https://substackcdn.com/image/fetch/$s_!GZ2g!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg 848w, https://substackcdn.com/image/fetch/$s_!GZ2g!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!GZ2g!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34a63e77-9acf-48d9-b151-43177e0b0f84_2048x1180.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A new open-source project called Sparrow simplifies the challenging task of extracting structured data from unstructured documents like forms, invoices, and images using machine learning and LLM pipelines.<br><br>Its modular and pluggable architecture lets you seamlessly integrate tools like LlamaIndex, Haystack, and Unstructured for customizable data processing workflows. Whether you're processing PDFs or extracting content from images, Sparrow provides independent agents for each task.<br><br>Sparrow's standout feature is its ability to let users build and deploy LLM agents through a simple API, making integration into your systems seamless and efficient. It even supports local LLM execution using Ollama or Apple MLX.<br><br>Key agents include:</p><ul><li><p>llamaindex - PDF processing with LlamaIndex</p></li><li><p>vprocessor - OCR + LlamaIndex for image processing</p></li><li><p>haystack - PDF processing with Haystack</p></li><li><p>unstructured-light - PDF and image processing with Unstructured and LangChain</p><p></p></li></ul><p>GitHub repo <a href="https://github.com/katanaml/sparrow">https://github.com/katanaml/sparrow</a></p><div><hr></div><h1>2. Windows Agent Arena</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Or2n!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Or2n!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!Or2n!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!Or2n!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!Or2n!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Or2n!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif" width="666" height="375.18" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:338,&quot;width&quot;:600,&quot;resizeWidth&quot;:666,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;1727127118262.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="1727127118262.mp4 [optimize output image]" title="1727127118262.mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!Or2n!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!Or2n!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!Or2n!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!Or2n!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d5d7f6c-72b3-4a60-88b1-cb49bceb08e7_600x338.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Are AI Agents coming to Windows? Microsoft just released an open-source project for developers to build autonomous agents for its Windows operating system.</p><p>As part of the release, Microsoft open-sourced&nbsp;<a href="https://github.com/microsoft/OmniParser">Omniparser</a>, the current top-performing screen understanding model in their benchmark.<br><br>A ready Windows OS environment ensures agents perform optimally in real-world conditions. Microsoft also integrated it with Azure ML so multiple agents can run in parallel and complete their tasks in minutes rather than days, thanks to cloud scaling.<br><br>Code <a href="https://github.com/microsoft/WindowsAgentArena">https://github.com/microsoft/WindowsAgentArena</a></p><div><hr></div><pre><code><code>Become a premium member to get full access to my content and $1k+ in free credits for leading AI tools and APIs, including Claude, Hugging Face, Deepgram. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Support AI Tidbits as a premium member&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Support AI Tidbits as a premium member</span></a></p><div><hr></div><h1>3. ChatTTS</h1><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;334f5271-086c-476e-8a95-32f1c29adc17&quot;,&quot;duration&quot;:null}"></div><p>A new breakthrough in text-to-speech technology is here: ChatTTS.<br>Explicitly designed for dialogue-based scenarios like LLM assistants, ChatTTS pushes the boundaries of conversational AI with the ability to generate natural, expressive speech.<br><br>ChatTTS is optimized for multi-speaker dialogue tasks, making it ideal for AI assistants and interactive conversation models. The model also allows fine-grained prosody control, such as pauses, laughter, and interjections, significantly enhancing the expressiveness of synthesized speech.<br><br>Its ability to predict and replicate natural speech patterns surpasses many open-source TTS models.<br><br>ChatTTS was trained on 100,000+ hours of English and Chinese audio and is open-sourced for research use.<br><br>Repo <a href="https://github.com/2noise/ChatTTS">https://github.com/2noise/ChatTTS</a><br>Example notebook <a href="https://github.com/2noise/ChatTTS/blob/main/examples/ipynb/example.ipynb">https://github.com/2noise/ChatTTS/blob/main/examples/ipynb/example.ipynb</a></p><div><hr></div><h1>4. Table-Augmented Generation</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hmWs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hmWs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hmWs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hmWs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hmWs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hmWs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg" width="1456" height="853" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/baedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:853,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!hmWs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hmWs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hmWs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hmWs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-october-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Sep 2024]]></title><description><![CDATA[A Perplexity-like open-source package, ready-to-run notebooks for advanced RAG techniques, a GPT-powered OCR, Anthropic's tool to refine prompts, and a novel RAG method]]></description><link>https://www.aitidbits.ai/p/linkedin-september-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-september-2024</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 13 Oct 2024 15:01:03 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!oGCb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Introducing: AI Tidbits LinkedIn Highlights</em></p><p>Welcome to a new AI Tidbits series! Each month, I'll share my <strong>five top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry.</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h1>1. MindSearch</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oGCb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oGCb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif 424w, https://substackcdn.com/image/fetch/$s_!oGCb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif 848w, https://substackcdn.com/image/fetch/$s_!oGCb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif 1272w, https://substackcdn.com/image/fetch/$s_!oGCb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oGCb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif" width="718" height="405.67" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:339,&quot;width&quot;:600,&quot;resizeWidth&quot;:718,&quot;bytes&quot;:778273,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oGCb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif 424w, https://substackcdn.com/image/fetch/$s_!oGCb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif 848w, https://substackcdn.com/image/fetch/$s_!oGCb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif 1272w, https://substackcdn.com/image/fetch/$s_!oGCb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd1ec5ceb-7867-4380-ace1-a3328157bc77_600x339.gif 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A new open-source search engine is rivaling top-tier AI products like <a href="http://perplexity.ai/">Perplexity.ai</a> Pro and ChatGPT-Web.</p><p>MindSearch is an innovative AI search engine framework that combines LLMs and a multi-agent system to tackle three critical issues that often limit LLM-powered search engines:</p><ol><li><p>LLMs struggle to decompose complex queries into simpler, actionable requests</p></li><li><p>Search results often contain too much noise, making it hard to filter and extract relevant information</p></li><li><p>Iterative searches can quickly overload the LLM&#8217;s input length capacity<br></p></li></ol><p>MindSearch utilizes two main components:</p><ul><li><p>WebPlanner - decomposes complex queries into sub-tasks and creates a dynamic graph structure for problem-solving</p></li><li><p>WebSearcher - conducts fine-grained searches and delivers summarized information back to WebPlanner for further refinement<br></p></li></ul><p>This approach allows MindSearch to handle massive web content (e.g., more than 300 pages) effectively, surpassing limitations faced by traditional LLM-based search systems.<br><br>According to subjective evaluations from human experts, MindSearch significantly outperforms major search engines like ChatGPT-Web and <a href="http://perplexity.ai/">Perplexity.ai</a> Pro. Its superior depth, breadth, and factual accuracy make it a breakthrough solution for both open-set and closed-set QA tasks.<br><br>Technical report <a href="https://arxiv.org/abs/2407.20183">https://arxiv.org/abs/2407.20183</a><br>Code <a href="https://github.com/InternLM/MindSearch">https://github.com/InternLM/MindSearch</a> </p><div><hr></div><h1>2. Advanced RAG techniques</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AtKj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AtKj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg 424w, https://substackcdn.com/image/fetch/$s_!AtKj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg 848w, https://substackcdn.com/image/fetch/$s_!AtKj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!AtKj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AtKj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg" width="700" height="390.86538461538464" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:813,&quot;width&quot;:1456,&quot;resizeWidth&quot;:700,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!AtKj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg 424w, https://substackcdn.com/image/fetch/$s_!AtKj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg 848w, https://substackcdn.com/image/fetch/$s_!AtKj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!AtKj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb5c1c84c-aa57-4405-9df2-a7ff48efc873_2048x1144.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A new GitHub repository provides the most comprehensive RAG tutorials you&#8217;ll find, showcasing advanced techniques to enhance the accuracy, efficiency, and contextual richness of RAG systems.</p><p>The repository offers easy-to-start notebooks covering methods like:</p><ul><li><p>Reliable RAG &#8211; refining and validating retrieved information for better accuracy</p></li><li><p>Proposition Chunking &#8211; breaking down text into meaningful sentences for improved control over query handling</p></li><li><p>Query Transformations &#8211; optimizing queries by rewriting and decomposing complex ones into sub-queries.</p></li><li><p>Semantic Chunking &#8211; dividing documents based on semantic coherence for more meaningful retrieval.</p></li></ul><p>GitHub repo <a href="https://github.com/NirDiamant/RAG_Techniques">https://github.com/NirDiamant/RAG_Techniques</a></p><pre><code><code>Become a premium member to get full access to my content and $1k+ in free credits for leading AI tools and APIs, including Claude, Hugging Face, Deepgram. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Support AI Tidbits as a premium member&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.aitidbits.ai/subscribe"><span>Support AI Tidbits as a premium member</span></a></p><h1>3. Zerox OCR</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oZpP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oZpP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oZpP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oZpP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!oZpP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oZpP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg" width="700" height="390.86538461538464" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:813,&quot;width&quot;:1456,&quot;resizeWidth&quot;:700,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!oZpP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg 424w, https://substackcdn.com/image/fetch/$s_!oZpP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg 848w, https://substackcdn.com/image/fetch/$s_!oZpP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!oZpP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe3698852-ea43-43bd-b11b-bcbcdd2f25ae_2048x1143.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>OCR just got simpler thanks to Zerox OCR, a dead simple open-source solution for extracting text from documents for AI ingestion.</p><p>Documents are visual by nature, filled with tricky layouts, tables, and charts, making vision models the perfect fit. Zerox uses GPT-4o Mini to turn visual documents into characters, &#224; la OCR.</p><p>The process is straightforward:</p><ol><li><p>Feed in a PDF</p></li><li><p>PDF is converted into a series of images</p></li><li><p>Each image is sent to GPT, which is tasked to convert it into markdown format</p></li><li><p>The response of each image is aggregated into a cohesive Markdown file<br></p></li></ol><p>While it may sound basic, Zerox OCR with gpt-4o-mini is both cost-effective and delivers superior results compared to existing specialized solutions like AWS Textract, Google Document AI, and Azure Document AI.<br><br>Try it out <a href="https://github.com/getomni-ai/zerox">https://github.com/getomni-ai/zerox</a></p><div><hr></div><h1>4. Anthropic&#8217;s metaprompt</h1>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-september-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[September 2024 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[OpenAI&#8217;s new reasoning model and realtime API to power AI assistants, Qwen's and AI2&#8217;s new fully open-sourced state-of-the-art multimodal models, and speech2text with Whisper v3 Turbo]]></description><link>https://www.aitidbits.ai/p/september-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/september-2024</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 06 Oct 2024 15:01:01 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/0d1a603e-7ff6-4acd-8baf-38a04e7ddce0_600x338.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where we curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>September has been a busy month for everyone in the AI space. It has been packed with groundbreaking developments across various AI domains, from industry giants to open-source breakthroughs.</p><p>In the realm of large language models, we've seen significant strides from both industry leaders and open-source initiatives. OpenAI introduced its advanced o1-preview and o1-mini models, excelling in high-level reasoning for coding and math. Meanwhile, Alibaba released the impressive Qwen 2.5 family of open multilingual models, handling an expansive 128K tokens. Meta also made waves with Llama 3.2, featuring edge-optimized text models and their first large multimodal models.</p><p>Multimodal AI saw remarkable progress, with AI2's Molmo models rivaling and surpassing industry giants like GPT-4V and Gemini 1.5. Nvidia's NVLM 1.0 and Apple's MM1.5 further pushed the boundaries of vision-language reasoning and diverse task performance.</p><p>In the audio domain, OpenAI released Whisper Large v3 Turbo, a faster and more capable speech-to-text model, while Google developed a promising zero-shot Voice Transfer module for cross-lingual applications.</p><p>The image and video generation landscape continued to evolve, with Meta's Imagine Yourself technology enabling personalized image generation and advancements in text-to-video models like CogVideoX-5B pushing the boundaries of visual content creation.</p><p>This month's roundup features these breakthroughs and many more exciting updates across AI tools, research methodologies, and vision AI.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>Industry announcements (11 entries)</p></li><li><p>Large Language Models</p><ul><li><p>Open-source (11 entries)</p></li><li><p>Research (9 entries)</p></li></ul></li><li><p>Multimodal (7 entries)</p></li><li><p>Autonomous Agents (3 entries)</p></li><li><p>Image and Video (8 entries)</p></li><li><p>Audio (4 entries)</p></li><li><p>AI Tools (5 entries)</p></li><li><p>Open-source Packages (5 entries)</p></li></ul><h2><strong>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9abb755d-22e8-4143-a36e-ebd7281da003&quot;,&quot;caption&quot;:&quot;I&#8217;m excited to share a new Deep Dive after a short hiatus.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Great AI Consolidation&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-09-29T15:01:08.322Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/35bffcff-1d4f-4670-a10e-7af96da00945_2020x1406.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/the-great-ai-consolidation&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:149185115,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:37,&quot;comment_count&quot;:5,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8b8c380f-b395-412b-befd-700c624ec991&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Over ten papers outlining novel prompting techniques were published in the last few months alone. While our X and LinkedIn feeds buzz with countless secret prompting tips &#8220;97% of ChatGPT users don&#8217;t know about&#8221;, a definitive, research-backed guide aggregating these advanced prompting strategies is hard to come by. This gap prevents LLM developers and everyday users from harnessing these novel frameworks to enhance performance and achieve more accurate results.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Harnessing research-backed prompting techniques for enhanced LLM performance&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-10T16:00:41.722Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ccf1c5f-bca1-40ef-be43-2a7ec84c2f40_2014x1132.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/advanced-prompting&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139449913,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:33,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6d2c597a-cb50-46a7-959b-eb4b6ec7b269&quot;,&quot;caption&quot;:&quot;This is a re-post of my guest post in Artificial Intelligence Made Simple https://www.aitidbits.ai/cp/141205235 &#8212; I started my career in the cybersecurity space. Dancing the endless dance of deploying defense mechanisms only to be hijacked by a more brilliant attacker a few months later. Hacking language models and language-powered applications are no dif&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;[cross-post] 7 methods to secure LLM apps from prompt injections and jailbreaks&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-09T19:28:11.316Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b82f8cc-62e9-4032-9fb5-5b643a6624ee_2256x1260.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/mitigate-prompt-attacks&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141512513,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><h2><strong>Industry announcements</strong></h2><ol><li><p><a href="https://openai.com/index/introducing-openai-o1-preview/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI introduces o1-preview and o1-mini, advanced models that excel in high-level reasoning for coding and math, with o1-mini being a faster, cost-efficient option</a>&nbsp;</p></li><li><p><a href="https://openai.com/index/introducing-the-realtime-api/">As part of its DevDay event, OpenAI releases the Realtime API, allowing developers to create low-latency, voice-to-voice interactions with continuous audio streaming, along with new tools like vision fine-tuning, prompt caching, and model distillation</a></p></li><li><p><a href="https://x.com/OpenAI/status/1838642444365369814?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI releases Advanced Voice Mode for ChatGPT Plus and Team users, adding Custom Instructions, Memory, and five new voices</a></p></li><li><p><a href="https://github.com/anthropics/anthropic-quickstarts/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic releases the Quickstarts repo, providing ready-to-deploy app projects powered by its API, starting with a Claude-based customer support agent</a></p></li><li><p><a href="https://techcrunch.com/2024/09/25/meta-connect-2024-orion-glasses-quest-3s-headset-meta-ai-upgrades-ray-ban-meta-real-time-video-and-more-revealed/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">In its annual Connect conference, Meta announced multiple AI-related releases, including Ray-Ban Meta smart glasses with real-time AI video processing, AI-powered visual search for Instagram, translation and dubbing tools for creator content with lip sync, and Meta AI vocal responses across platforms with customizable celebrity voices</a></p></li><li><p><a href="https://techcrunch.com/2024/09/25/openais-chief-research-officer-has-left/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI's CTO, Mira Murati, along with other OpenAI execs to step down and leave the company</a></p></li><li><p><a href="https://x.com/pika_labs/status/1841143349576941863?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Pika Labs launches its new video generating model, featuring new "Pikaffects" that transform video subjects with surreal, physics-defying effects like melting and cake-ifying objects</a>&nbsp;</p></li><li><p><a href="https://developers.googleblog.com/en/updated-production-ready-gemini-models-reduced-15-pro-pricing-increased-rate-limits-and-more/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Google introduces new Gemini-1.5 Pro and Flash models, with faster response speeds, reduced prices (&gt;50%) and improved task performance</a></p></li><li><p><a href="https://deepgram.com/learn/introducing-ai-voice-agent-api?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Deepgram unveils its Voice Agent API, enabling natural, real-time human-machine conversations powered by high-performance speech recognition and synthesis models</a></p></li><li><p><a href="https://x.com/hume_ai/status/1833906262351974483?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Hume releases Empathic Voice Interface 2 (EVI 2) - a GPT-4o-like voice model, allowing users to converse with its AI chatbot with sub-second response times</a></p></li><li><p><a href="https://x.com/amasad/status/1831730911685308857?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Replit announces Replit Agent - an AI tool that automates software development tasks like environment setup and deployment</a></p></li></ol><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;2a8c4d9c-f678-4d3f-a0de-f37856f4d8f0&quot;,&quot;duration&quot;:null}"></div><p>&#128070; OpenAI&#8217;s Realtime API powering Speak&#8217;s language learning app</p><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs like Claude, Replicate, and Hugging Face. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h2><strong>Large Language Models</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://qwenlm.github.io/blog/qwen2.5/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Alibaba releases Qwen 2.5 - a family of open multilingual models handling 128K tokens and outperforming competitors like Mistral 2 (123B) on major benchmarks, offering multilingual support in 29 languages and specialized models across Math and coding</a></p></li><li><p><a href="https://ai.meta.com/blog/llama-3-2-connect-2024-vision-edge-mobile-devices/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Meta releases Llama 3.2 - a new family of open models featuring edge-optimized text models (1B and 3B) and Meta's first large multimodal models (11B and 90B) supporting 128K tokens</a></p></li><li><p><a href="https://x.com/kyutai_labs/status/1836427396959932492?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Kyutai Labs openly releases Moshi - a 7.6B open speech-to-speech model with cutting-edge performance and low latency, alongside Mimi, a SoTA streaming audio codec that compresses 24 kHz audio to 1.1 kbps for optimized real-time speech communication</a></p></li><li><p><a href="https://x.com/deepseek_ai/status/1832026579180163260?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">The DeepSeek team releases DeepSeek-V2.5 - a SOTA versatile open model integrating DeepSeek-Coder with advanced features like Function Calling and JSON output</a></p></li><li><p><a href="https://01-ai.github.io/blog.html?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">01.AI unveils Yi-Coder - a high-performing series of code LLMs with up to 9B parameters, excelling in long-context modeling and outperforming other bigger models</a></p></li><li><p><a href="https://arxiv.org/abs/2409.03420?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Alibaba proposes DocOwl2 - a state-of-the-art model for multi-page document understanding that reduces GPU usage and inference time by compressing high-resolution document images into 324 tokens</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/september-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Aug 2024]]></title><description><![CDATA[AI models for PDF extraction, Mistral's fine-tuning SDK, an open-source Perplexity clone, HippoRAG's 20% performance boost over RAG methods, and OpenAI's new evaluation library for benchmarking LLMs]]></description><link>https://www.aitidbits.ai/p/linkedin-august-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-august-2024</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 22 Sep 2024 15:02:20 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/821d18c2-ab85-43dd-8773-5ad2f7a110aa_746x360.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Introducing: AI Tidbits LinkedIn Highlights</em></p><p>Welcome to a new AI Tidbits series! Each month, I'll share my <strong>five top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry.</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs like Claude, Replicate, and Hugging Face. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h1>1. PDF Extract Kit</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7IVl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7IVl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7IVl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7IVl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7IVl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7IVl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg" width="1456" height="821" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:821,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!7IVl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg 424w, https://substackcdn.com/image/fetch/$s_!7IVl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg 848w, https://substackcdn.com/image/fetch/$s_!7IVl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!7IVl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F32289c54-6d73-4a9e-a765-e561f64d0123_2048x1155.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Extracting information from documents has been one of AI&#8217;s holy grails. A new open-source project deploys specialized AI models to tackle this challenge head-on.<br><br>PDF-Extract-Kit is a comprehensive pipeline that breaks down PDF content extraction into several components: </p><ol><li><p>Layout detection - leveraging LayoutLMv3 to precisely identify regions like images, tables, titles, and text</p></li><li><p>Table recognition - featuring StructEqTable for converting complex tables into LaTeX</p></li><li><p>OCR - utilizing PaddleOCR for high-performance text extraction in multiple languages</p></li><li><p>Formula detection - using YOLOv8 to accurately detect inline and isolated formulas</p></li><li><p>Formula recognition - employing UniMERNet to rival commercial software in formula recognition quality</p></li></ol><p><br>Trained on diverse datasets, these models handle various document types, from academic papers to financial reports.<br><br>GitHub repo <a href="https://github.com/opendatalab/PDF-Extract-Kit">https://github.com/opendatalab/PDF-Extract-Kit</a></p><div><hr></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;0ccab0b7-cd19-43cf-93bd-518e8c98b7e2&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go!&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;md&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Revolutionizing document processing with multimodal GPT&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-10-30T14:30:30.962Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4c326a-53e0-492d-b375-9c69899b8fcd_800x1032.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/doc-extraction-gpt4&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:138339915,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:18,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>2. Mistral fine-tuning API + SDK</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4Rm7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4Rm7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg 424w, https://substackcdn.com/image/fetch/$s_!4Rm7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg 848w, https://substackcdn.com/image/fetch/$s_!4Rm7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!4Rm7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4Rm7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg" width="1456" height="833" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:833,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!4Rm7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg 424w, https://substackcdn.com/image/fetch/$s_!4Rm7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg 848w, https://substackcdn.com/image/fetch/$s_!4Rm7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!4Rm7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F54381a60-02ba-4419-9d29-1f76b450ffec_2048x1171.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Mistral just dropped a game-changing fine-tuning API and SDK to help developers easily fine-tune Mistral variants on a single GPU. Clone, prep, train.<br><br>The SDK is a lightweight GitHub repository that leverages LoRA, allowing for memory-efficient training by freezing most model weights and only updating 1-2% with low-rank matrix perturbations. It's optimized for multi-GPU setups but can also be used with a single GPU for smaller models like the 7B.<br><br>To get started:</p><ol><li><p>Clone the repo and install dependencies</p></li><li><p>Download and prepare your model and data</p></li><li><p>Validate and start training with a few simple commands</p><p></p></li></ol><p>This repository is opinionated to simplify the finetuning process, focusing on Mistral models and specific hardware. It also includes a Colab notebook to hit the ground running.<br><br>Full details and setup instructions are in the GitHub repo <a href="https://github.com/mistralai/mistral-finetune">https://github.com/mistralai/mistral-finetune</a></p><div><hr></div><h2>3. Perplexica</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!swpk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!swpk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg 424w, https://substackcdn.com/image/fetch/$s_!swpk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg 848w, https://substackcdn.com/image/fetch/$s_!swpk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!swpk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!swpk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg" width="1456" height="836" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:836,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!swpk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg 424w, https://substackcdn.com/image/fetch/$s_!swpk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg 848w, https://substackcdn.com/image/fetch/$s_!swpk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!swpk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe570ac0d-49f4-4e98-ab64-36631ddf158a_2048x1176.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A new open-source project called Perplexica replicates the $2.5B startup Perplexity so developers can easily launch AI-powered search tools.<br><br>Perplexica is an open-source AI-powered search tool that dives deep into the internet to find precise answers. Perplexica not only searches the web but also understands your questions, delivering clear answers with cited sources.<br><br>It supports local models such as Llama3 and Mixtral for faster and cheaper inference and has six specialized modes tailored to answer specific types of questions:<br></p><ol><li><p>All Mode - searches the entire web for the best results</p></li><li><p>Writing Assistant Mode - assists with writing tasks without web searches</p></li><li><p>Academic Search Mode - ideal for finding articles and papers for academic research</p></li><li><p>YouTube Search Mode - finds YouTube videos based on search queries</p></li><li><p>Wolfram Alpha Search Mode - uses Wolfram Alpha for calculations and data analysis</p></li><li><p>Reddit Search Mode - searches Reddit for discussions and opinions<br></p></li></ol><p>Unlike other tools that use outdated data, Perplexica provides the latest information using a metasearch engine called SearxNG.</p><p>GitHub repo <a href="https://github.com/ItzCrazyKns/Perplexica">https://github.com/ItzCrazyKns/Perplexica</a></p><div><hr></div><h2>4. HippoRAG</h2>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-august-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[August 2024 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[Musk's xAI unveils Grok-2, OpenAI's new GPT-4o model, Microsoft's open-source Phi 3.5 series, Nvidia's Eagle multimodal LLMs, Black Forest Labs' FLUX.1 to flood the internet with uncensored images]]></description><link>https://www.aitidbits.ai/p/august-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/august-2024</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 08 Sep 2024 15:01:44 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!7l2n!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where we curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>August has been a month of remarkable progress across various AI domains, from industry giants to open-source breakthroughs.</p><p>Elon Musk's xAI made waves with the unveiling of Grok-2 and Grok-2 mini, showcasing advanced capabilities that rival top models. OpenAI continued to refine its offerings with a more efficient GPT-4o and the introduction of Structured Outputs. The open-source community saw significant advancements, with Microsoft's Phi 3.5 series and AI21's Jamba models pushing the boundaries of what's possible with freely available models.</p><p>In the realm of multimodal AI, Nvidia's Eagle and Alibaba's Qwen2-VL demonstrated impressive performance in visual understanding tasks. The image and video generation field saw major leaps with Black Forest Labs' FLUX.1 and Tsinghua University's CogVideoX-5B. Audio AI also made strides, with Qwen2-Audio enabling multilingual voice interaction and HuggingFace's Parler TTS v1 offering enhanced text-to-speech capabilities.</p><p>Perhaps most intriguingly, Sakana AI introduced The AI Scientist, a system that could revolutionize scientific research by automating idea generation, execution, and documentation.</p><p>These developments, along with many more exciting updates across language models, multimodal AI, and specialized applications, are part of this month's comprehensive roundup.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>Industry announcements (8 entries)</p></li><li><p>Large Language Models</p><ul><li><p>Open-source (8 entries)</p></li><li><p>Research (8 entries)</p></li></ul></li><li><p>Multimodal (11 entries)</p></li><li><p>Image and Video (10 entries)</p></li><li><p>Audio (6 entries)</p></li><li><p>Robotics (2 entries)</p></li><li><p>Open-source Packages (8 entries)</p></li></ul><h2><strong>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5a6ba41c-3c0b-4db2-997d-e9aebaaaf69f&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Builders Series - helping AI developers and researchers study and deploy the latest breakthroughs reliably and efficiently. Me: What language model do you use for your [enter task name here]? AI peer: GPT-4 Me: Why? I bet a smaller model will work while being cheaper and faster&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Top 8 leaderboards to choose the right AI model for your task&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-17T14:00:59.111Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b4f244a-7b00-4dc8-837f-fffce001ac83_2276x1270.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/leaderboards-for-choosing-best-model&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141513249,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:29,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8f45a6ff-c443-427c-858d-b8de835b4ace&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. While launching user-facing ML-powered applications has been around for more than a decade now, open-ended language models have only surged in popularity in the last 12 months. Given this nascency, best practices for managing cost, latency, and accuracy in LLM-powered applications are still being developed.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;12 techniques to reduce your LLM API bill and launch blazingly fast products&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits (http://aitidbits.ai).\n\nLinkedIn www.linkedin.com/in/sahar-mor\nTwitter www.twitter.com/theaievangelist&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-01-13T15:30:11.977Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ba0b76-623d-4130-941b-bb73aba699b7_2408x1344.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/reduce-llm-latency-and-cost&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140635380,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:46,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8b8c380f-b395-412b-befd-700c624ec991&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Over ten papers outlining novel prompting techniques were published in the last few months alone. While our X and LinkedIn feeds buzz with countless secret prompting tips &#8220;97% of ChatGPT users don&#8217;t know about&#8221;, a definitive, research-backed guide aggregating these advanced prompting strategies is hard to come by. This gap prevents LLM developers and everyday users from harnessing these novel frameworks to enhance performance and achieve more accurate results.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Harnessing research-backed prompting techniques for enhanced LLM performance&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-10T16:00:41.722Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ccf1c5f-bca1-40ef-be43-2a7ec84c2f40_2014x1132.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/advanced-prompting&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139449913,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:33,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6d2c597a-cb50-46a7-959b-eb4b6ec7b269&quot;,&quot;caption&quot;:&quot;This is a re-post of my guest post in Artificial Intelligence Made Simple https://www.aitidbits.ai/cp/141205235 &#8212; I started my career in the cybersecurity space. Dancing the endless dance of deploying defense mechanisms only to be hijacked by a more brilliant attacker a few months later. Hacking language models and language-powered applications are no dif&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;[cross-post] 7 methods to secure LLM apps from prompt injections and jailbreaks&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-09T19:28:11.316Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b82f8cc-62e9-4032-9fb5-5b643a6624ee_2256x1260.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/mitigate-prompt-attacks&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141512513,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs like Claude, Replicate, and Hugging Face. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h2><strong>Industry announcements</strong></h2><ol><li><p><a href="https://x.ai/blog/grok-2?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Elon Musk's xAI unveils Grok-2 and Grok-2 mini, showcasing advanced chat, coding, and reasoning capabilities that outperform top models like Claude 3.5 Sonnet and GPT-4 Turbo</a></p></li><li><p><a href="https://x.com/OpenAIDevs/status/1820987573793386527?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI releases a new GPT-4o version that is slightly better and 50% cheaper than the previous GPT-4o model</a></p></li><li><p><a href="https://openai.com/index/introducing-structured-outputs-in-the-api/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI releases Structured Outputs, ensuring AI-generated data conforms precisely to developer-supplied JSON schemas for enhanced reliability, achieving perfect accuracy with the new gpt-4o-2024-08-06 model</a></p></li><li><p><a href="https://www.anthropic.com/news/prompt-caching">Anthropic introduces prompt caching on its API, reducing costs and latency for large prompts by up to 90% and 85%, respectively, now in public beta for Claude 3.5 Sonnet and Claude 3 Haiku</a></p></li><li><p><a href="https://blog.google/products/gemini/made-by-google-gemini-ai-updates/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Google releases Gemini Live - a voice-interactive AI chatbot with enhanced emotional expression and real-time adaptive dialogue, similar to ChatGPT's new Advanced Voice Mode capability, offering hands-free and long-context conversational capabilities</a></p></li><li><p><a href="https://arxiv.org/abs/2408.07009?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">DeepMind releases Imagen 3 - a latent diffusion model that outperforms state-of-the-art models in generating high-quality images from text prompts, with built-in measures to enhance safety and representation</a></p></li><li><p><a href="https://openai.com/index/gpt-4o-fine-tuning/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI introduces fine-tuning for GPT-4o, allowing developers to tailor model responses and improve performance on domain-specific tasks like software engineering and text-to-SQL</a>&nbsp;</p></li><li><p><a href="https://x.com/ideogram_ai/status/1826277550798278804?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Ideogram introduces Ideogram 2.0 - a new version of its text-to-image model, outperforming DALL-E, Midjourney, and FLUX Pro with improved text accuracy and an API for developers</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7l2n!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7l2n!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif 424w, https://substackcdn.com/image/fetch/$s_!7l2n!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif 848w, https://substackcdn.com/image/fetch/$s_!7l2n!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif 1272w, https://substackcdn.com/image/fetch/$s_!7l2n!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7l2n!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif" width="522" height="522" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:600,&quot;width&quot;:600,&quot;resizeWidth&quot;:522,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;ssstwitter.com_1725662377237.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="ssstwitter.com_1725662377237.mp4 [optimize output image]" title="ssstwitter.com_1725662377237.mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!7l2n!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif 424w, https://substackcdn.com/image/fetch/$s_!7l2n!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif 848w, https://substackcdn.com/image/fetch/$s_!7l2n!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif 1272w, https://substackcdn.com/image/fetch/$s_!7l2n!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F53acc928-f670-4422-9672-d9bfeabffe01_600x600.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">xAI&#8217;s Grok-2 is in 2nd place on the Chatbot Arena Leaderboard, surpassing Claude and the previous GPT-4o version</figcaption></figure></div></li></ol><h2><strong><br>Large Language Models</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://huggingface.co/collections/microsoft/phi-3-6626e15e9585a200d2d761e3?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Microsoft releases three new open-source AI models in the Phi 3.5 series: Phi 3.5 mini-instruct, MoE-instruct, and vision-instruct models, offering scalable reasoning capabilities for commercial and scientific use across languages</a></p></li><li><p><a href="https://arxiv.org/abs/2409.02060?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">AI2 introduces OLMoE - a sparse Mixture-of-Experts language model that activates only 1B parameters per token, achieving state-of-the-art performance and outperforming larger models like Llama2-13B-Chat</a></p></li><li><p><a href="https://www.ai21.com/blog/announcing-jamba-model-family?utm_source=linkedin&amp;utm_medium=post&amp;utm_campaign=saharmor">AI21 releases Jamba Large and Jamba Mini - two new language models in its family of Mamba-Transformer models, featuring the longest context window for open models (256k) and rivaling state-of-the-art models like Llama 3.1 and Mistral Large</a></p></li><li><p><a href="https://arxiv.org/abs/2408.06941?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Researchers open source OpenResearcher - an AI-driven platform that integrates LLMs with domain-specific knowledge through Retrieval-Augmented Generation, enabling researchers to efficiently navigate and generate insights from scientific literature</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/august-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[July 2024 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[New language models from OpenAI, Mistral, and Google, Meta&#8217;s open release of its powerful Llama 3.1 models, Segment Anything v2 leapfrogs computer vision, and advanced RAG techniques]]></description><link>https://www.aitidbits.ai/p/july-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/july-2024</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 04 Aug 2024 17:01:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F145b23f8-f057-47b6-9db3-a3d17be07853_600x338.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where we curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>July was filled with remarkable progress across various AI domains, from LLMs to text-to-video models. But most of all, July was the month of open-source AI, with Meta's Llama 3.1 suite and Google's Gemma-2-2B pushing the boundaries of what's possible with freely available models, rivaling top proprietary LLM providers such as GPT-4o and Claude Sonnet.</p><p>OpenAI made waves with the release of GPT-4o mini, a cost-effective multimodal model, and the unveiling of SearchGPT, a potential Google Search competitor. Meanwhile, Mistral AI flexed its muscles with Mistral Large 2 and partnered with Mamba creators to release the impressive Codestral-Mamba 7B.</p><p>In the realm of research, DeepMind's AlphaProof and AlphaGeometry 2 showcased AI's growing prowess in advanced mathematics. The emergence of autonomous agents continued with OpenDevin, while image and video generation saw leaps forward with Meta's Segment Anything Model (SAM) 2 and Stability AI's Stable Video 4D.</p><p>These developments, along with many more exciting updates across language models, multimodal AI, and specialized applications, are part of this month's comprehensive roundup.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>Industry announcements (6 entries)</p></li><li><p>Large Language Models</p><ul><li><p>Open-source (14 entries)</p></li><li><p>Research (8 entries)</p></li></ul></li><li><p>Autonomous Agents (4 entries)</p></li><li><p>Multimodal (3 entries)</p></li><li><p>Image and Video (10 entries)</p></li><li><p>Audio (2 entries)</p></li><li><p>AI Tools (5 entries)</p></li><li><p>Open-source Packages (7 entries)</p></li></ul><h2><strong>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5a6ba41c-3c0b-4db2-997d-e9aebaaaf69f&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Builders Series - helping AI developers and researchers study and deploy the latest breakthroughs reliably and efficiently. Me: What language model do you use for your [enter task name here]? AI peer: GPT-4 Me: Why? I bet a smaller model will work while being cheaper and faster&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Top 8 leaderboards to choose the right AI model for your task&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-17T14:00:59.111Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b4f244a-7b00-4dc8-837f-fffce001ac83_2276x1270.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/leaderboards-for-choosing-best-model&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141513249,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:29,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8f45a6ff-c443-427c-858d-b8de835b4ace&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. While launching user-facing ML-powered applications has been around for more than a decade now, open-ended language models have only surged in popularity in the last 12 months. Given this nascency, best practices for managing cost, latency, and accuracy in LLM-powered applications are still being developed.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;12 techniques to reduce your LLM API bill and launch blazingly fast products&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits (http://aitidbits.ai).\n\nLinkedIn www.linkedin.com/in/sahar-mor\nTwitter www.twitter.com/theaievangelist&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-01-13T15:30:11.977Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ba0b76-623d-4130-941b-bb73aba699b7_2408x1344.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/reduce-llm-latency-and-cost&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140635380,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:46,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8b8c380f-b395-412b-befd-700c624ec991&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Over ten papers outlining novel prompting techniques were published in the last few months alone. While our X and LinkedIn feeds buzz with countless secret prompting tips &#8220;97% of ChatGPT users don&#8217;t know about&#8221;, a definitive, research-backed guide aggregating these advanced prompting strategies is hard to come by. This gap prevents LLM developers and everyday users from harnessing these novel frameworks to enhance performance and achieve more accurate results.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Harnessing research-backed prompting techniques for enhanced LLM performance&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-10T16:00:41.722Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ccf1c5f-bca1-40ef-be43-2a7ec84c2f40_2014x1132.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/advanced-prompting&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139449913,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:33,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6d2c597a-cb50-46a7-959b-eb4b6ec7b269&quot;,&quot;caption&quot;:&quot;This is a re-post of my guest post in Artificial Intelligence Made Simple https://www.aitidbits.ai/cp/141205235 &#8212; I started my career in the cybersecurity space. Dancing the endless dance of deploying defense mechanisms only to be hijacked by a more brilliant attacker a few months later. Hacking language models and language-powered applications are no dif&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;[cross-post] 7 methods to secure LLM apps from prompt injections and jailbreaks&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-09T19:28:11.316Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b82f8cc-62e9-4032-9fb5-5b643a6624ee_2256x1260.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/mitigate-prompt-attacks&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141512513,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9a8e936f-10d6-4914-8cef-329879ccbf9e&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a dedicated AI Tidbits section providing editorial takes and insights to make sense of the latest in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Most popular and upcoming Generative AI tools and APIs&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-19T15:30:19.597Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52307a3c-6727-4ca5-a4da-208969e7b833_1944x1090.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/most-used-tools&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139821359,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:18,&quot;comment_count&quot;:4,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs like Perplexity, Replicate, and Hugging Face. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h3><strong>Industry announcements</strong></h3><ol><li><p><a href="https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI releases GPT-4o mini - a cost-effective multimodal model, offering superior performance at 60% less cost than GPT-3.5 Turbo</a></p></li><li><p><a href="https://mistral.ai/news/mistral-large-2407/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral releases Mistral Large 2, boasting a 128k context window and superior performance in multiple languages and coding tasks</a></p></li><li><p><a href="https://openai.com/index/searchgpt-prototype/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI unveils SearchGPT - a Google Search competitor combining real-time web information with conversational abilities for precise and fast answers</a></p></li><li><p><a href="https://x.com/OpenAI/status/1818353584154796239?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI begins alpha testing Advanced Voice Mode - ChatGPT's new real-time and natural conversational AI demoed in OpenAI's April Spring Updates event</a></p></li><li><p><a href="https://x.com/AnthropicAI/status/1810747792807342395?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic releases a revamped developer console to help LLM users and developers optimize and evaluate prompts, including a side-by-side comparison of outputs</a></p></li><li><p><a href="https://odyssey.systems/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Odyssey, a new startup emerging from stealth, presents Hollywood-grade visual AI, offering filmmakers and artists detailed control over geometry, materials, lighting, and motion for high-end productions</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Lqgh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Lqgh!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!Lqgh!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!Lqgh!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!Lqgh!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Lqgh!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif" width="600" height="338" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84eec709-f6ef-44d8-bf52-337011933773_600x338.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:338,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;ssstwitter.com_1722364357402.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="ssstwitter.com_1722364357402.mp4 [optimize output image]" title="ssstwitter.com_1722364357402.mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!Lqgh!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!Lqgh!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!Lqgh!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!Lqgh!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84eec709-f6ef-44d8-bf52-337011933773_600x338.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">ChatGPT Advanced Voice Mode is starting to roll out to alpha users</figcaption></figure></div></li></ol><h2><strong>Large Language Models</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://llama.meta.com/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Meta unveils Llama 3.1 - an open suite of three language models (7B, 80B, 405B), with the largest one outperforming GPT-4o and Claude Sonnet on multiple benchmarks</a></p></li><li><p><a href="https://developers.googleblog.com/en/smaller-safer-more-transparent-advancing-responsible-ai-with-gemma">Google releases Gemma-2-2B - a commercially permissive compact 2 billion parameter model that outperforms models 20x its size, including Mixtral 8x7B, GPT 3.5, and Llama-2 70B</a></p></li><li><p><a href="https://mistral.ai/news/codestral-mamba/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral partners with the Mamba creators to release the commercially permissive Codestral-Mamba 7B - the strongest code model for its size, featuring a 256k context window</a></p></li><li><p><a href="https://mistral.ai/news/mathstral/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral releases Mathstral - an open model excelling in multi-step logical reasoning for STEM subjects, achieving state-of-the-art scores on key benchmarks</a>&nbsp;</p></li><li><p><a href="https://mistral.ai/news/mistral-nemo/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral AI releases Mistral NeMo - a powerful 12B open language model developed with Nvidia, featuring a 128k context window and outperforming competitors in reasoning and coding tasks</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/july-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[June 2024 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[Apple's AI revolution, Anthropic's powerful new LLM Sonnet 3.5, a new SOTA AI software engineer, hyperrealistic video generation, and Microsoft's commercially permissive vision models]]></description><link>https://www.aitidbits.ai/p/june-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/june-2024</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 30 Jun 2024 14:30:51 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffaf8852e-f9a2-4dfd-8a5c-f9b3fa8b0b9c_600x390.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where we curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>As we step into summer, the AI landscape continues moving with groundbreaking innovations and exciting developments. June has been a month of significant strides across various AI domains, from industry giants to open-source breakthroughs.</p><p>Apple's Worldwide Developers Conference (WWDC) took center stage, unveiling a suite of AI features that promise to revolutionize the user experience across Apple devices. Meanwhile, the language model arena saw remarkable advancements, with Anthropic's Claude 3.5 Sonnet pushing the boundaries of performance, a new software agent that scored 19% on SWE-bench, and a new state-of-the-art version of DeepSeek-Coder. </p><p>In video generation, companies like Runway and Luma AI challenge the status quo with their hyperrealistic video creation tools.</p><p>This month's roundup also spotlights impressive progress in multimodal AI, with Microsoft openly releasing Florence-2, a commercially permissive state-of-the-art small vision model family, and EPFL and Apple's new training approach setting new benchmarks for multimodal AI.</p><p>In addition to these highlights, June&#8217;s roundup features novel LLM techniques (e.g. Mixture-of-Agents), promising open-source projects (e.g. Open Interpreter), and a host of other developments in autonomous agents and multimodal AI.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>&#10024; <strong>Special feature</strong>: Apple Worldwide Developers Conference (WWDC)</p></li><li><p>Industry announcements (9 entries)</p></li><li><p>Large Language Models</p><ul><li><p>Open-source (10 entries)</p></li><li><p>Research (10 entries)</p></li></ul></li><li><p>Autonomous Agents (4 entries)</p></li><li><p>Multimodal (4 entries)</p></li><li><p>Image and Video (8 entries)</p></li><li><p>Audio (3 entries)</p></li><li><p>Open-source Packages (6 entries)</p></li></ul><h2><strong>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5a6ba41c-3c0b-4db2-997d-e9aebaaaf69f&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Builders Series - helping AI developers and researchers study and deploy the latest breakthroughs reliably and efficiently. Me: What language model do you use for your [enter task name here]? AI peer: GPT-4 Me: Why? I bet a smaller model will work while being cheaper and faster&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Top 8 leaderboards to choose the right AI model for your task&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-17T14:00:59.111Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b4f244a-7b00-4dc8-837f-fffce001ac83_2276x1270.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/leaderboards-for-choosing-best-model&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141513249,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:29,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8f45a6ff-c443-427c-858d-b8de835b4ace&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. While launching user-facing ML-powered applications has been around for more than a decade now, open-ended language models have only surged in popularity in the last 12 months. Given this nascency, best practices for managing cost, latency, and accuracy in LLM-powered applications are still being developed.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;12 techniques to reduce your LLM API bill and launch blazingly fast products&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits (http://aitidbits.ai).\n\nLinkedIn www.linkedin.com/in/sahar-mor\nTwitter www.twitter.com/theaievangelist&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-01-13T15:30:11.977Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ba0b76-623d-4130-941b-bb73aba699b7_2408x1344.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/reduce-llm-latency-and-cost&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140635380,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:46,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8b8c380f-b395-412b-befd-700c624ec991&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Over ten papers outlining novel prompting techniques were published in the last few months alone. While our X and LinkedIn feeds buzz with countless secret prompting tips &#8220;97% of ChatGPT users don&#8217;t know about&#8221;, a definitive, research-backed guide aggregating these advanced prompting strategies is hard to come by. This gap prevents LLM developers and everyday users from harnessing these novel frameworks to enhance performance and achieve more accurate results.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Harnessing research-backed prompting techniques for enhanced LLM performance&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-10T16:00:41.722Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ccf1c5f-bca1-40ef-be43-2a7ec84c2f40_2014x1132.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/advanced-prompting&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139449913,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:33,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6d2c597a-cb50-46a7-959b-eb4b6ec7b269&quot;,&quot;caption&quot;:&quot;This is a re-post of my guest post in Artificial Intelligence Made Simple https://www.aitidbits.ai/cp/141205235 &#8212; I started my career in the cybersecurity space. Dancing the endless dance of deploying defense mechanisms only to be hijacked by a more brilliant attacker a few months later. Hacking language models and language-powered applications are no dif&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;[cross-post] 7 methods to secure LLM apps from prompt injections and jailbreaks&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-09T19:28:11.316Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b82f8cc-62e9-4032-9fb5-5b643a6624ee_2256x1260.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/mitigate-prompt-attacks&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141512513,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9a8e936f-10d6-4914-8cef-329879ccbf9e&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a dedicated AI Tidbits section providing editorial takes and insights to make sense of the latest in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Most popular and upcoming Generative AI tools and APIs&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-19T15:30:19.597Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52307a3c-6727-4ca5-a4da-208969e7b833_1944x1090.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/most-used-tools&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139821359,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:18,&quot;comment_count&quot;:4,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs like Perplexity, Replicate, and Hugging Face. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h3><strong>&#10024;Special feature: Apple Worldwide Developers Conference (WWDC)</strong></h3><ul><li><p><strong>Apple Intelligence -</strong>&nbsp;a suite of new AI features for iPhone, Mac, and other Apple devices. This includes a more conversational Siri, custom AI-generated "Genmoji," and integration with OpenAI's GPT-4o. Apple&#8217;s on-device models excel in specific tasks using an adapter strategy, with on-device models outperforming larger models in summarizing and composing text.</p></li><li><p><strong>Enhanced Siri capabilities</strong>&nbsp;- Siri will gain new abilities such as managing notifications, writing and summarizing text, and carrying out actions across multiple apps. Users can interact with Siri through voice or typing.</p></li><li><p><strong>Genmoji and Image Playground</strong>&nbsp;- Apple is launching Genmoji to create emoji-like reactions on demand and Image Playground for AI-generated images. These features will be integrated into various apps, including Photos, which will have improved search and editing capabilities similar to Google's Magic Eraser.</p></li><li><p><strong>OpenAI integration</strong>&nbsp;- Siri will leverage ChatGPT, powered by GPT-4o, for complex requests, ensuring user permission before sending data. ChatGPT will be available across iOS, macOS, and iPadOS, supporting AI writing and image generation tools.</p></li></ul><p>&#8212;&gt;&nbsp;<a href="https://www.youtube.com/watch?v=sBXdyUA6A88">Apple WWDC 2024 keynote in 18 minutes</a></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;149eaf49-30ba-4dfb-936a-56d78ada1bd0&quot;,&quot;duration&quot;:null}"></div><h3><strong>Industry announcements</strong></h3><ol><li><p><a href="https://www.anthropic.com/news/claude-3-5-sonnet?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic releases Claude 3.5 Sonnet - a model with a 200k token context window, outperforming GPT-4o and featuring a new dynamic Artifacts workspace</a></p></li><li><p><a href="https://www.etched.com/announcing-etched?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">A new startup called Etched is trying to take on Nvidia by presenting the world&#8217;s first specialized chip for Transformers, delivering over 500,000 tokens per second and claims to be &gt;10x faster and cheaper than NVIDIA&#8217;s next-generation Blackwell</a></p></li><li><p><a href="https://www.factory.ai/news/code-droid-technical-report?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Factory emerges out of stealth to automate software engineering by modeling the cognitive processes of developers, achieving top performance on SWE-bench (19.27% compared to Devin's 13.86%)</a></p></li><li><p><a href="https://x.com/ssi/status/1803472825476587910?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Ilya Sutskever, OpenAI's former chief scientist, starts Safe Superintelligence Inc. - a new company to build safe superintelligence</a></p></li><li><p><a href="https://runwayml.com/blog/introducing-gen-3-alpha/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Runway releases Gen-3 Alpha - an AI model generating high-quality, hyperrealistic videos with expressive human characters and smooth transitions</a>&nbsp;</p></li><li><p><a href="https://venturebeat.com/ai/what-you-need-to-know-about-kling-the-ai-video-generator-rival-to-sora-thats-wowing-creators/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">A Chinese company launches a Sora competitor called Kling - an AI model that generates realistic video clips from text prompts using advanced 3D AI techniques</a></p></li><li><p><a href="https://lumalabs.ai/dream-machine?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Luma AI debuts Dream Machine - an OpenAI Sora-like tool enabling users to create realistic videos from text prompts in just two minutes</a></p></li><li><p><a href="https://elevenlabs.io/app/sound-effects?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">ElevenLabs releases Sound Effects, turning text into rich sounds</a></p></li><li><p><a href="https://mistral.ai/news/customization/">Mistral introduces model customization on its platform, enabling efficient fine-tuning of AI models to meet specific user needs with reduced costs and expertise</a></p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Yom7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Yom7!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif 424w, https://substackcdn.com/image/fetch/$s_!Yom7!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif 848w, https://substackcdn.com/image/fetch/$s_!Yom7!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif 1272w, https://substackcdn.com/image/fetch/$s_!Yom7!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Yom7!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif" width="600" height="337" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:337,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [speed output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [speed output image]" title="temp.mov [speed output image]" srcset="https://substackcdn.com/image/fetch/$s_!Yom7!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif 424w, https://substackcdn.com/image/fetch/$s_!Yom7!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif 848w, https://substackcdn.com/image/fetch/$s_!Yom7!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif 1272w, https://substackcdn.com/image/fetch/$s_!Yom7!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbd45a10b-803e-42d2-8e78-1cc0f8416997_600x337.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Claude Artifacts generates and executes code live</figcaption></figure></div><h2><strong>Large Language Models</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://huggingface.co/collections/microsoft/florence-6669f44df0d87d9c3bfb76de?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Microsoft releases Florence-2 - a commercially permissive state-of-the-art small vision model family (200M, 800M params) that outperforms larger specialized models in tasks like image description and object recognition</a></p></li><li><p><a href="https://arxiv.org/abs/2406.11931?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">DeepSeek-AI releases DeepSeek-Coder-V2 - an open-source language model supporting 338 programming languages, beating top commercial models like GPT-4 Turbo in code generation and mathematics</a></p></li><li><p><a href="https://changes.openinterpreter.com/log/local-iii?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Open Interpreter introduces Local III - a suite of tools for running powerful language models locally to control your personal computer, enhancing control and privacy</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/june-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[May 2024 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[A 2M context window from Google, an any-to-any model from OpenAI, new open models for document understanding, Mistral&#8217;s first coding LLM, and a sneak peek into LM&#8217;s brain from Anthropic]]></description><link>https://www.aitidbits.ai/p/may-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/may-2024</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 02 Jun 2024 15:00:16 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b82d0cd3-6160-4bd0-b450-6f2b4e1d5c83_600x501.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where we curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>Welcome to the May edition of AI Tidbits Monthly, where we unravel the latest and greatest in AI. This month has been particularly eventful, with major updates from industry leaders such as Google, OpenAI, and Microsoft.</p><p>In its<strong> Spring Updates</strong> event, OpenAI introduced GPT-4o, a multimodal model processing text, vision, and audio with real-time emotion recognition and adaptive speech responses. They expanded the free tier to include ChatGPT Plus features and announced a new Mac app, with a Windows version coming soon.</p><p><strong>Google I/O</strong> was also filled with announcements, including a Gemini 1.5 Pro with featuring a 2M-token context window and Gemini 1.5 Flash, optimized for speed and cost-efficiency. Google unveiled Project Astra, a real-time multimodal AI assistant, and Veo, a long-form video generator, and announced the Trillium chip (TPU v6) for AI datacenters.</p><p>Lastly, Microsoft held its annual developers conference, <strong>Microsoft Build</strong>, introducing Copilot+ PCs, AI-optimized devices with advanced silicon and all-day battery life. They launched agents as part of Copilot to complete tasks autonomously, unveiled Phi-3 Vision and Phi Silica models, and announced AI-powered real-time video translation for the Edge browser.</p><p>In addition to these highlights, May's roundup includes groundbreaking advancements in language models (a new coding model from Mistral!), research, and open-source projects.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>&#10024; <strong>Special feature</strong>: AI updates from Google, OpenAI, and Microsoft</p></li><li><p>Large Language Models</p><ul><li><p>Open-source (10 entries)</p></li><li><p>Research (8 entries)</p></li></ul></li><li><p>Autonomous Agents (2 entries)</p></li><li><p>Multimodal (5 entries)</p></li><li><p>Image and Video (5 entries)</p></li><li><p>Audio (4 entries)</p></li><li><p>Open-source Packages (6 entries)</p></li></ul><h2><strong>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5a6ba41c-3c0b-4db2-997d-e9aebaaaf69f&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Builders Series - helping AI developers and researchers study and deploy the latest breakthroughs reliably and efficiently. Me: What language model do you use for your [enter task name here]? AI peer: GPT-4 Me: Why? I bet a smaller model will work while being cheaper and faster&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Top 8 leaderboards to choose the right AI model for your task&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-17T14:00:59.111Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b4f244a-7b00-4dc8-837f-fffce001ac83_2276x1270.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/leaderboards-for-choosing-best-model&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141513249,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:29,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6d2c597a-cb50-46a7-959b-eb4b6ec7b269&quot;,&quot;caption&quot;:&quot;This is a re-post of my guest post in Artificial Intelligence Made Simple https://www.aitidbits.ai/cp/141205235 &#8212; I started my career in the cybersecurity space. Dancing the endless dance of deploying defense mechanisms only to be hijacked by a more brilliant attacker a few months later. Hacking language models and language-powered applications are no dif&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;[cross-post] 7 methods to secure LLM apps from prompt injections and jailbreaks&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-09T19:28:11.316Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b82f8cc-62e9-4032-9fb5-5b643a6624ee_2256x1260.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/mitigate-prompt-attacks&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141512513,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8f45a6ff-c443-427c-858d-b8de835b4ace&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. While launching user-facing ML-powered applications has been around for more than a decade now, open-ended language models have only surged in popularity in the last 12 months. Given this nascency, best practices for managing cost, latency, and accuracy in LLM-powered applications are still being developed.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;12 techniques to reduce your LLM API bill and launch blazingly fast products&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits (http://aitidbits.ai).\n\nLinkedIn www.linkedin.com/in/sahar-mor\nTwitter www.twitter.com/theaievangelist&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-01-13T15:30:11.977Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ba0b76-623d-4130-941b-bb73aba699b7_2408x1344.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/reduce-llm-latency-and-cost&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140635380,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:46,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8b8c380f-b395-412b-befd-700c624ec991&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Over ten papers outlining novel prompting techniques were published in the last few months alone. While our X and LinkedIn feeds buzz with countless secret prompting tips &#8220;97% of ChatGPT users don&#8217;t know about&#8221;, a definitive, research-backed guide aggregating these advanced prompting strategies is hard to come by. This gap prevents LLM developers and everyday users from harnessing these novel frameworks to enhance performance and achieve more accurate results.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Harnessing research-backed prompting techniques for enhanced LLM performance&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-10T16:00:41.722Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ccf1c5f-bca1-40ef-be43-2a7ec84c2f40_2014x1132.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/advanced-prompting&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139449913,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:33,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9a8e936f-10d6-4914-8cef-329879ccbf9e&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a dedicated AI Tidbits section providing editorial takes and insights to make sense of the latest in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Most popular and upcoming Generative AI tools and APIs&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-19T15:30:19.597Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52307a3c-6727-4ca5-a4da-208969e7b833_1944x1090.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/most-used-tools&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139821359,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:18,&quot;comment_count&quot;:4,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs like Perplexity, Replicate, and Hugging Face. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h2>&#10024;<strong> Special feature: AI updates from Google, OpenAI, and Microsoft</strong></h2><h3><strong>OpenAI Spring Updates</strong></h3><p>OpenAI introduced GPT-4o, a cutting-edge multimodal model that processes text, vision, and audio, offering superior speed and cost-efficiency compared to GPT-4 Turbo. Key enhancements include real-time emotion recognition and adaptive speech responses, inspired by the movie "Her." The new voice assistant features real-time translation, facial expression reading, and dynamic voice adaptation, significantly improving interactivity. OpenAI expanded its free tier, providing features previously exclusive to ChatGPT Plus users, and limited access to GPT-4o. Additionally, a new desktop app for Mac was announced, with a Windows version coming soon, and potential integration with Apple devices is on the horizon.</p><p>&#8212;&gt; More&nbsp;<a href="https://openai.com/index/hello-gpt-4o/">here</a></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;cd50c206-b982-4f05-a622-b23da727784e&quot;,&quot;duration&quot;:null}"></div><h3><strong>Google I/O</strong></h3><p>At Google I/O 2024, Google unveiled Gemini 1.5 Pro, boasting an expanded context window of up to two million tokens, and Gemini 1.5 Flash, optimized for speed and cost-efficiency. New projects include Project Astra, a real-time multimodal AI assistant, and Veo, a long-form video generator. AI capabilities are being integrated across Google's ecosystem, enhancing Search, Gmail, Google Photos, and Android. The Trillium chip (TPU v6) was introduced, designed for AI datacenters to enhance processing power and energy efficiency, along with Gemini Nano, bringing on-device multimodal capabilities to Pixel devices.</p><p>&#8212;&gt; More&nbsp;<a href="https://blog.google/inside-google/message-ceo/google-io-2024-keynote-sundar-pichai/#gemini-era">here</a></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;52d52f25-6e8e-430c-b488-0c6770e8e887&quot;,&quot;duration&quot;:null}"></div><h3><strong>Microsoft Build</strong></h3><p>Microsoft introduced Copilot+ PCs, a new category of AI-optimized Windows devices with advanced silicon and all-day battery life. The company expanded Copilot AI agents to handle autonomous tasks, with new capabilities launching in Copilot Studio. Phi-3 Vision, a compact multimodal AI model, and Phi Silica, a local language model optimized for Copilot+ PCs, were unveiled. Additionally, the Edge browser will soon feature AI-powered real-time video translation, enabling multilingual video accessibility and enhancing global communication.</p><p>&#8212;&gt; More&nbsp;<a href="https://www.theverge.com/24161636/microsoft-build-2024-ai-copilot-windows-teams-edge">here</a></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;f310498b-1825-4506-853c-c2f1b52677ca&quot;,&quot;duration&quot;:null}"></div><h2><strong>Large Language Models (LLMs)</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://azure.microsoft.com/en-us/blog/new-models-added-to-the-phi-3-family-available-on-microsoft-azure/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Microsoft open sources new Phi-3 models, including a 7B, 14b, and a new multimodal variant with vision capabilities featuring a 128k context window</a></p></li><li><p><a href="https://mistral.ai/news/codestral/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral releases Codestral - a 32k context window coding model supporting over 80 programming languages and setting a new standard in performance and latency</a></p></li><li><p><a href="https://huggingface.co/mistralai/Mistral-7B-v0.3?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral releases 7B v0.3 - a new version of its small and powerful open language model, with function calling support</a>&nbsp;</p></li><li><p><a href="https://developers.googleblog.com/en/gemma-family-and-toolkit-expansion-io-2024/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Google releases Gemma 2 - featuring a new, efficient architecture that offers class-leading performance with fewer parameters and lower deployment costs, optimized for diverse hardware</a></p></li><li><p><a href="https://developers.googleblog.com/en/gemma-family-and-toolkit-expansion-io-2024/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Google releases PaliGemma - a state-of-the-art open vision-language model that can perform deeper analysis of images and provide useful insights, such as captioning for images and short videos, object detection, and reading text embedded within images</a></p></li><li><p><a href="https://arxiv.org/abs/2405.01535?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Researchers open-source Prometheus 2 - a groundbreaking, cost-effective language model evaluator, achieving unmatched performance in aligning with human assessments</a></p></li><li><p><a href="https://x.com/CohereForAI/status/1793643648703168807?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Cohere open sources Aya 23 - a new family of multilingual LLMs (8B, 35B) that supports 23 languages and outperforms existing models in various linguistic tasks</a></p></li><li><p><a href="https://arxiv.org/abs/2405.04324?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">IBM open sources for commercial use the Granite Code models - a series of code LLMs optimized for a wide range of coding tasks, from code generation to repository maintenance, achieving state-of-the-art performance in software development tasks</a></p></li><li><p><a href="https://arxiv.org/abs/2405.14906?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Researchers unveil AutoCoder - the first coding model to outperform GPT-4 Turbo and GPT-4o on the Human Eval benchmark, achieving a pass@1 score of 90.9% and featuring an enhanced code interpreter with the capability to install external packages</a></p></li><li><p><a href="https://tiger-ai-lab.github.io/MAmmoTH2/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">CMU showcases MAmmoTH2 - a novel LLM that significantly boosts reasoning performance by using web-extracted instruction data, setting new standards in efficiency and effectiveness</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lbjN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lbjN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp 424w, https://substackcdn.com/image/fetch/$s_!lbjN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp 848w, https://substackcdn.com/image/fetch/$s_!lbjN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp 1272w, https://substackcdn.com/image/fetch/$s_!lbjN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lbjN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp" width="1456" height="622" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:622,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;graphical user interface&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="graphical user interface" title="graphical user interface" srcset="https://substackcdn.com/image/fetch/$s_!lbjN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp 424w, https://substackcdn.com/image/fetch/$s_!lbjN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp 848w, https://substackcdn.com/image/fetch/$s_!lbjN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp 1272w, https://substackcdn.com/image/fetch/$s_!lbjN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad95c26e-aed3-45d2-9321-c3059fa2a4b6_2282x975.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Microsoft&#8217;s Phi-3 is a suite of small powerful language models</figcaption></figure></div></li></ol><h3><strong>Research</strong></h3><ol><li><p><a href="https://www.anthropic.com/research/mapping-mind-language-model?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic releases a detailed study peeking into LLMs' brains, showcasing how millions of concepts such as gender, conversational styles, and political views are represented, revealing the first-ever detailed look inside a modern LLM</a></p></li><li><p><a href="https://console.anthropic.com/dashboard?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic releases a new tool that turns task descriptions into production-ready optimized prompts</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/may-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[March 2024 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[Devin AI revolutionizes software engineering, Claude 3 sets new benchmarks, Mistral Large rivals GPT-4, crucial LLM security insights, and the first open-source Mamba-based LLM supporting 256k tokens]]></description><link>https://www.aitidbits.ai/p/march-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/march-2024</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 31 Mar 2024 15:00:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02dd0f0f-3e1a-4a86-a2f4-42739ef4fb0f_800x450.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where we curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>Welcome to the March edition of AI Tidbits Monthly, where we uncover the latest and greatest in AI. This month has been filled with groundbreaking announcements from industry leaders and exciting progress in open-source AI, showcasing the rapid advancements in the field.</p><p>March saw the release of Cognition's Devin AI, the world's first autonomous AI software engineer, and Anthropic's Claude 3, setting new industry benchmarks across various domains. Mistral AI also introduced Mistral Large, a top-tier model rivaling GPT-4, now available on Azure through a new partnership with Microsoft.</p><p>In the realm of LLM security, research from UIUC, Cornell, DeepMind, and ETH Zurich highlighted potential cybersecurity risks and the urgent need for improved security measures against adversarial attacks.</p><p>Open-source initiatives continued to thrive, with AI21's Jamba, Sakana AI's Evolutionary Model Merge, xAI's Grok-1, and Apple's MM-1, showcasing novel approaches and achieving state-of-the-art results with enhanced efficiency.</p><p>Image and video generation also saw significant advancements, with Stability AI's Stable Diffusion 3, Alibaba's EMO framework for life-like portrait animation, and the introduction of YOLOv9 and GELAN for improved object detection.</p><p>Lastly, AI agents took center stage with DeepMind's Genie, transforming images into interactive 2D worlds, and SIMA, a versatile AI capable of following natural-language instructions across different video game environments.</p><p>These and many more exciting updates across various AI domains are part of this month's roundup.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>Industry Announcements (6 entries)</p></li><li><p><strong>&#10024; Special feature: LLMs Security and Safety (4 entries)</strong></p></li><li><p>Large Language Models</p><ul><li><p>Open-source (11 entries)</p></li><li><p>Research (6 entries)</p></li></ul></li><li><p>Autonomous Agents (4 entries)</p></li><li><p>Image and Video (12 entries)</p></li><li><p>Audio (2 entries)</p></li><li><p>Multimodal (3 entries)</p></li><li><p>Open-source Packages (7 entries)</p></li><li><p>AI tools (2 entries)</p></li></ul><h2><strong>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5a6ba41c-3c0b-4db2-997d-e9aebaaaf69f&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Builders Series - helping AI developers and researchers study and deploy the latest breakthroughs reliably and efficiently. Me: What language model do you use for your [enter task name here]? AI peer: GPT-4 Me: Why? I bet a smaller model will work while being cheaper and faster&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Top 8 leaderboards to choose the right AI model for your task&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-17T14:00:59.111Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b4f244a-7b00-4dc8-837f-fffce001ac83_2276x1270.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/leaderboards-for-choosing-best-model&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141513249,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:29,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6d2c597a-cb50-46a7-959b-eb4b6ec7b269&quot;,&quot;caption&quot;:&quot;This is a re-post of my guest post in Artificial Intelligence Made Simple https://www.aitidbits.ai/cp/141205235 &#8212; I started my career in the cybersecurity space. Dancing the endless dance of deploying defense mechanisms only to be hijacked by a more brilliant attacker a few months later. Hacking language models and language-powered applications are no dif&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;[cross-post] 7 methods to secure LLM apps from prompt injections and jailbreaks&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-09T19:28:11.316Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b82f8cc-62e9-4032-9fb5-5b643a6624ee_2256x1260.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/mitigate-prompt-attacks&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141512513,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8f45a6ff-c443-427c-858d-b8de835b4ace&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. While launching user-facing ML-powered applications has been around for more than a decade now, open-ended language models have only surged in popularity in the last 12 months. Given this nascency, best practices for managing cost, latency, and accuracy in LLM-powered applications are still being developed.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;12 techniques to reduce your LLM API bill and launch blazingly fast products&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits (http://aitidbits.ai).\n\nLinkedIn www.linkedin.com/in/sahar-mor\nTwitter www.twitter.com/theaievangelist&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-01-13T15:30:11.977Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ba0b76-623d-4130-941b-bb73aba699b7_2408x1344.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/reduce-llm-latency-and-cost&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140635380,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:46,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8b8c380f-b395-412b-befd-700c624ec991&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Over ten papers outlining novel prompting techniques were published in the last few months alone. While our X and LinkedIn feeds buzz with countless secret prompting tips &#8220;97% of ChatGPT users don&#8217;t know about&#8221;, a definitive, research-backed guide aggregating these advanced prompting strategies is hard to come by. This gap prevents LLM developers and everyday users from harnessing these novel frameworks to enhance performance and achieve more accurate results.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Harnessing research-backed prompting techniques for enhanced LLM performance&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-10T16:00:41.722Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ccf1c5f-bca1-40ef-be43-2a7ec84c2f40_2014x1132.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/advanced-prompting&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139449913,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:33,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9a8e936f-10d6-4914-8cef-329879ccbf9e&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a dedicated AI Tidbits section providing editorial takes and insights to make sense of the latest in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Most popular and upcoming Generative AI tools and APIs&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-19T15:30:19.597Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52307a3c-6727-4ca5-a4da-208969e7b833_1944x1090.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/most-used-tools&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139821359,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:18,&quot;comment_count&quot;:4,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><h2><strong>Industry announcements</strong></h2><ol><li><p><a href="https://twitter.com/cognition_labs/status/1767548763134964000?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Cognition releases Devin AI - the world's first autonomous AI software engineer, excelling in complex tasks and learning from feedback, outperforming in real-world coding benchmarks</a></p></li><li><p><a href="https://www.anthropic.com/news/claude-3-family?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic announces Claude 3 - three state-of-the-art language models, setting new industry benchmarks across reasoning, math, coding, multilingual understanding, and vision</a></p></li><li><p><a href="https://mistral.ai/news/mistral-large/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral AI releases Mistral Large - a top-tier model that rivals GPT-4 with advanced multilingual reasoning and competitive pricing, now available on Azure as part of a new partnership with Microsoft</a></p></li><li><p><a href="https://about.ideogram.ai/1.0?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Ideogram releases Ideogram 1.0 - a text-to-image model excelling in text rendering and photorealism</a></p></li><li><p><a href="https://twitter.com/adcock_brett/status/1767913955295744449?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Figure partners with OpenAI to enhance its humanoid robot, Figure 01, showcasing human-like communication and reasoning in an unprecedented demo</a></p></li><li><p><a href="https://www.theverge.com/2024/3/18/24105157/nvidia-blackwell-gpu-b200-ai?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Nvidia unveils Blackwell at GTC 2024 - its next generation and the world's most powerful AI superchip</a></p></li></ol><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;0436199d-cb92-43d9-8041-0b5e519ea63c&quot;,&quot;duration&quot;:null}"></div><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs like Perplexity, Replicate, and Hugging Face. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h2><strong>&#10024; Special feature: LLMs Security and Safety</strong></h2><ol><li><p><a href="https://arxiv.org/abs/2402.06664?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">UIUC shows that GPT-4 can autonomously hack websites, performing advanced tasks like SQL injections and finding vulnerabilities, highlighting potential cybersecurity risks</a></p></li><li><p><a href="https://arxiv.org/abs/2403.06634?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">DeepMind and ETH Zurich introduce a novel attack capable of extracting detailed information from black-box language models, determining the exact hidden dimensions of notable models like OpenAI's ChatGPT for under $20</a></p></li><li><p><a href="https://sites.google.com/view/compromptmized?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Cornell unveils Morris II - a computer worm targeting GenAI systems, demonstrating the urgent need for improved security against adversarial self-replicating prompts</a></p></li><li><p><a href="https://scale.com/blog/measuring-mitigating-risk-wmdp?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Scale and the Center for AI Safety release the WMDP benchmark - a safety evaluation for LLMs to gauge their knowledge in biosecurity, chemical security, and cybersecurity</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Bu-a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Bu-a!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png 424w, https://substackcdn.com/image/fetch/$s_!Bu-a!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png 848w, https://substackcdn.com/image/fetch/$s_!Bu-a!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png 1272w, https://substackcdn.com/image/fetch/$s_!Bu-a!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Bu-a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png" width="956" height="444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/df65196e-6048-4069-bc9f-1f11d815f588_956x444.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:444,&quot;width&quot;:956,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Bu-a!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png 424w, https://substackcdn.com/image/fetch/$s_!Bu-a!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png 848w, https://substackcdn.com/image/fetch/$s_!Bu-a!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png 1272w, https://substackcdn.com/image/fetch/$s_!Bu-a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf65196e-6048-4069-bc9f-1f11d815f588_956x444.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">UIUC successfully leverages GPT-4 to automatically hack websites</figcaption></figure></div></li></ol><h2><strong>Large Language Models (LLMs)</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://www.ai21.com/blog/announcing-jamba">AI21 open-sources Jamba - a pioneering Mamba SSM-Transformer model, enhancing AI performance with a 256K context window and tripled throughput on long contexts</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/march-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[February 2024 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[New language models that push the envelope from Anthropic and Google, OpenAI's foray into video generation, new AI models to solve math problems, and models to navigate mobile apps autonomously]]></description><link>https://www.aitidbits.ai/p/february-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/february-2024</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sat, 09 Mar 2024 16:00:20 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!00Pr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where we curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>Welcome to the February edition of AI Tidbits Monthly, where we unravel the latest and greatest in AI. February continued January&#8217;s strong momentum for commercial and open-source AI across modalities.</p><p>On the commercial LLMs front, Google released Gemini 1.5, supporting a groundbreaking context window of 10M tokens. Anthropic released Claude 3, a suite of powerful language models with image understanding capabilities and performance that outperform GPT-4. Mistral launched its largest and most powerful model to date, Mistral Large.</p><p>Open-source language models experienced a step change in performance, with Google&#8217;s Gemma, Abacus&#8217; Smaug, and Qwen 1.5&#8212;all demonstrating GPT-3.5-level performance with a commercially permissive license.</p><p>Nonetheless, February&#8217;s biggest announcement was OpenAI&#8217;s new text-to-video model, Sora, which produces Hollywood-grade one-minute videos. Alibaba unveiled a remarkable new framework designed to bring portraits to life with incredibly realistic expressions and accurate lip-syncing. Lastly, Google released a pioneering tool that turns any image into an interactive 2D game.</p><p>These breakthroughs, along with many more across speech, video, multimodal AI, and autonomous agents, are featured in this month&#8217;s roundup.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>Industry announcements (7 entries)</p></li><li><p><strong>&#10024; Special feature: Speech recognition and text-to-speech AI (5 entries)</strong></p></li><li><p>Large Language Models</p><ul><li><p>Open-source (15 entries)</p></li><li><p>Research (9 entries)</p></li></ul></li><li><p>Autonomous Agents (5 entries)</p></li><li><p>Image and Video (13 entries)</p></li><li><p>Audio (3 entries)</p></li><li><p>Multimodal (5 entries)</p></li><li><p>Robotics (4 entries)</p></li><li><p>Open-source packages (5 entries)</p></li><li><p>AI tools (5 entries)</p></li></ul><h2><strong>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5a6ba41c-3c0b-4db2-997d-e9aebaaaf69f&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Builders Series - helping AI developers and researchers study and deploy the latest breakthroughs reliably and efficiently. Me: What language model do you use for your [enter task name here]? AI peer: GPT-4 Me: Why? I bet a smaller model will work while being cheaper and faster&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Top 8 leaderboards to choose the right AI model for your task&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-17T14:00:59.111Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3b4f244a-7b00-4dc8-837f-fffce001ac83_2276x1270.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/leaderboards-for-choosing-best-model&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141513249,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:29,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6d2c597a-cb50-46a7-959b-eb4b6ec7b269&quot;,&quot;caption&quot;:&quot;This is a re-post of my guest post in Artificial Intelligence Made Simple https://www.aitidbits.ai/cp/141205235 &#8212; I started my career in the cybersecurity space. Dancing the endless dance of deploying defense mechanisms only to be hijacked by a more brilliant attacker a few months later. Hacking language models and language-powered applications are no dif&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;[cross-post] 7 methods to secure LLM apps from prompt injections and jailbreaks&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-09T19:28:11.316Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b82f8cc-62e9-4032-9fb5-5b643a6624ee_2256x1260.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/mitigate-prompt-attacks&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141512513,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8f45a6ff-c443-427c-858d-b8de835b4ace&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. While launching user-facing ML-powered applications has been around for more than a decade now, open-ended language models have only surged in popularity in the last 12 months. Given this nascency, best practices for managing cost, latency, and accuracy in LLM-powered applications are still being developed.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;12 techniques to reduce your LLM API bill and launch blazingly fast products&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits (http://aitidbits.ai).\n\nLinkedIn www.linkedin.com/in/sahar-mor\nTwitter www.twitter.com/theaievangelist&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-01-13T15:30:11.977Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ba0b76-623d-4130-941b-bb73aba699b7_2408x1344.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/reduce-llm-latency-and-cost&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140635380,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:46,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8b8c380f-b395-412b-befd-700c624ec991&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Over ten papers outlining novel prompting techniques were published in the last few months alone. While our X and LinkedIn feeds buzz with countless secret prompting tips &#8220;97% of ChatGPT users don&#8217;t know about&#8221;, a definitive, research-backed guide aggregating these advanced prompting strategies is hard to come by. This gap prevents LLM developers and everyday users from harnessing these novel frameworks to enhance performance and achieve more accurate results.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Harnessing research-backed prompting techniques for enhanced LLM performance&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-10T16:00:41.722Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ccf1c5f-bca1-40ef-be43-2a7ec84c2f40_2014x1132.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/advanced-prompting&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139449913,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:33,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9a8e936f-10d6-4914-8cef-329879ccbf9e&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a dedicated AI Tidbits section providing editorial takes and insights to make sense of the latest in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Most popular and upcoming Generative AI tools and APIs&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-19T15:30:19.597Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52307a3c-6727-4ca5-a4da-208969e7b833_1944x1090.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/most-used-tools&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139821359,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:18,&quot;comment_count&quot;:4,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><h2><strong>Industry announcements</strong></h2><ol><li><p><a href="https://openai.com/sora?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI unveils Sora - a groundbreaking text-to-video model that creates realistic videos up to a minute long from text prompts</a></p></li><li><p><a href="https://www.anthropic.com/news/claude-3-family?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic announces Claude 3 - three state-of-the-art language models, setting new industry benchmarks across reasoning, math, coding, multilingual understanding, and vision</a></p></li><li><p><a href="https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Google releases Gemini 1.5, featuring a groundbreaking 10M context window for superior performance across multiple modalities with reduced compute</a></p></li><li><p><a href="https://mistral.ai/news/mistral-large/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral AI releases Mistral Large - a top-tier model that rivals GPT-4 with advanced multilingual reasoning and competitive pricing, now available on Azure as part of a new partnership with Microsoft</a></p></li><li><p><a href="https://stability.ai/news/stable-diffusion-3?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Stability AI unveils Stable Diffusion 3, featuring enhanced multi-subject and advanced text prompt handling</a></p></li><li><p><a href="https://venturebeat.com/ai/stability-ai-launches-svd-1-1-a-diffusion-model-for-more-consistent-ai-videos/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Stability AI launches SVD 1.1 - a text-to-video model optimized for better motion and consistency</a></p></li><li><p><a href="https://about.ideogram.ai/1.0?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Ideogram releases Ideogram 1.0 - a text-to-image model excelling in text rendering and photorealism</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!00Pr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!00Pr!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!00Pr!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!00Pr!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!00Pr!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!00Pr!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif" width="600" height="338" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:338,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;ssstwitter.com_1708588021070.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="ssstwitter.com_1708588021070.mp4 [optimize output image]" title="ssstwitter.com_1708588021070.mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!00Pr!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!00Pr!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!00Pr!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!00Pr!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190484e6-3f3a-4d35-be7f-81668fe34b92_600x338.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">OpenAI&#8217;s novel text-to-video system, Sora</figcaption></figure></div></li></ol><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs like Perplexity, Replicate, and Hugging Face. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h2><strong>&#10024; Special feature: Speech recognition and text-to-speech AI</strong></h2><ol><li><p><a href="https://twitter.com/metavoiceio/status/1754983953193218193?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">MetaVoice open sources a commercially permissive 1B base model for text-to-speech, supporting voice cloning and emotional speech synthesis</a></p></li><li><p><a href="https://amazon-ltts-paper.com/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Amazon unveils Base TTS - the largest text-to-speech model trained on 100K hours of speech, achieving unprecedented naturalness in speech synthesis with novel tokenization</a></p></li><li><p><a href="https://audioflamingo.github.io?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Nvidia presents Audio Flamingo - a novel audio language model that improves LLMs' abilities to understand audio</a></p></li><li><p><a href="https://nvidia.github.io/NeMo/blogs/2024/2024-02-canary/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Nvidia releases Canary 1 - a state-of-the-art automatic speech recognition and speech translation model leading the Open ASR Leaderboard across four languages</a></p></li><li><p><a href="https://nvidia.github.io/NeMo/blogs/2024/2024-01-parakeet-tdt/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Nvidia introduces Parakeet-TDT - revolutionizing speech recognition with unparalleled accuracy and 64% faster processing speed compared to previous models</a></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;0385aa5d-fc72-4396-814e-29a16f489eb7&quot;,&quot;duration&quot;:null}"></div></li></ol><h2><strong>Large Language Models (LLMs)</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://ai.google.dev/gemma?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Google open sources Gemma - a suite of small language models (7B, 12B) that outperforms Llama 2 and Mistral 7B, permitting commercial use</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/february-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[January 2024 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[Generative models that turn text and images into high-quality videos, cheaper GPT, a powerful non-transformer language model, and novel open-source multimodal AI that understand documents and images]]></description><link>https://www.aitidbits.ai/p/january-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/january-2024</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 04 Feb 2024 16:00:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb662a168-480e-427d-9e8f-c8c88e325c1f_600x317.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where we curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>Welcome to the first monthly edition of AI Tidbits Monthly, where we unravel the latest and greatest in AI. January has kicked off 2024 with a strong start and breakthroughs across generative AI modalities, from text to video and audio.</p><p>More significantly, January was the month of image and video generation. Google unveiled Lumiere, its new text-to-video generative model, TikTok open-sourced a new state-of-the-art depth estimation model, and an image synthesis tool called InstantID took the internet by storm.</p><p>We also saw substantial progress in multimodal AI with the release of LLaVA 1.6 and Qwen-VL-Max, two commercially permissive models capable of understanding images and reading documents.</p><p>On the open-source front, Microsoft allowed the commercial use of its powerful and small Phi-2 model, Meta open-sourced the 70B version of its powerful coding language model Code Llama, and a non-transformer model rivaled large transformer-based SOTA models while being cheaper and faster.</p><p>This roundup includes these and many other exciting updates across generative audio, autonomous agents, useful AI-powered tools, and open-source repositories are part of this month&#8217;s roundup.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>Industry announcements (6 entries)</p></li><li><p>Large Language Models</p><ul><li><p>Open-source (10 entries)</p></li><li><p>Research (16 entries)</p></li></ul></li><li><p>Autonomous Agents (2 entries)</p></li><li><p>Image and Video (12 entries)</p></li><li><p>Audio (4 entries)</p></li><li><p>Multimodal (5 entries)</p></li><li><p>Open-source packages (5 entries)</p></li><li><p>AI tools (7 entries)</p></li></ul><h2><strong>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6d2c597a-cb50-46a7-959b-eb4b6ec7b269&quot;,&quot;caption&quot;:&quot;This is a re-post of my guest post in Artificial Intelligence Made Simple https://www.aitidbits.ai/cp/141205235 &#8212; I started my career in the cybersecurity space. Dancing the endless dance of deploying defense mechanisms only to be hijacked by a more brilliant attacker a few months later. Hacking language models and language-powered applications are no dif&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;[cross-post] 7 methods to secure LLM apps from prompt injections and jailbreaks&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-02-09T19:28:11.316Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9b82f8cc-62e9-4032-9fb5-5b643a6624ee_2256x1260.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/mitigate-prompt-attacks&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:141512513,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:0,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8f45a6ff-c443-427c-858d-b8de835b4ace&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. While launching user-facing ML-powered applications has been around for more than a decade now, open-ended language models have only surged in popularity in the last 12 months. Given this nascency, best practices for managing cost, latency, and accuracy in LLM-powered applications are still being developed.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;12 techniques to reduce your LLM API bill and launch blazingly fast products&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits (http://aitidbits.ai).\n\nLinkedIn www.linkedin.com/in/sahar-mor\nTwitter www.twitter.com/theaievangelist&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-01-13T15:30:11.977Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02ba0b76-623d-4130-941b-bb73aba699b7_2408x1344.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/reduce-llm-latency-and-cost&quot;,&quot;section_name&quot;:&quot;AI Builders Series&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140635380,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:46,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;acd2b690-4992-4ce4-a059-9a9b4a82aa1a&quot;,&quot;caption&quot;:&quot;Note: \&quot;SOTA\&quot; stands for state-of-the-art, referring to the most advanced and effective models currently available in the field. Exactly a year ago, ChatGPT was one month old, Anthropic just released Claude, and Microsoft unveiled the first zero-shot model to clone someone&#8217;s voice. Long before Google Bard&#8217;s debut, Stanford&#8217;s inaugural autonomous agents pa&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;AI Tidbits 2023 SOTA Report&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-28T16:00:54.496Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/22e5f043-5e0c-41b3-b685-b5c6c62806d1_2008x1130.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/2023-sota-report&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140124891,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:30,&quot;comment_count&quot;:4,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9a8e936f-10d6-4914-8cef-329879ccbf9e&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a dedicated AI Tidbits section providing editorial takes and insights to make sense of the latest in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Most popular and upcoming Generative AI tools and APIs&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-19T15:30:19.597Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52307a3c-6727-4ca5-a4da-208969e7b833_1944x1090.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/most-used-tools&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139821359,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:18,&quot;comment_count&quot;:4,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8b8c380f-b395-412b-befd-700c624ec991&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Over ten papers outlining novel prompting techniques were published in the last few months alone. While our X and LinkedIn feeds buzz with countless secret prompting tips &#8220;97% of ChatGPT users don&#8217;t know about&#8221;, a definitive, research-backed guide aggregating these advanced prompting strategies is hard to come by. This gap prevents LLM developers and everyday users from harnessing these novel frameworks to enhance performance and achieve more accurate results.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Harnessing research-backed prompting techniques for enhanced LLM performance&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-10T16:00:41.722Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ccf1c5f-bca1-40ef-be43-2a7ec84c2f40_2014x1132.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/advanced-prompting&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139449913,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:33,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><h2><strong>Industry announcements</strong></h2><ol><li><p><a href="https://openai.com/blog/new-embedding-models-and-api-updates?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI announces new embedding models with superior performance and affordability, alongside reduced pricing for GPT-3.5 Turbo and an improved GPT-4 Turbo preview addressing response "laziness"</a></p></li><li><p><a href="https://openai.com/blog/introducing-the-gpt-store?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI launches its GPT Store, its platform for sharing and monetizing custom GPTs</a></p></li><li><p><a href="https://techcrunch.com/2024/01/09/can-a-striking-design-set-rabbits-r1-pocket-ai-apart-from-a-gaggle-of-virtual-assistants/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Rabbit introduces R1 - a $199 standalone AI device featuring voice control and a Large Action Model for universal application control</a></p></li><li><p><a href="https://techcrunch.com/2024/01/15/microsoft-launches-a-pro-plan-for-copilot/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Microsoft launches a Pro plan for its Copilot chatbot, infusing its Microsoft 365 suite of apps (Outlook, Word, Excel, etc.) with ChatGPT-like capabilities</a></p></li><li><p><a href="https://www.theverge.com/2024/1/18/24042354/mark-zuckerberg-meta-agi-reorg-interview?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Meta's Mark Zuckerberg announces the development of AGI and Llama 3, integrating FAIR and GenAI teams, and a significant GPU expansion to advance AI capabilities</a></p></li><li><p><a href="https://stability.ai/news/introducing-stable-lm-2?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Stability AI introduces Stable LM 2 1.6B - an efficient multilingual language model, setting a new benchmark in small-scale LMs</a></p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8z0P!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8z0P!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif 424w, https://substackcdn.com/image/fetch/$s_!8z0P!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif 848w, https://substackcdn.com/image/fetch/$s_!8z0P!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif 1272w, https://substackcdn.com/image/fetch/$s_!8z0P!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8z0P!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif" width="600" height="300" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:300,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!8z0P!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif 424w, https://substackcdn.com/image/fetch/$s_!8z0P!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif 848w, https://substackcdn.com/image/fetch/$s_!8z0P!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif 1272w, https://substackcdn.com/image/fetch/$s_!8z0P!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9bb33c47-0510-4678-ae40-42fb7601e95e_600x300.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Microsoft&#8217;s Copilots are now everywhere</figcaption></figure></div><h2><strong>Large Language Models (LLMs)</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://twitter.com/SebastienBubeck/status/1743519400626643359?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Microsoft changes the license for its small open-source powerful language model phi-2, allowing commercial use</a></p></li><li><p><a href="https://www.theverge.com/2024/1/29/24055011/meta-llama2-code-generator-generative-ai?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Meta open-sources Code Llama 70B - a coding language model outperforming GPT-4 in complex coding tasks, available for commercial use</a></p></li><li><p><a href="https://qwenlm.github.io/blog/qwen-vl/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Alibaba open sources Qwen-VL-Max - a large vision language model outperforming all previous open-source models and performing on par with Gemini Ultra and GPT-4V</a></p></li><li><p><a href="https://twitter.com/bindureddy/status/1752092619373793614?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Abacus AI releases SMAUG, the world-leading 30B open-source LLM, with a top MMLU score of 76%, inching towards GPT-4 performance</a></p></li><li><p><a href="https://blog.rwkv.com/p/eagle-7b-soaring-past-transformers?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">RWKV releases Eagle 7B, a non-transformer multilingual language model rivaling larger models in performance while being faster and cheaper, available for open commercial use</a>&nbsp;</p></li><li><p><a href="https://huggingface.co/vikhyatk/moondream1?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Moondream1 - a tiny 1.6B parameter vision language model that punches above its weight</a></p></li><li><p><a href="https://arxiv.org/abs/2401.02385?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Researchers open source TinyLlama - a 1.1B parameter language model, showcasing superior performance over similar-sized models</a></p></li><li><p><a href="https://arxiv.org/abs/2401.12246?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Researchers release Orion-14B - a collection of state-of-the-art multilingual LLMs achieving superior performance in diverse tasks</a></p></li><li><p><a href="https://arxiv.org/abs/2401.10774?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Researchers release Medusa-2 - combining parallel token predictions and tree-based attention to achieve up to 3.6x faster inference without compromising quality or accuracy</a></p></li><li><p><a href="https://huggingface.co/spaces/hallucinations-leaderboard/leaderboard?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">A new leaderboard on Hugging Face to track and evaluate hallucinations in open-source language models</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xoY7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xoY7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png 424w, https://substackcdn.com/image/fetch/$s_!xoY7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png 848w, https://substackcdn.com/image/fetch/$s_!xoY7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png 1272w, https://substackcdn.com/image/fetch/$s_!xoY7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xoY7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png" width="1456" height="367" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:367,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xoY7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png 424w, https://substackcdn.com/image/fetch/$s_!xoY7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png 848w, https://substackcdn.com/image/fetch/$s_!xoY7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png 1272w, https://substackcdn.com/image/fetch/$s_!xoY7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa0a3245c-603e-4a59-a2a6-2e1446aa18c1_1920x484.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Qwen-VL-Max is better at recognizing, extracting, and analyzing details within images and texts</figcaption></figure></div></li></ol><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h3><strong>Research</strong></h3><ol><li><p><a href="https://arxiv.org/abs/2401.04088?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral published the Mixtral MoE 8x7B paper, shedding light on its architecture and training process</a></p></li><li><p><a href="https://arxiv.org/abs/2401.05566?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic uncovers that LLMs can learn and retain deceptive behaviors, with standard safety techniques proving ineffective</a></p></li><li><p><a href="https://arxiv.org/abs/2401.12070?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">University of Maryland introduces Binoculars - a 90% accurate detection method for distinguishing LLM-generated text, outperforming existing tools with minimal false positives</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/january-2024">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[December 2023 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[Google&#8217;s long-awaited Gemini, Apple's first major strides into generative AI, a state-of-the-art transcription model outperforming Whisper, and an autonomous AI that operates smartphone apps for you]]></description><link>https://www.aitidbits.ai/p/december-2023</link><guid isPermaLink="false">https://www.aitidbits.ai/p/december-2023</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sat, 06 Jan 2024 16:00:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F265943ce-cf85-4670-a785-b80d17536cd5_600x338.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the <strong>monthly curated round-up</strong>, where I curate the firehose of AI research papers and tools so you won&#8217;t have to. If you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><div><hr></div><p>Welcome to the December edition of AI Tidbits Monthly, where we unravel the latest and greatest in AI. December provided an exciting finale to a year filled with innovative breakthroughs and groundbreaking research.</p><p>This December, Google debuted its long-awaited large multimodal AI, Gemini, incorporating it into its Bard chatbot and providing API access. Apple also made its first major strides into generative AI with its efficient on-device inference framework, an open-source multimodal model named Ferret, and a new robust Apple silicon framework for enhanced ML efficiency.</p><p>On the open-source front, Mistral released a fully open-source Mixture of Experts model that outperforms GPT-3.5 and Llama 70B. Deci released a state-of-the-art 7B base model, and Microsoft introduced a coding LLM, CodeOcean, that beats the current SOTA open and closed LLMs on coding tasks.</p><p>Also on the open-source front, though for speech understanding and generation, Nvidia released Parakeet, a speech-to-text model that outperforms Whisper v3. Additional noteworthy developments include the unveiling of <strong>OpenVoice</strong>, a novel voice cloning technology, and Amphion, an extensive toolkit dedicated to generating audio, music, and speech.</p><p>These and many more exciting updates across novel promoting frameworks, autonomous agents, multimodal AI, and open-source repositories are part of this month&#8217;s roundup.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>Industry announcements (6 entries)</p></li><li><p>Large Language Models</p><ul><li><p>Open-source (9 entries)</p></li><li><p>Prompting techniques (3 entries)</p></li><li><p>Research (8 entries)</p></li></ul></li><li><p>Autonomous Agents (3 entries)</p></li><li><p>Image and Video (8 entries)</p></li><li><p>Audio (5 entries)</p></li><li><p>Multimodal (5 entries)</p></li><li><p>Open-source packages (4 entries)</p></li></ul><h2><strong>Recent Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;acd2b690-4992-4ce4-a059-9a9b4a82aa1a&quot;,&quot;caption&quot;:&quot;Note: \&quot;SOTA\&quot; stands for state-of-the-art, referring to the most advanced and effective models currently available in the field. Exactly a year ago, ChatGPT was one month old, Anthropic just released Claude, and Microsoft unveiled the first zero-shot model to clone someone&#8217;s voice. Long before Google Bard&#8217;s debut, Stanford&#8217;s inaugural autonomous agents pa&#8230;&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;AI Tidbits 2023 SOTA Report&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-28T16:00:54.496Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/22e5f043-5e0c-41b3-b685-b5c6c62806d1_2008x1130.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/2023-sota-report&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:140124891,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:30,&quot;comment_count&quot;:4,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9a8e936f-10d6-4914-8cef-329879ccbf9e&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a dedicated AI Tidbits section providing editorial takes and insights to make sense of the latest in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Most popular and upcoming Generative AI tools and APIs&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-19T15:30:19.597Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52307a3c-6727-4ca5-a4da-208969e7b833_1944x1090.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/most-used-tools&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139821359,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:18,&quot;comment_count&quot;:4,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;8b8c380f-b395-412b-befd-700c624ec991&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Over ten papers outlining novel prompting techniques were published in the last few months alone. While our X and LinkedIn feeds buzz with countless secret prompting tips &#8220;97% of ChatGPT users don&#8217;t know about&#8221;, a definitive, research-backed guide aggregating these advanced prompting strategies is hard to come by. This gap prevents LLM developers and everyday users from harnessing these novel frameworks to enhance performance and achieve more accurate results.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Harnessing research-backed prompting techniques for enhanced LLM performance&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-12-10T16:00:41.722Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ccf1c5f-bca1-40ef-be43-2a7ec84c2f40_2014x1132.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/advanced-prompting&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:139449913,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:33,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;56767746-10eb-4b6f-9aa5-6ecc7b2a5503&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go! In June 2020, OpenAI unveiled GPT-3. As a veteran in the document processing domain, I had long recognized the limitations of prevailing document extraction technologies, which largely relied on rigid, rule-based logic. I wondered if language models could be the answer to intelligent data extraction. And indeed, they were.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Revolutionizing document processing with multimodal GPT&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-10-30T14:30:30.962Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4c326a-53e0-492d-b375-9c69899b8fcd_800x1032.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/doc-extraction-gpt4&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:138339915,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:10,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><h2><strong>Industry announcements</strong></h2><ol><li><p><a href="https://blog.google/technology/ai/google-gemini-ai/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Google debuts Gemini - its large multimodal AI coming in three sizes: Nano, optimized for mobile devices and offering offline functionality; Pro, now powering Google's Bard and designed for a wide range of AI services; and Ultra, the most powerful version, targeting data centers and enterprise applications, set to release in Q1 &#8217;24</a></p></li><li><p><a href="https://mistral.ai/news/la-plateforme/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">With &#8364;385M in new funding, Mistral AI debuts a new platform featuring its Mixtral MoE model, becoming a direct competitor of OpenAI, Google, and Anthropic</a></p></li><li><p><a href="https://deepmind.google/technologies/imagen-2/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">DeepMind releases Imagen 2 - its next-generation text-to-image model that competes with DALL-E 3 and Midjourney</a></p></li><li><p><a href="https://imagine.meta.com/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Meta launches a standalone AI-powered image generator</a></p></li><li><p><a href="https://twitter.com/nickfloats/status/1734706803097710667?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Midjourney breaks out of Discord and launches an alpha web-based image generation platform, offering an enhanced interface</a></p></li><li><p><a href="https://www.theverge.com/2023/12/27/24016212/new-york-times-openai-microsoft-lawsuit-copyright-infringement?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">The New York Times files a billion-dollar lawsuit against OpenAI and Microsoft, accusing them of copyright infringement for using its articles in training ChatGPT and Copilot</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ToOU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ToOU!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!ToOU!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!ToOU!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!ToOU!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ToOU!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif" width="600" height="338" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb4665be-e130-405c-8c60-b042e5920f46_600x338.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:338,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ToOU!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!ToOU!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!ToOU!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!ToOU!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feb4665be-e130-405c-8c60-b042e5920f46_600x338.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Google&#8217;s Gemini demo</figcaption></figure></div></li></ol><h2><strong>Large Language Models (LLMs)</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://mistral.ai/news/mixtral-of-experts/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Mistral AI releases a commercially permissive Mixture of Experts model, featuring eight experts with 7B parameters each, outperforming models like GPT-3.5 and Llama 2 70B</a>&nbsp;</p></li><li><p><a href="https://arxiv.org/abs/2312.14187?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Microsoft publishes CodeOcean and WaveCoder, outperforming existing models in code-related tasks by generating and leveraging high-quality instruction data</a></p></li><li><p><a href="https://arxiv.org/abs/2401.00368?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Microsoft open sources E5 mistral-7b - a groundbreaking text embedding approach, leveraging synthetic data from LLMs and efficient training to achieve top performance on major benchmarks</a></p></li><li><p><a href="https://twitter.com/huybery/status/1730127387109781932?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Alibaba releases Qwen-72B, a 32K context window LLM, and Qwen-1.8B, an efficient AI model requiring only 3GB GPU memory</a></p></li><li><p><a href="https://arxiv.org/abs/2311.16867?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">TII open-sources its family of Falcon LLMs, led by Falcon-180B, and publishes a paper outlining detailed evaluations and training methods used to develop Falcon</a></p></li><li><p><a href="https://deci.ai/blog/introducing-decilm-7b-the-fastest-and-most-accurate-7b-large-language-model-to-date?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">DeciAI open sources DeciLM 7B - the fastest and most cost-effective 7B pretrained model, topping the OpenLLM Leaderboard</a></p></li><li><p><a href="https://arxiv.org/abs/2312.15166?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Korean AI startup Upstage introduces SOLAR 10.7B, leveraging Depth Up-Scaling to surpass larger models in natural language tasks and achieve top rankings in the HF Open Leaderboard without the complexities of MoE scaling</a></p></li><li><p><a href="https://ai.meta.com/llama/purple-llama/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Meta releases Purple Llama - a project encompassing CyberSec Eval and Llama Guard to enhance generative AI safety and trust</a></p></li><li><p><a href="https://www.theverge.com/2023/12/6/23990678/apple-foundation-models-generative-ai-mlx?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Apple open-sources MLX - a robust framework for Apple silicon featuring user-friendly APIs and advanced computational features for enhanced ML efficiency</a>&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_lx8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_lx8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png 424w, https://substackcdn.com/image/fetch/$s_!_lx8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png 848w, https://substackcdn.com/image/fetch/$s_!_lx8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png 1272w, https://substackcdn.com/image/fetch/$s_!_lx8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_lx8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png" width="696" height="371.8078602620087" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:734,&quot;width&quot;:1374,&quot;resizeWidth&quot;:696,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!_lx8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png 424w, https://substackcdn.com/image/fetch/$s_!_lx8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png 848w, https://substackcdn.com/image/fetch/$s_!_lx8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png 1272w, https://substackcdn.com/image/fetch/$s_!_lx8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c2c95aa-6da2-425c-8df3-508da8b1d22e_1374x734.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Microsoft&#8217;s WaveCoder outperforms previous state-of-the-art LLMs across programming languages</figcaption></figure></div></li></ol><pre><code><code>Become a premium member to get full access to my content and $1k in free credits for leading AI tools and APIs. It&#8217;s common to expense the paid membership from your company&#8217;s learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h3><strong>Prompting techniques</strong></h3><ol><li><p><a href="https://platform.openai.com/docs/guides/prompt-engineering/strategy-test-changes-systematically?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI releases a prompt engineering guide to improve LLM performance</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/december-2023">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[November 2023 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[Open-source LLMs outperforming GPT-4 and the first 200k tokens LLM, Stability AI&#8217;s and Pika Labs&#8217; revolutionary text-to-video models, and prompting GPT-4 to outperform Med-PaLM 2 on medical tasks]]></description><link>https://www.aitidbits.ai/p/november-2023-ai-tidbits-monthly</link><guid isPermaLink="false">https://www.aitidbits.ai/p/november-2023-ai-tidbits-monthly</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 03 Dec 2023 16:00:34 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff8e84d73-fc0e-467f-8310-e16c52cbbd41_400x401.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to a&nbsp;subscriber-only edition<strong> </strong></em><strong>&#128274;</strong><em><strong>&nbsp;</strong>of AI Tidbits, where I curate the firehose of AI research papers and tools so you won&#8217;t have to. </em></p><p><em>This is the <strong>monthly curated round-up</strong>, so if you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><p><em>Upgrade to premium to access monthly AI round-ups and deep dives into key topics like <a href="https://www.aitidbits.ai/p/openai-devday">OpenAI's DevDay</a> and <a href="https://www.aitidbits.ai/p/the-rise-of-autonomous-agents">autonomous agents</a>.</em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Become a premium member&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.aitidbits.ai/subscribe"><span>Become a premium member</span></a></p><div><hr></div><p>Welcome to the November edition of AI Tidbits, where we unravel the latest and greatest in AI. November was filled with groundbreaking announcements from industry leaders and exciting progress in open-source AI.</p><p>This month, OpenAI announced GPTs, Assistants, and GPT-4 Turbo, followed by Anthropic's new release of Claude 2.1 and Perplexity's novel language models having access to online information. On the open source front, Yi open-sourced a 200k context window model, and Berkeley unveiled Starling 7B which has achieved performance on par with GPT-4, thanks to RLHF.</p><p>In the arenas of generative image and video, there has been a surge of remarkable advancements. Pika unveiled a groundbreaking text-to-video model, pushing the boundaries of visual content creation. Stability AI introduced Stable Diffusion XL, revolutionizing the generation of high-quality images at remarkable speeds, and launched Stable Video Diffusion, marking their foray into video generation.</p><p>These and many more exciting updates across novel promoting frameworks, autonomous agents, multimodal AI, and open-source repositories are part of this month&#8217;s roundup.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>Industry announcements (12 entries)</p></li><li><p>Large Language Models</p><ul><li><p>Open-source (5 entries)</p></li><li><p>Research (8 entries)</p></li><li><p><strong>&#10024; Special feature: Prompting techniques (6 entries) </strong>-<strong> </strong><em>Next Sunday, a new AI Tidbits Deep Dive will be sent out, detailing research-backed advanced prompting strategies to boost the performance of LLMs</em></p></li></ul></li><li><p>Autonomous Agents (5 entries)</p></li><li><p>Image and Video (9 entries)</p></li><li><p>Audio (3 entries)</p></li><li><p>Multimodal (5 entries)</p></li><li><p>Open-source (13 entries)</p></li><li><p>Cool Tools (4 entries)</p></li></ul><h2><strong>Recent AI Tidbits Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;b0055811-e141-477d-877b-4b143ceeb600&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;OpenAI DevDay - a pivotal moment for AI &quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-11-07T15:30:25.921Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98fdd507-a5dc-4517-a165-87cab224ee7c_2300x1286.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/openai-devday&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:138650315,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:27,&quot;comment_count&quot;:3,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;10f63c7e-648f-4d63-914d-0a79a15962b2&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go! Last February, Stanford published a paper that sparked everyone&#8217;s imagination. In this paper, the researchers leveraged ChatGPT to power human-like agents. A mini-simulation of humanity.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The rise of autonomous agents&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-11-19T16:30:28.414Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56d15b18-239c-4403-839e-544d2e9dac77_600x378.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/the-rise-of-autonomous-agents&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:138981811,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:13,&quot;comment_count&quot;:3,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;56767746-10eb-4b6f-9aa5-6ecc7b2a5503&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go! In June 2020, OpenAI unveiled GPT-3. As a veteran in the document processing domain, I had long recognized the limitations of prevailing document extraction technologies, which largely relied on rigid, rule-based logic. I wondered if language models could be the answer to intelligent data extraction. And indeed, they were.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Revolutionizing document processing with multimodal GPT&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-10-30T14:30:30.962Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4c326a-53e0-492d-b375-9c69899b8fcd_800x1032.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/doc-extraction-gpt4&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:138339915,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:10,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><h2><strong>Industry announcements</strong></h2><ol><li><p><a href="https://www.aitidbits.ai/p/openai-devday">OpenAI announces a host of product releases in its inaugural dev conference: GPT-4 Turbo with a 128k-context window, a new text-to-speech model, and an autonomous agents API</a></p></li><li><p><a href="https://www.anthropic.com/index/claude-2-1?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Anthropic announces Claude 2.1 - a new version of its chatbot with a 200k context window, function calling support, and fewer hallucinations</a></p></li><li><p><a href="https://blog.perplexity.ai/blog/introducing-pplx-online-llms?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Perplexity unveils two new language models, pplx-7b-online and pplx-70b-online, providing up-to-date and accurate, accessible through a first-of-its-kind public API</a></p></li><li><p><a href="https://x.ai/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">xAI announces Grok-1 - a language model outperforming GPT-3.5 and Llama 2 having real-time direct access to tweets</a></p></li><li><p><a href="https://www.theverge.com/2023/11/28/23980203/aws-amazon-query-generative-ai?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Amazon introduces Amazon Q - its new chatbot for the enterprise, allowing companies to synthesize content and answer questions based on the company's data</a></p></li><li><p><a href="https://twitter.com/pika_labs/status/1729510078959497562?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Pika Labs launches Pika 1.0 - a substantially improved model for video generation and editing</a></p></li><li><p><a href="https://inflection.ai/inflection-2?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Inflection releases Inflection-2 - a new version of its language model</a></p></li><li><p><a href="https://www.theverge.com/2023/11/9/23953901/humane-ai-pin-launch-date-price-openai?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Humane introduces the AI Pin - a $699 OpenAI-powered wearable device for everyday use</a></p></li><li><p><a href="https://www.adept.ai/blog/experiments?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Adept announces Adept Experiments - a new program for users to explore Adept's models, along with ACT-2 - Adept's new multimodal model for UI understanding and action-taking</a></p></li><li><p><a href="https://www.together.ai/blog/together-inference-engine-v1?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Together AI releases Together Inference Engine - the world's fastest, achieving unparalleled speeds of up to 171 tokens per second</a></p></li><li><p><a href="https://www.figma.com/blog/introducing-ai-to-figjam/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Figma introduces the new FigJam AI to automate mundane design tasks</a></p></li><li><p><a href="https://venturebeat.com/ai/runways-gen-2-update-is-blowing-peoples-minds-with-incredible-ai-video/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Runway ML has updated its Gen-2 video generator to improve video quality and consistency, and to support higher-resolution video generation from images</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!n_uF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!n_uF!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif 424w, https://substackcdn.com/image/fetch/$s_!n_uF!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif 848w, https://substackcdn.com/image/fetch/$s_!n_uF!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif 1272w, https://substackcdn.com/image/fetch/$s_!n_uF!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!n_uF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif" width="594" height="334.125" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:288,&quot;width&quot;:512,&quot;resizeWidth&quot;:594,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!n_uF!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif 424w, https://substackcdn.com/image/fetch/$s_!n_uF!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif 848w, https://substackcdn.com/image/fetch/$s_!n_uF!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif 1272w, https://substackcdn.com/image/fetch/$s_!n_uF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8aaa0a6-632c-4aad-8a70-498267c876f7_512x288.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Pika 1.0</figcaption></figure></div></li></ol><h2><strong>Large Language Models (LLMs)</strong></h2><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://01.ai/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">01.ai openly releases Yi-34B - the first Chinese model to top Hugging Face's LLM Leaderboard supporting a 200k context window</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/november-2023-ai-tidbits-monthly">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[October 2023 - AI Tidbits Monthly Roundup]]></title><description><![CDATA[Multimodal AI soars with Adept&#8217;s Fuyu, LLaVA 1.5, and Obsidian, new language models deliver unmatched performance at a fraction of the cost, and a host of new techniques to better robotics.]]></description><link>https://www.aitidbits.ai/p/october-2023</link><guid isPermaLink="false">https://www.aitidbits.ai/p/october-2023</guid><dc:creator><![CDATA[Arthur Mor]]></dc:creator><pubDate>Sun, 12 Nov 2023 16:00:49 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to a&nbsp;subscriber-only edition<strong> </strong></em><strong>&#128274;</strong><em><strong>&nbsp;</strong>of AI Tidbits, where I curate the firehose of AI research papers and tools so you won&#8217;t have to. </em></p><p><em>This is the <strong>monthly curated round-up</strong>, so if you're pressed for time and can only catch one AI Tidbits edition, <strong>this is the one to read</strong>&#8212;featuring the absolute must-knows.</em></p><p><em> If you find AI Tidbits valuable, share it with a friend and consider showing your support.</em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Support AI Tidbits&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Support AI Tidbits</span></a></p><div><hr></div><p>Welcome to the October edition of AI Tidbits, where we unravel the latest and greatest in AI. October was filled with innovative breakthroughs and groundbreaking research showcasing the astounding pace of progress in AI.</p><p>Leading the charge were open-source multimodal models with the likes of Adept&#8217;s Fuyu, LLaVA 1.5, and Obsidian - the world&#8217;s smallest multimodal AI. On the open-source front, Hugging Face released Zephyr, a language model beating Anthropic&#8217;s Claude 2 on AlpacaEval, and Distil-Whisper - a speech2text model that is 6x faster compared to OpenAI&#8217;s Whisper.</p><p>Apple joined the generative AI race with a few new papers (Matryoshka, SAM-CLIP) and Google DeepMind was hard at work with new techniques to generate high-quality training data for robotics.</p><p>These and many more exciting updates across multimodal AI, video models, and open-source tools are part of this month&#8217;s roundup.</p><p>Let's dive in!</p><div><hr></div><p><strong>Overview</strong></p><ul><li><p>Large Language Models</p><ul><li><p>Commercial (5 entries)</p></li><li><p>Research (4 entries)</p></li><li><p>Open-source (8 entries)</p></li></ul></li><li><p><strong>&#10024; Special feature: Multimodal AI (10 entries)</strong></p></li><li><p>Autonomous Agents (4 entries)</p></li><li><p>Image and Video (10 entries)</p></li><li><p>Robotics  (5 entries)</p></li><li><p>Cool Tools (5 entries)</p></li><li><p>Open-source (6 entries)</p></li></ul><h2><strong>Recent AI Tidbits Deep Dives</strong></h2><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;56767746-10eb-4b6f-9aa5-6ecc7b2a5503&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go! In June 2020, OpenAI unveiled GPT-3. As a veteran in the document processing domain, I had long recognized the limitations of prevailing document extraction technologies, which largely relied on rigid, rule-based logic. I wondered if language models could be the answer to intelligent data extraction. And indeed, they were.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Revolutionizing document processing with multimodal GPT&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-10-30T14:30:30.962Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3a4c326a-53e0-492d-b375-9c69899b8fcd_800x1032.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/doc-extraction-gpt4&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:138339915,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:10,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;ca311286-2e37-4b49-b4c8-51818ae62287&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go! Three years ago, I started a company that turns PDF and image documents into structured data. My twist? Using language models. Two years later, I decided to refocus my energy elsewhere with the main reason being commoditization. The OCR and document intelligence market were a race to the bottom, and even though I could raise VC money - I realized it was a lost war.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The era of AI-powered SMBs&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2023-09-24T15:00:30.517Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fbf9e3ae-a4ee-4758-8510-834bad752d4e_480x360.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/ai-powered-smbs&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:137343711,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:8,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;275e94f6-3de3-4bd4-a06f-a6f28a59cd3e&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a new section of AI Tidbits providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go! &#8220;What do you mean there is an open-source library for that? We built the entire thing ourselves&#8221; is a quote I often hear from builders in the LLM space. I&#8217;ve been building with LLMs for the last year and turned my personal list of >60 useful packages into a public table so others won&#8217;t experience the same frustration.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Open-source Generative AI&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;Bringing the latest in AI to the mass through writings and Github repos&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2023-08-06T16:30:15.749Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/885bba4a-9f47-4763-82f1-b7b9196ed69d_1664x958.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-llms&quot;,&quot;section_name&quot;:&quot;Deep Dives&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:135729768,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:12,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><h2><strong>Large Language Models (LLMs)</strong></h2><h3><strong>Commercial</strong></h3><ol><li><p><a href="https://venturebeat.com/ai/chatgpt-is-combining-its-different-abilities-into-a-single-voltron-style-chat/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">OpenAI's ChatGPT now supports all of its modes in one conversation: Browsing, Advanced Data Analysis, and DALL-E</a>&nbsp;</p></li><li><p><a href="https://www.phind.com/blog/phind-model-beats-gpt4-fast?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Phind releases a new 16k context model that beats GPT-4 at coding at GPT-3.5-like speed</a></p></li><li><p><a href="https://twitter.com/perplexity_ai/status/1717953875678794158?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Perplexity releases two new language models, pplx-7b-chat and pplx-70b-chat, that substantially outperform Llama 2 according to human evaluators</a></p></li><li><p><a href="https://www.theverge.com/2023/10/12/23913337/google-ai-powered-search-sge-images-written-drafts?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Google brings image generation to its Bard chatbot through its text2image model Imagen</a>&nbsp;</p></li><li><p><a href="https://www.aboutamazon.com/news/innovation-at-amazon/amazon-ads-ai-powered-image-generator?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Amazon rolls out a suite of AI-powered image generation tools to help advertisers improve their product images</a></p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!78jp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!78jp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif 424w, https://substackcdn.com/image/fetch/$s_!78jp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif 848w, https://substackcdn.com/image/fetch/$s_!78jp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif 1272w, https://substackcdn.com/image/fetch/$s_!78jp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!78jp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif" width="600" height="416" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:416,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2166420,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!78jp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif 424w, https://substackcdn.com/image/fetch/$s_!78jp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif 848w, https://substackcdn.com/image/fetch/$s_!78jp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif 1272w, https://substackcdn.com/image/fetch/$s_!78jp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae65f482-dfcf-4bbb-9370-4d871f598159_600x416.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Editing images with DALL-E and GPT-4V </figcaption></figure></div><h3><strong>Research</strong></h3><ol><li><p><a href="https://arxiv.org/abs/2310.06117?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">DeepMind presents Step-Back Prompting - a two-step abstraction-and-reasoning process resulting in significant performance gains, including a 27% improvement on TimeQA and up to 36% over other prompting methods</a></p></li><li><p><a href="https://developer.nvidia.com/blog/announcing-steerlm-a-simple-and-practical-technique-to-customize-llms-during-inference/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Nvidia introduces SteerLM - a technique that enables real-time customization of LLMs during inference, showcasing superior performance on benchmarks and broad applicability across gaming, education, and enterprise sectors</a></p></li><li><p><a href="https://automix-llm.github.io/automix/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">CMU and Google introduce AutoMix - directing queries to larger LMs based on smaller LMs' output reliability to reduce costs while maintaining performance</a></p></li><li><p><a href="https://selfrag.github.io/?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Researchers release Self-RAG - a framework and models (7B + 13B) boosting LLMs' accuracy and quality by adaptively retrieving relevant information as needed</a></p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mRVR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mRVR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png 424w, https://substackcdn.com/image/fetch/$s_!mRVR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png 848w, https://substackcdn.com/image/fetch/$s_!mRVR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png 1272w, https://substackcdn.com/image/fetch/$s_!mRVR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mRVR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png" width="1456" height="545" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:545,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mRVR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png 424w, https://substackcdn.com/image/fetch/$s_!mRVR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png 848w, https://substackcdn.com/image/fetch/$s_!mRVR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png 1272w, https://substackcdn.com/image/fetch/$s_!mRVR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff767cc11-bdde-4ef3-960e-f418f2e95600_1630x610.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Step-Back Prompting performance boost across benchmarks</figcaption></figure></div><h3><strong>Open-source</strong></h3><ol><li><p><a href="https://huggingface.co/papers/2310.16944?utm_source=aitidbits.substack.com&amp;utm_medium=newsletter">Hugging Face releases Zephyr - a series of Mistral-based chat models with comparable performance to Anthropic's Claude 2 on AlpacaEval</a></p></li></ol>
      <p>
          <a href="https://www.aitidbits.ai/p/october-2023">
              Read more
          </a>
      </p>
   ]]></content:encoded></item></channel></rss>