<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[AI Tidbits]]></title><description><![CDATA[Stay ahead on the latest in AI through weekly summaries and editorial deep dives providing unique perspectives on recent developments]]></description><link>https://www.aitidbits.ai</link><image><url>https://substackcdn.com/image/fetch/$s_!-amS!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png</url><title>AI Tidbits</title><link>https://www.aitidbits.ai</link></image><generator>Substack</generator><lastBuildDate>Tue, 28 Apr 2026 12:06:13 GMT</lastBuildDate><atom:link href="https://www.aitidbits.ai/feed" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><webMaster><![CDATA[aitidbits@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[aitidbits@substack.com]]></itunes:email><itunes:name><![CDATA[Sahar Mor]]></itunes:name></itunes:owner><itunes:author><![CDATA[Sahar Mor]]></itunes:author><googleplay:owner><![CDATA[aitidbits@substack.com]]></googleplay:owner><googleplay:email><![CDATA[aitidbits@substack.com]]></googleplay:email><googleplay:author><![CDATA[Sahar Mor]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Automating Workflows With Claude Cowork]]></title><description><![CDATA[How to use Cowork&#8217;s overlooked features to automate the boring work]]></description><link>https://www.aitidbits.ai/p/automating-workflows-with-claude</link><guid isPermaLink="false">https://www.aitidbits.ai/p/automating-workflows-with-claude</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Tue, 14 Apr 2026 15:03:03 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/32c57b20-8118-4611-895e-943b6688a7ea_800x455.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to another post in the AI Coding Series, where I&#8217;m sharing the strategies and insights I&#8217;ve developed for effective AI-assisted coding.</em></p><p><em>This post is a collaboration with Ilia Karelin, who writes Prosper, a newsletter sharing tactical AI workflows with frameworks you can plug into your own setup. What drew me to Ilia&#8217;s work is that he goes deep into areas most people only skim the surface of, explaining them in layman&#8217;s terms that simplify things for both tech and non-tech folks. His complete <a href="https://prosperinai.substack.com/p/claude-chat-cowork-code-complete-setup-guide">setup guide</a> for Claude Chat, Cowork, and Code is one of the most thorough breakdowns I&#8217;ve seen. With Cowork gaining traction, I&#8217;ve noticed a gap between people who installed it and people who know how to make it useful. Ilia bridges that gap better than anyone I&#8217;ve read, so I was glad he was open to writing this together. Subscribe to Prosper here:</em></p><div class="embedded-publication-wrap" data-attrs="{&quot;id&quot;:2180334,&quot;name&quot;:&quot;Prosper&quot;,&quot;logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!svXS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d2f7f17-9068-4dd5-b8f9-b0f522236133_1200x1200.png&quot;,&quot;base_url&quot;:&quot;https://prosperinai.substack.com&quot;,&quot;hero_text&quot;:&quot;Every week, practical AI workflows, prompts, and guides you can put to use the same day - no matter your technical background.&quot;,&quot;author_name&quot;:&quot;Ilia Karelin&quot;,&quot;show_subscribe&quot;:true,&quot;logo_bg_color&quot;:&quot;#fafafa&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPublicationToDOMWithSubscribe"><div class="embedded-publication show-subscribe"><a class="embedded-publication-link-part" native="true" href="https://prosperinai.substack.com?utm_source=substack&amp;utm_campaign=publication_embed&amp;utm_medium=web"><img class="embedded-publication-logo" src="https://substackcdn.com/image/fetch/$s_!svXS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d2f7f17-9068-4dd5-b8f9-b0f522236133_1200x1200.png" width="56" height="56" style="background-color: rgb(250, 250, 250);"><span class="embedded-publication-name">Prosper</span><div class="embedded-publication-hero-text">Every week, practical AI workflows, prompts, and guides you can put to use the same day - no matter your technical background.</div><div class="embedded-publication-author-name">By Ilia Karelin</div></a><form class="embedded-publication-subscribe" method="GET" action="https://prosperinai.substack.com/subscribe?"><input type="hidden" name="source" value="publication-embed"><input type="hidden" name="autoSubmit" value="true"><input type="email" class="email-input" name="email" placeholder="Type your email..."><input type="submit" class="button primary" value="Subscribe"></form></div></div><div><hr></div><p>Cowork is Anthropic&#8217;s answer to non-technical folks seeking the benefits of Claude Code - an agent inside the Claude Desktop app that reads your files, connects to your tools, and executes complex tasks end-to-end without you writing a single line of code. If Claude Code is <a href="https://www.aitidbits.ai/p/claude-code-best-practices">built for engineers</a>, Cowork is built for everyone who wants the same capabilities without having to open a terminal.</p><p>You point it at a folder and describe what you need in plain language: organize 200 scattered files by type and project, turn receipts into a formatted expense report, batch-rename months of screenshots with consistent dates, or synthesize five research PDFs into a single brief with citations.</p><p>Since January, I&#8217;ve been running my newsletter production inside Cowork: idea triage, outline generation, draft editing, and automations that process my raw notes every Monday morning without me touching them.</p><p>The difference isn&#8217;t my prompting skill, but three useful Cowork features: connectors, scheduled tasks, and plugins. This post covers all three with concrete workflows you can reuse.</p><h1><strong>Connectors: Cowork reads and writes to your actual tools</strong></h1><p>Connectors let <em>Cowork</em> plug directly into the tools you already use: Gmail, Notion, Slack, and Google Drive. and work with your actual data inside a session.</p><p>In February, <em>Anthropic</em> expanded connectors beyond project management tools to include popular everyday apps such as Gmail, Google Calendar, Notion, and&nbsp;<a href="https://claude.com/connectors/">more</a>.</p><p>The connector I use most is Notion. It handles nested pages well, which matters for any workspace with real depth. The brain dump workflow, where raw ideas I capture on my phone get automatically triaged into structured newsletter briefs every Monday, is the next section that runs entirely through it.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ONkS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ONkS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png 424w, https://substackcdn.com/image/fetch/$s_!ONkS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png 848w, https://substackcdn.com/image/fetch/$s_!ONkS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png 1272w, https://substackcdn.com/image/fetch/$s_!ONkS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ONkS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png" width="1456" height="901" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:901,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ONkS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png 424w, https://substackcdn.com/image/fetch/$s_!ONkS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png 848w, https://substackcdn.com/image/fetch/$s_!ONkS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png 1272w, https://substackcdn.com/image/fetch/$s_!ONkS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F250c12ef-646e-42ba-bef9-a71cc68f9eed_1600x990.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Settings &#8594; Connectors to see all of your installed connectors</figcaption></figure></div><p>Gmail is the other app worth connecting to. Even without any scheduled tasks, having Cowork read your emails during a Cowork session, rather than you pasting them in, reduces friction enough that you&#8217;ll actually use it for email-related work, such as drafting emails in your style or taking bulk actions like archiving all emails from a specific sender.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_WDE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_WDE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png 424w, https://substackcdn.com/image/fetch/$s_!_WDE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png 848w, https://substackcdn.com/image/fetch/$s_!_WDE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png 1272w, https://substackcdn.com/image/fetch/$s_!_WDE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_WDE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png" width="1405" height="1023" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1023,&quot;width&quot;:1405,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!_WDE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png 424w, https://substackcdn.com/image/fetch/$s_!_WDE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png 848w, https://substackcdn.com/image/fetch/$s_!_WDE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png 1272w, https://substackcdn.com/image/fetch/$s_!_WDE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8207e148-6aed-4cd9-ba97-00146b3efcfc_1405x1023.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Settings &#8594; Connectors &#8594; Browse connectors to see all available connectors</figcaption></figure></div><p>The real payoff comes when you combine a connector with a scheduled task. That&#8217;s where Cowork stops being something you open and starts being something that runs in the background.</p><h1><strong>Scheduled tasks and automated workflows</strong></h1><p>In Cowork, each piece of work you hand off to Claude is called a task.</p><p>To set one up, type /schedule in any Cowork session and pick a frequency: daily, weekly, monthly, weekdays only, or on demand. That&#8217;s useful for anything that needs to happen regularly without you having to start it each time.</p><h2><strong>How to set it up</strong></h2><p>Two paths: type `/schedule` in any session, or use the &#8220;Scheduled&#8221; tab in the sidebar to create tasks directly. You set a name, a prompt, a frequency, and which folder to work from.</p><p>Each scheduled task runs as its own full Cowork session with access to all your connectors, plugins, and folder context.</p><p>Note that your laptop must be awake for scheduled tasks to run, as they run locally. Anthropic recently added a feature (see the red arrow below) to prevent your computer from going to sleep, so tasks can continue running.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!b5xJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!b5xJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png 424w, https://substackcdn.com/image/fetch/$s_!b5xJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png 848w, https://substackcdn.com/image/fetch/$s_!b5xJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!b5xJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!b5xJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png" width="1422" height="1600" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:1422,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!b5xJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png 424w, https://substackcdn.com/image/fetch/$s_!b5xJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png 848w, https://substackcdn.com/image/fetch/$s_!b5xJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!b5xJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe3248ec-230b-481c-87d3-0a7d42025098_1422x1600.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>A workflow I actually use</strong></h2><p>I have a &#8220;Quick Brain Dump&#8221; widget on my phone. Whenever an idea hits, I write it into a Notion page - a text block of raw and unrefined stream of thought. Those notes will remain on this page until I manually review and organize them. That is, until I plugged in Cowork.</p><p>To make that page usable in&nbsp;Cowork, I first connected my&nbsp;Notion&nbsp;workspace&nbsp;on my phone.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WjWP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WjWP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!WjWP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!WjWP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!WjWP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WjWP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg" width="340" height="603.1042128603104" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:902,&quot;resizeWidth&quot;:340,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!WjWP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!WjWP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!WjWP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!WjWP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F80873548-814e-4cc0-99f4-6ec154641703_902x1600.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Once I connected my Notion workspace to Cowork, I set up a scheduled task that runs every Monday morning.</p><ol><li><p>Reads everything added to the brain dump page that week</p></li><li><p>Decides which entries have real newsletter potential</p></li><li><p>Turns the strong ones into a working title, an angle, and 3 questions I&#8217;d need to answer to write it</p></li><li><p>Saves them to my Newsletter Ideas database as &#8220;Needs Review&#8221;</p></li></ol><p>I capture ideas throughout the week. When I sit down on Monday, the triage is already done.</p><p>Here&#8217;s the exact prompt running inside that task (ok to steal):</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;plaintext&quot;,&quot;nodeId&quot;:&quot;63d98157-72a9-4f49-a634-cdccbae82bec&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-plaintext">You are a newsletter research assistant. Connect to my Notion workspace 
using the Notion connector.

1. Read all entries added to my &#8220;Quick Brain Dump&#8221; page this week

2. For each entry, decide: does this have real potential to become a
newsletter post? (yes/no + one sentence why)

3. Only for strong &#8220;yes&#8221; entries: generate a working title, the angle
that makes it interesting, and 3 questions I&#8217;d need to answer to
write it

4. Skip entries that look like tasks, reminders, or personal notes -
only process ideas

5. Save all results to my &#8220;Newsletter Ideas&#8221; Notion database with
Status = &#8220;Needs Review&#8221; and Source = &#8220;Brain Dump [date]&#8221;
If an entry is too vague to evaluate, add it to the database with
Status = &#8220;Unclear&#8221; and a note on what&#8217;s missing.</code></pre></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mTX1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mTX1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png 424w, https://substackcdn.com/image/fetch/$s_!mTX1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png 848w, https://substackcdn.com/image/fetch/$s_!mTX1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png 1272w, https://substackcdn.com/image/fetch/$s_!mTX1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mTX1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png" width="1303" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1303,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!mTX1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png 424w, https://substackcdn.com/image/fetch/$s_!mTX1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png 848w, https://substackcdn.com/image/fetch/$s_!mTX1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png 1272w, https://substackcdn.com/image/fetch/$s_!mTX1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe39fa68f-3a4c-4f59-80c6-23b676b8bed7_1303x928.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7Jjv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7Jjv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png 424w, https://substackcdn.com/image/fetch/$s_!7Jjv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png 848w, https://substackcdn.com/image/fetch/$s_!7Jjv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png 1272w, https://substackcdn.com/image/fetch/$s_!7Jjv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7Jjv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png" width="1002" height="292" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:292,&quot;width&quot;:1002,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!7Jjv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png 424w, https://substackcdn.com/image/fetch/$s_!7Jjv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png 848w, https://substackcdn.com/image/fetch/$s_!7Jjv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png 1272w, https://substackcdn.com/image/fetch/$s_!7Jjv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F394ef58c-29e7-4a11-ab89-bb705a79a657_1002x292.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Toggle the &#8220;thinking&#8221; arrow in the chat window to see how Claude is handling the connector</figcaption></figure></div><p>Here are the results in my Notion workspace:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!z2RN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!z2RN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png 424w, https://substackcdn.com/image/fetch/$s_!z2RN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png 848w, https://substackcdn.com/image/fetch/$s_!z2RN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png 1272w, https://substackcdn.com/image/fetch/$s_!z2RN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!z2RN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png" width="1456" height="864" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:864,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!z2RN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png 424w, https://substackcdn.com/image/fetch/$s_!z2RN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png 848w, https://substackcdn.com/image/fetch/$s_!z2RN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png 1272w, https://substackcdn.com/image/fetch/$s_!z2RN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bfb7e7e-2b00-4a9d-9356-acf3ef9650cd_1600x950.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The same pattern works for anything where raw input is collected in one place and needs to be structured: Slack threads into action items, Gmail into expense logs, Linear tickets into weekly reports.</p><p>Cowork is like a back-office assistant that once taught, does the work for you on a recurring cadence.</p><h2><strong>Chaining scheduled tasks</strong></h2><p>The output of one scheduled task can feed into the next one&#8217;s input.</p><p>The brain dump task runs early Monday and saves structured briefs to a Notion database. A second scheduled task runs after that, reads those briefs, picks the strongest one based on recency and audience fit, and drops a full working outline into my local drafts folder.</p><p>Task #1 processes raw input. Task #2 acts on structured output. No human intervention needed.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sURm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sURm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png 424w, https://substackcdn.com/image/fetch/$s_!sURm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png 848w, https://substackcdn.com/image/fetch/$s_!sURm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png 1272w, https://substackcdn.com/image/fetch/$s_!sURm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sURm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png" width="1456" height="335" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:335,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!sURm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png 424w, https://substackcdn.com/image/fetch/$s_!sURm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png 848w, https://substackcdn.com/image/fetch/$s_!sURm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png 1272w, https://substackcdn.com/image/fetch/$s_!sURm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fddaf1e37-0cf9-4314-9d24-2a6b52386b8f_1600x368.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h1><strong>Each plugin is a full toolkit for one domain</strong></h1><p>There are dozens of <a href="https://github.com/anthropics/knowledge-work-plugins">plugins</a> available for Cowork, each one a specialist kit for a specific domain. A plugin bundles skills, connectors, slash commands, and subagents around a job function like Sales, Data, or Design, so you don&#8217;t have to wire them up yourself.</p><p>Install one and you immediately get domain-specific commands and integrations tailored to that role&#8217;s actual workflows.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d42j!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d42j!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png 424w, https://substackcdn.com/image/fetch/$s_!d42j!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png 848w, https://substackcdn.com/image/fetch/$s_!d42j!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!d42j!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d42j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png" width="1422" height="1600" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:1422,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!d42j!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png 424w, https://substackcdn.com/image/fetch/$s_!d42j!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png 848w, https://substackcdn.com/image/fetch/$s_!d42j!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!d42j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b4921b4-3f3f-4f39-8ab4-535922be44ca_1422x1600.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The plugins store</figcaption></figure></div><p>Here&#8217;s what the Data plugin contains, for example:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ai2I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ai2I!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png 424w, https://substackcdn.com/image/fetch/$s_!Ai2I!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png 848w, https://substackcdn.com/image/fetch/$s_!Ai2I!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png 1272w, https://substackcdn.com/image/fetch/$s_!Ai2I!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ai2I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Ai2I!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png 424w, https://substackcdn.com/image/fetch/$s_!Ai2I!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png 848w, https://substackcdn.com/image/fetch/$s_!Ai2I!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png 1272w, https://substackcdn.com/image/fetch/$s_!Ai2I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46ab7568-5bdd-482b-ba2c-129467d5ade0_1360x935.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h2><strong>Composable Plugins</strong></h2><p>You can install multiple plugins and commands inside of those plugins, and pull from all of them in a single task with just a prompt. Pretty incredible.</p><p>Here&#8217;s a real example. I dropped a messy sales CSV into Cowork: missing values, a duplicate row, a negative amount (these numbers are not real), and ran this single prompt with both the Data and Design plugins installed:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;plaintext&quot;,&quot;nodeId&quot;:&quot;addbb777-8623-4be2-b895-898e8cbe5cb7&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-plaintext">I have a sales CSV file (sample_sales.csv) and the Prosper company logo 
(image.jpeg) in this folder.

Step 1 - Data plugin:

/analyze the CSV and find:

- Total revenue by product (excluding refunded transactions)
- Month-over-month revenue trend across January, February, March
- Revenue split by region (North America, Europe, Asia Pacific)
- Any data quality issues worth flagging (nulls, anomalies, duplicates)

Step 2 - Data plugin:

/write-query to calculate the refund rate by product - which product
has the highest percentage of refunded transactions?

Step 3 &#8212; Design plugin:

/performance-report using the findings above. Format it as a clean one-page
Q1 sales report for Prosper. 
Include:

- The Prosper logo at the top
- A summary section with 3 headline numbers (total revenue, top product,
top region)
- A data quality callout box flagging the issues found in Step 1
- A refund risk section showing which product needs attention
- Brand colors: blue (#4B4EE4), white background, clean sans-serif font

Save the output as prosper_q1_report.pdf</code></pre></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9D7y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9D7y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png 424w, https://substackcdn.com/image/fetch/$s_!9D7y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png 848w, https://substackcdn.com/image/fetch/$s_!9D7y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!9D7y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9D7y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png" width="1422" height="1600" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:1422,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!9D7y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png 424w, https://substackcdn.com/image/fetch/$s_!9D7y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png 848w, https://substackcdn.com/image/fetch/$s_!9D7y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!9D7y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2efe6d7-c300-44b3-9d76-a25c4c7a201b_1422x1600.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Top half: The prompt for Cowork. Bottom half: Claude in action</figcaption></figure></div><p>Step #1 used the Data plugin to analyze revenue by product and region, flag the data quality issues, and calculate month-over-month trends. Step #2 wrote the SQL to calculate refund rates by product, and Step #3 switched to the Design plugin and formatted everything into a branded one-page report, logo included.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!o1V3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!o1V3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png 424w, https://substackcdn.com/image/fetch/$s_!o1V3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png 848w, https://substackcdn.com/image/fetch/$s_!o1V3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png 1272w, https://substackcdn.com/image/fetch/$s_!o1V3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!o1V3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png" width="561" height="749.867332382311" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:937,&quot;width&quot;:701,&quot;resizeWidth&quot;:561,&quot;bytes&quot;:143464,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://prosperinai.substack.com/i/192477122?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!o1V3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png 424w, https://substackcdn.com/image/fetch/$s_!o1V3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png 848w, https://substackcdn.com/image/fetch/$s_!o1V3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png 1272w, https://substackcdn.com/image/fetch/$s_!o1V3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14f54fcd-e4b1-4254-afbc-dadf9ed81d69_701x937.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A ready PDF report from Claude&#8217;s work</figcaption></figure></div><p>One prompt. Three steps. The output went straight from a raw CSV to something you could send to a stakeholder.</p><h1><strong>Where to start</strong></h1><p><strong>Today:</strong> Install Gmail or Slack connectors. Connect whichever tool you actually use to collect raw inputs. Notion, email, whatever it is.</p><p><strong>This week:</strong> Set up one scheduled task. Pick something you do manually on a recurring basis: drafting a sales update, triaging weekly tasks from Jira/Linear, summarizing newsletters, and let&nbsp;Cowork&nbsp;run it instead. Use the brain dump prompt above as a template, or use Claude to generate a prompt for you by describing the task you want it to automate</p><p><strong>This month:</strong> Install two or three plugins that are relevant for your tasks, such as Data Analysis or Designer. Then run a task that pulls data from multiple plugins in a single session. Once one scheduled task is working, consider chaining it as input to another task, e.g., a task to draft a report with another to send it via Slack and email.</p><p>The goal isn&#8217;t to use every feature. It&#8217;s to find one workflow that currently requires your manual attention and hand it off. Once that one runs on its own, the next one is easier to see.</p><p>That&#8217;s how <em>Cowork</em> becomes a system instead of a chatbot.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vGDd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vGDd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png 424w, https://substackcdn.com/image/fetch/$s_!vGDd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png 848w, https://substackcdn.com/image/fetch/$s_!vGDd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png 1272w, https://substackcdn.com/image/fetch/$s_!vGDd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vGDd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png" width="728" height="724" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:724,&quot;width&quot;:728,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:38819,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://prosperinai.substack.com/i/192477122?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!vGDd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png 424w, https://substackcdn.com/image/fetch/$s_!vGDd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png 848w, https://substackcdn.com/image/fetch/$s_!vGDd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png 1272w, https://substackcdn.com/image/fetch/$s_!vGDd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a857b39-524f-4ea2-b397-8f501fed882d_728x724.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h4>Recent related posts</h4><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5c9e8ec2-d90b-45d5-945e-4f67adc4199e&quot;,&quot;caption&quot;:&quot;Welcome to another post in the AI Coding Series, where I&#8217;m sharing the strategies and insights I've developed for effective AI-assisted coding.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Getting the most out of Claude Code&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100},{&quot;id&quot;:94578007,&quot;name&quot;:&quot;Jeff Morhous&quot;,&quot;bio&quot;:&quot;Accelerating software engineering careers with AI workflows. I show you how to use AI to write better code, ship faster, and get ahead.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!7HXm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe1fc042a-90b6-40d9-b9e1-3311e5b4882a_612x612.jpeg&quot;,&quot;is_guest&quot;:true,&quot;bestseller_tier&quot;:100,&quot;primaryPublicationSubscribeUrl&quot;:&quot;https://www.augmentedswe.com/subscribe?&quot;,&quot;primaryPublicationUrl&quot;:&quot;https://www.augmentedswe.com&quot;,&quot;primaryPublicationName&quot;:&quot;The AI-Augmented Engineer&quot;,&quot;primaryPublicationId&quot;:996569}],&quot;post_date&quot;:&quot;2025-12-03T15:31:22.108Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!KGWi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/claude-code-best-practices&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:179190194,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:80,&quot;comment_count&quot;:11,&quot;publication_id&quot;:1079420,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;95520e3b-a83b-41a1-9c05-e3df144f0790&quot;,&quot;caption&quot;:&quot;Welcome to another post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;DeepWiki: Understand Any Codebase&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-08-17T15:01:27.645Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/81e54dd2-f1e3-41cd-8f0a-94dc88719d16_852x480.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/deepwiki&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:170839853,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:77,&quot;comment_count&quot;:2,&quot;publication_id&quot;:1079420,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;d9df108b-f4ff-440c-ba34-db478ae78ed6&quot;,&quot;caption&quot;:&quot;Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Sahar&#8217;s Coding with AI guide&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-04-27T15:02:21.055Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!wWcP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/sahar-ai-coding&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:162210580,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:96,&quot;comment_count&quot;:0,&quot;publication_id&quot;:1079420,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;38b10b9a-9f0d-47b3-8571-ca2cb7cbe87a&quot;,&quot;caption&quot;:&quot;Welcome to another post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Rise of Cloud Coding Agents&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-09-04T14:31:39.187Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!PsvY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/cloud-coding-agents&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:171783559,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:52,&quot;comment_count&quot;:1,&quot;publication_id&quot;:1079420,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div>]]></content:encoded></item><item><title><![CDATA[Getting the most out of Claude Code]]></title><description><![CDATA[How to successfully on board Claude as a new engineer on your team]]></description><link>https://www.aitidbits.ai/p/claude-code-best-practices</link><guid isPermaLink="false">https://www.aitidbits.ai/p/claude-code-best-practices</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Wed, 03 Dec 2025 15:31:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!KGWi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to another post in the AI Coding Series, where I&#8217;m sharing the strategies and insights I've developed for effective AI-assisted coding.</em></p><p><em>I&#8217;m excited to host Jeff Morhous for this edition. Jeff is a senior software engineer and teacher at heart who spends his working hours solving software problems with AI tools, then dedicates the rest of his time teaching other developers to do the same through his newsletter, The AI-Augmented Engineer. I&#8217;ve been following his newsletter, The AI-Augmented Engineer, and I keep coming back to how clearly he breaks down AI coding concepts&#8212;whether he&#8217;s explaining <a href="https://www.augmentedswe.com/p/what-the-heck-is-vibe-coding">what vibe coding actually is</a>, showing how he <a href="https://www.augmentedswe.com/p/how-i-vibe-coded-an-iphone-app-that">vibe-coded an iPhone app</a> he now uses every day, or comparing <a href="https://www.augmentedswe.com/p/is-codex-cli-the-new-claude-code">Codex CLI to Claude Code</a>. His practical approach to helping developers ship faster with AI made him a natural fit for this series. I highly recommend subscribing to his Substack:</em></p><div class="embedded-publication-wrap" data-attrs="{&quot;id&quot;:996569,&quot;name&quot;:&quot;The AI-Augmented Engineer&quot;,&quot;logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!OL2M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae32fc96-70bd-4a15-85ad-da646433d5ae_1280x1280.png&quot;,&quot;base_url&quot;:&quot;https://www.augmentedswe.com&quot;,&quot;hero_text&quot;:&quot;Learn how to use AI tools to write better code, ship faster, and stay ahead of the curve&quot;,&quot;author_name&quot;:&quot;Jeff Morhous&quot;,&quot;show_subscribe&quot;:true,&quot;logo_bg_color&quot;:&quot;#fafafa&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPublicationToDOMWithSubscribe"><div class="embedded-publication show-subscribe"><a class="embedded-publication-link-part" native="true" href="https://www.augmentedswe.com?utm_source=substack&amp;utm_campaign=publication_embed&amp;utm_medium=web"><img class="embedded-publication-logo" src="https://substackcdn.com/image/fetch/$s_!OL2M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae32fc96-70bd-4a15-85ad-da646433d5ae_1280x1280.png" width="56" height="56" style="background-color: rgb(250, 250, 250);"><span class="embedded-publication-name">The AI-Augmented Engineer</span><div class="embedded-publication-hero-text">Learn how to use AI tools to write better code, ship faster, and stay ahead of the curve</div><div class="embedded-publication-author-name">By Jeff Morhous</div></a><form class="embedded-publication-subscribe" method="GET" action="https://www.augmentedswe.com/subscribe?"><input type="hidden" name="source" value="publication-embed"><input type="hidden" name="autoSubmit" value="true"><input type="email" class="email-input" name="email" placeholder="Type your email..."><input type="submit" class="button primary" value="Subscribe"></form></div></div><p><em>Today, Jeff is diving deep into Claude Code&#8217;s most powerful features: subagents, skills, and context files. Over to Jeff!</em></p><div><hr></div><p>If you&#8217;re following AI buzz on the internet, you&#8217;ve probably already heard of Claude Code. Whether you&#8217;re a professional developer using it for work or a hobbyist using it for non-code agentic workflows, Claude Code has something for everyone, as evidenced by its <a href="https://www.npmjs.com/package/@anthropic-ai/claude-code">~5M weekly downloads</a>.</p><p>While it&#8217;s simple to get started with the tool, getting <em>the most out of it</em> takes some extra effort as the Claude team has been rapidly launching new features over the last few months. In this post, I&#8217;ll walk you through the releases I found most impactful when collaborating with Claude Code:</p><ul><li><p>Subagents</p></li><li><p>Skills</p></li><li><p>Context files</p></li></ul><p>Let&#8217;s start with subagents!</p><h2>Understanding subagents in Claude Code</h2><p><a href="https://docs.claude.com/en/docs/claude-code/sub-agents">Subagents</a> are custom AI assistants that specialize in particular tasks or domains. They have their own configuration and context, enabling them to tackle subtasks without cluttering the main conversation. When a problem matches a subagent&#8217;s expertise, Claude can delegate that task to the subagent, which operates independently and returns the result.</p><p>For instance, my code reviewer subagent (more on that later) is invoked when I request a code review, while my Terraform subagent is invoked when I make infrastructure changes. It&#8217;s like having a site reliability engineer on staff.</p><h3>Why you should use subagents</h3><p>Each subagent has its <strong>own context window</strong>, separate from the main session. This isolation preserves the main conversation&#8217;s focus while the subagent works on a specific problem. It is well established that you get better and more accurate responses when your coding agent&#8217;s context window is focused. Mixing a request to write an API endpoint with a later request to refactor some CSS is likely to yield worse results than keeping those conversations separate.</p><p>Subagents come with a <strong>custom system prompt</strong> and can be granted a limited set of tools, making them <em>specialists</em> in their particular problem set.</p><p>The benefits of subagents are many, but can be summed up as:</p><ul><li><p>Context preservation (don&#8217;t pollute your main context)</p></li><li><p>Specialized expertise (can give specific and detailed instructions)</p></li><li><p>Reusability (Can be used across projects or checked into Git)</p></li><li><p>Controlled tool access (like shell commands)</p></li></ul><h3>Creating subagents</h3><p>Subagents are defined in Markdown files, typically suffixed with .md. You can create them through Claude Code&#8217;s interactive interface (/agents command) or by creating files manually: Project-specific subagents live under the project&#8217;s .claude/agents/ directory, while user-wide subagents live under ~/.claude/agents/. If a subagent name exists in both places, the project version takes precedence.</p><p>Each subagent file begins with a YAML header defining a unique name and a description of its purpose. You can also specify the tools the subagent can use and the language model powering that agent.</p><p>The rest of the file is the agent&#8217;s system prompt. This prompt should instruct the subagent on how to perform its role. It may include step-by-step guidelines, checklists, or example behaviors. For instance, a &#8220;code-reviewer&#8221; subagent might be defined as follows:</p><pre><code>---

name: code-reviewer

description: Expert code review specialist. Proactively reviews code for quality, security, and maintainability. Use immediately after writing or modifying code.

tools: Read, Grep, Glob, Bash

model: inherit

---

You are a senior code reviewer ensuring high standards of code quality and security.

When invoked:

1. Run git diff to see recent changes

2. Focus on modified files

3. Begin review immediately

Review checklist:

- Code is simple and readable

- Functions and variables are well-named

- No duplicated code

... (etc)</code></pre><p>Claude Code offers a handy interactive menu via the /agents command to manage subagents. Running /agents in the Claude terminal opens a UI where you can view all available subagents (built-in, user-level, project-level), create a new one with a guided wizard, edit their configuration (including tool permissions), or delete them. This is the recommended way to create/edit subagents, as it ensures proper syntax and lets you easily pick from available tools.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c6Cp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c6Cp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png 424w, https://substackcdn.com/image/fetch/$s_!c6Cp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png 848w, https://substackcdn.com/image/fetch/$s_!c6Cp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png 1272w, https://substackcdn.com/image/fetch/$s_!c6Cp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c6Cp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png" width="1456" height="905" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:905,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!c6Cp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png 424w, https://substackcdn.com/image/fetch/$s_!c6Cp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png 848w, https://substackcdn.com/image/fetch/$s_!c6Cp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png 1272w, https://substackcdn.com/image/fetch/$s_!c6Cp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feec8efde-7cbf-45ac-b0f4-ed557d13d021_1600x994.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Listing and creating subsagents via the Claude Code terminal</figcaption></figure></div><p>Subagent files can also be managed manually. For example, to create a new project subagent, you&#8217;d create a Markdown file under .claude/agents/. The YAML frontmatter and content can be written by hand or generated by Claude.</p><p>You can find implemented subagents for inspiration <a href="https://github.com/VoltAgent/awesome-claude-code-subagents">in this repository</a>.</p><h2>Different ways to invoke subagents</h2><p>Now that you know how to <em>make</em> subagents, you&#8217;re probably curious about how to actually use them.</p><p>The first and easiest is to explicitly invoke a subagent. Do this by explicitly calling your subagent as follows:</p><pre><code>&gt; Use the spec-request-spec agent to write request specs for the UserController</code></pre><p>The second way is to rely on Claude to pick an appropriate subagent, similar to how tools are being invoked based on tools&#8217; descriptions. It will do this by matching your prompt/query against the descriptions of available subagents. Similar to tool use, good agent descriptions increase the odds of proper automatic invocation.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KGWi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KGWi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png 424w, https://substackcdn.com/image/fetch/$s_!KGWi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png 848w, https://substackcdn.com/image/fetch/$s_!KGWi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png 1272w, https://substackcdn.com/image/fetch/$s_!KGWi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KGWi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png" width="1456" height="905" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:905,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KGWi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png 424w, https://substackcdn.com/image/fetch/$s_!KGWi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png 848w, https://substackcdn.com/image/fetch/$s_!KGWi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png 1272w, https://substackcdn.com/image/fetch/$s_!KGWi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bad23e0-762c-40d3-86ef-f777664c6801_1600x994.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Behind the scenes, when a subagent is invoked, Claude starts a fresh context for that subagent using its system prompt. The subagent may then use tools (as allowed), produce an answer, and perform actions. Once done, the conversation with the subagent ends, and Claude returns to the main session, freeing that context window&#8217;s tokens. Because subagents start with a clean context each time, they might introduce some latency as they gather any needed information (for example, re-reading relevant files) to carry out their task. The key benefit is that your main session&#8217;s context <a href="https://www.augmentedswe.com/p/making-llm-context-actually-work">remains free</a> of that transient detail, enabling longer, cheaper, and faster overall interactions.</p><h2>Best practices for using subagents</h2><ol><li><p><strong>Scoped</strong> - design narrow, focused subagents.<strong> </strong>Scoped agents are more predictable and performant.</p></li><li><p><strong>Detailed</strong> - be as specific as possible about the agent&#8217;s procedure, style, and constraints in the subagent&#8217;s Markdown content,</p></li><li><p><strong>Tracked</strong> - store subagent files (especially project-level ones) in your repository so that your team can use them and improve them over time.</p></li><li><p><strong>Assisted</strong> - use Claude&#8217;s /agents CLI workflow to have Claude draft a subagent&#8217;s initial version</p></li></ol><p>By thoughtfully creating subagents and following these practices, you can offload many specialized tasks from the main Claude Code context, leading to cleaner interactions and more effective AI assistance in your development workflow.</p><p>My most used subagent is a code reviewer. I explicitly spin off this subagent so I can get feedback on the changes in my branch before sending it to my peers. It&#8217;s saved me from tons of &#8220;nit&#8221; comments or embarrassing typos.</p><div><hr></div><pre><code><code>&#10024; AI Tidbits has partnered with Anthropic to provide premium members with $500 in Claude API credits, plus an additional $800 in credits across other leading AI tools and APIs including Hugging Face, Deepgram, and Vapi. Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Defining agent skills</h2><p>While subagents are like <em>specialized personas</em>, <a href="https://docs.claude.com/en/docs/claude-code/skills">Agent Skills</a> are more granular capabilities<strong> </strong>that Claude can draw upon. Skills package up instructions, examples, and even code or data that help Claude perform specific tasks on demand.</p><p>If there&#8217;s something that is in your CLAUDE.md that is only relevant to a small fraction of queries, turn that instruction into a skill. If there&#8217;s something in your day-to-day prompts that could benefit from a super clean context or has a tendency to fill up your main context, put it in a subagent.</p><p>An agent skill is a folder on your filesystem that contains a SKILL.md file (with instructions and metadata) and optional supporting files (scripts, reference docs, templates, etc.).</p><p>They allow you to extend Claude&#8217;s functionality in a modular way. For example, you might create a skill for &#8220;SQL troubleshooting&#8221; with instructions and scripts for database queries. Once the skill is set up, you no longer need to prompt Claude with those instructions every time. Claude will autonomously load the skill when needed. This reduces repetitive prompting and can combine multiple skills to handle complex tasks.</p><p>Every skill resides in its own directory named after the skill, under either your user or project skills folder:</p><ul><li><p><strong>Personal skills</strong> (locally, only for you) are stored in ~/.claude/skills/&lt;skill-name&gt;/SKILL.md. These are loaded for any project you use Claude Code in.</p></li><li><p><strong>Project skills</strong> (pushed to Git, shared with a team) are stored in .claude/skills/&lt;skill-name&gt;/SKILL.md within the project directory. These can be checked into git so all team members can access them.</p></li></ul><p>The main file SKILL.md in each skill directory defines the skill. It has a YAML frontmatter for metadata and a Markdown body for the instructions:</p><pre><code>---

name: your-skill-name

description: <em>Brief description of what this Skill does and when to use it</em>

---

# Your Skill Name

## Instructions

<em>Provide clear, step-by-step guidance for Claude.</em>

## Examples

<em>Show concrete examples of using this Skill.</em></code></pre><p>Don&#8217;t forget that this skills folder can also contain additional files that would help Claude with this skill. If you have an Excel file of financial data you&#8217;d like the skill to reference, put it here! If you have a PDF with diagrams describing how something works, it belongs here, too. Anthropic even suggests putting examples as files and suggests Markdown as an appropriate format.</p><p>You can list all currently loaded skills by using the slash command in the CLI. In practice, since skills are automatic, the way to test a skill is to pose a query that should trigger it. For example, after creating a &#8220;PDF processing&#8221; skill, you can ask Claude something involving PDFs to see if it uses the skill. Claude&#8217;s response often indicates when it&#8217;s using a skill (you might see it mention reading the skill file in the conversation, especially if running in debug/verbose mode).</p><p>My favorite skill is for consistent frontend design. I give brand colors, font choices, and other UX specifications in a frontend design skill so any changes Claude makes to my views are aligned with the existing patterns. This saves me some time and keeps my meta CLAUDE.md file from having too much extra context.</p><h2>Using CLAUDE.md effectively</h2><p>Claude Code heavily leans on a <strong>context file</strong> named CLAUDE.md, which provides persistent background information and guidelines for your project. This file is automatically loaded into Claude&#8217;s context every time you start a session in that project directory (or at global scope), so you don&#8217;t have to repeatedly remind the AI of fundamental details about your project or preferences. Essentially, it&#8217;s a way to &#8220;inject&#8221; important context or rules <em>before</em> any conversation begins.</p><p>A context file is where you put fundamental project knowledge, constraints, and style guidelines that you want Claude to always be aware of. Think of it like documentation or notes that are always open to Claude.</p><p>You can even <em>start</em> your <a href="http://claude.md">CLAUDE.md</a> with the /init slash command, but don&#8217;t forget to review it before moving forward.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Njyy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Njyy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png 424w, https://substackcdn.com/image/fetch/$s_!Njyy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png 848w, https://substackcdn.com/image/fetch/$s_!Njyy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png 1272w, https://substackcdn.com/image/fetch/$s_!Njyy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Njyy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png" width="1456" height="905" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:905,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Njyy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png 424w, https://substackcdn.com/image/fetch/$s_!Njyy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png 848w, https://substackcdn.com/image/fetch/$s_!Njyy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png 1272w, https://substackcdn.com/image/fetch/$s_!Njyy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65553d8e-11fd-4dcc-b592-a82128c88c4d_1600x994.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Claude Code generates the initial claude.md file using the /init command</figcaption></figure></div><p>By placing project descriptions, architecture breakdowns, style conventions, and important requirements in CLAUDE.md, you ensure Claude Code always &#8220;remembers&#8221; this context, leading to more relevant and accurate assistance.</p><p>Claude Code is smart about where it looks for context files. It supports <strong>multiple CLAUDE.md files in a hierarchy</strong>. If present, the global context file is loaded for all sessions (your user-level default context).</p><p>The <strong>project context file</strong> in the current working directory (usually your repository root) named CLAUDE.md is loaded if present. You can also use CLAUDE.local.md in the project directory if you want a version that isn&#8217;t checked into git.</p><p>If you are working in a sub-folder of a larger repo, Claude will also load any CLAUDE in parent directories up to where you launched it. This is especially useful for monorepos or layered projects. Conversely, Claude can load files from subdirectories <em>when relevant</em>. For example, if you start at the repo root and ask about code in a given directory, and that directory contains its own CLAUDE.md, Claude will pull in that subfolder&#8217;s CLAUDE.md content if needed.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c5zl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c5zl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png 424w, https://substackcdn.com/image/fetch/$s_!c5zl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png 848w, https://substackcdn.com/image/fetch/$s_!c5zl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png 1272w, https://substackcdn.com/image/fetch/$s_!c5zl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c5zl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png" width="1456" height="778" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:778,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:320745,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/179190194?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!c5zl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png 424w, https://substackcdn.com/image/fetch/$s_!c5zl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png 848w, https://substackcdn.com/image/fetch/$s_!c5zl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png 1272w, https://substackcdn.com/image/fetch/$s_!c5zl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F74576b99-0a45-4750-9cbb-1cfb7eb4d04e_1950x1042.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The when, how, and why framework</figcaption></figure></div><h2>What will you build with Claude Code?</h2><p>Claude Code really pushes an agentic programming model. This is incredibly useful, but comes with a learning curve. Enabling the use of specialized subagents, on-demand skills, and persistent context gives you an edge when tackling complex coding tasks. By understanding how these pieces work together, you can dramatically enhance your productivity and code quality.</p><p>If you want more tips on using Claude Code (or other AI tools for software engineers), we&#8217;d love to have you as a reader over at <a href="https://www.augmentedswe.com/">The AI-Augmented Engineer</a>.</p><div class="embedded-publication-wrap" data-attrs="{&quot;id&quot;:996569,&quot;name&quot;:&quot;The AI-Augmented Engineer&quot;,&quot;logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!OL2M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae32fc96-70bd-4a15-85ad-da646433d5ae_1280x1280.png&quot;,&quot;base_url&quot;:&quot;https://www.augmentedswe.com&quot;,&quot;hero_text&quot;:&quot;Learn how to use AI tools to write better code, ship faster, and stay ahead of the curve&quot;,&quot;author_name&quot;:&quot;Jeff Morhous&quot;,&quot;show_subscribe&quot;:true,&quot;logo_bg_color&quot;:&quot;#fafafa&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="EmbeddedPublicationToDOMWithSubscribe"><div class="embedded-publication show-subscribe"><a class="embedded-publication-link-part" native="true" href="https://www.augmentedswe.com?utm_source=substack&amp;utm_campaign=publication_embed&amp;utm_medium=web"><img class="embedded-publication-logo" src="https://substackcdn.com/image/fetch/$s_!OL2M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fae32fc96-70bd-4a15-85ad-da646433d5ae_1280x1280.png" width="56" height="56" style="background-color: rgb(250, 250, 250);"><span class="embedded-publication-name">The AI-Augmented Engineer</span><div class="embedded-publication-hero-text">Learn how to use AI tools to write better code, ship faster, and stay ahead of the curve</div><div class="embedded-publication-author-name">By Jeff Morhous</div></a><form class="embedded-publication-subscribe" method="GET" action="https://www.augmentedswe.com/subscribe?"><input type="hidden" name="source" value="publication-embed"><input type="hidden" name="autoSubmit" value="true"><input type="email" class="email-input" name="email" placeholder="Type your email..."><input type="submit" class="button primary" value="Subscribe"></form></div></div><div><hr></div><p>Recent posts on coding better with AI:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;a72e4e2f-8248-4aac-8d5f-3e07b99aa771&quot;,&quot;caption&quot;:&quot;Welcome to another post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Rise of Cloud Coding Agents&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-09-04T14:31:39.187Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!PsvY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/cloud-coding-agents&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:171783559,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:50,&quot;comment_count&quot;:0,&quot;publication_id&quot;:1079420,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;3c370087-752a-4554-8c38-c02d00cf92f5&quot;,&quot;caption&quot;:&quot;Eight ways I use DeepWiki to speed up real coding work.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;DeepWiki: Understand Any Codebase&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-08-17T15:01:27.645Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/81e54dd2-f1e3-41cd-8f0a-94dc88719d16_852x480.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/deepwiki&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:170839853,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:64,&quot;comment_count&quot;:1,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5ef2625d-56e9-4312-8035-395c9bc9dfaf&quot;,&quot;caption&quot;:&quot;Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Sahar&#8217;s Coding with AI guide&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-04-27T15:02:21.055Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!wWcP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/sahar-ai-coding&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:162210580,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:88,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;4ea0136c-a477-476d-916e-214c3800fbc7&quot;,&quot;caption&quot;:&quot;Welcome to LinkedIn Highlights!&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;LinkedIn Highlights, August 2025 - AI Coding Edition&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-09-13T15:02:03.874Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!vQB7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/linkedin-highlights-aug-2025&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:173318428,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:37,&quot;comment_count&quot;:0,&quot;publication_id&quot;:1079420,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">AI Tidbits is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, August 2025 - AI Coding Edition]]></title><description><![CDATA[DeepWiki&#8217;s instant repo querying, Claudia&#8217;s GUI for Claude Code, Anthropic&#8217;s best practices, Task Master coding agents orchestration, plus new document parsing tools from Google and Mistral]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-aug-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-aug-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sat, 13 Sep 2025 15:02:03 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!vQB7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my top-performing LinkedIn posts, bringing you the best of AI straight from the frontlines of academia and industry.</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and useful tools for folks collaborating with AI.</p><p>Over the past month, I&#8217;ve focused on exploring advanced coding workflows powered by Claude Code and open-source task orchestration tools. This post covers everything from DeepWiki's instant repo querying to Anthropic&#8217;s subagent ecosystem, Claude Code best practices, and structured agent workflows with Task Master AI.</p><p>To close things out, I also highlight two powerful tools from Google and Mistral for document parsing, which bring new levels of accuracy and traceability to text extraction.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><p><strong>Recent posts on coding with AI</strong></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;2d0df808-9904-4430-a570-dafd0669c7df&quot;,&quot;caption&quot;:&quot;Welcome to another post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Rise of Cloud Coding Agents&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-09-04T14:31:39.187Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!PsvY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/cloud-coding-agents&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:171783559,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:42,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;c5a91ce4-846d-4442-a534-a167d5d5455b&quot;,&quot;caption&quot;:&quot;Welcome to another post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;DeepWiki: Understand Any Codebase&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-08-17T15:01:27.645Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/81e54dd2-f1e3-41cd-8f0a-94dc88719d16_852x480.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/deepwiki&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:170839853,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:66,&quot;comment_count&quot;:1,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;388f6d93-8119-4907-af3c-6bee43f24e9d&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents v2&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-06-01T15:02:05.876Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents-updated&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:163224476,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:54,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h1>(1) DeepWiki</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IZoC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IZoC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg 424w, https://substackcdn.com/image/fetch/$s_!IZoC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg 848w, https://substackcdn.com/image/fetch/$s_!IZoC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!IZoC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IZoC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg" width="1456" height="812" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:812,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;graphical user interface, text&quot;,&quot;title&quot;:&quot;graphical user interface, text&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="graphical user interface, text" title="graphical user interface, text" srcset="https://substackcdn.com/image/fetch/$s_!IZoC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg 424w, https://substackcdn.com/image/fetch/$s_!IZoC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg 848w, https://substackcdn.com/image/fetch/$s_!IZoC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!IZoC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7ffb23f2-9a10-4098-b904-fb53573a4bb2_2048x1142.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>One of the most underrated AI coding tools released this year has become an essential part of my coding workflow.</p><p>DeepWiki, from the team behind Devin the coding agent, turns any GitHub repository into an instant wiki. Replace <strong><a href="http://github.com/">github.com</a></strong> with <strong><a href="http://deepwiki.com/">deepwiki.com</a></strong> in any repo URL and start asking questions without digging through files.</p><p>I now use it for:</p><ol><li><p>Onboarding to unfamiliar codebases - targeted explanations with direct file links</p></li><li><p>Understanding implementation patterns - authentication flows, state management approaches</p></li><li><p>Evaluating open-source projects - licensing, security posture, maintenance status</p></li><li><p>Environment setup - getting exact commands and dependencies with citations</p></li><li><p>Building context for AI coding agents - structured summaries and architectural overviews</p></li></ol><p>Every answer includes clickable, line-level citations that link back to source files. No hallucinated summaries, just grounded responses tied to actual code.</p><p>The tool integrates directly into Claude and Cursor via a free MCP server, making it queryable within your existing workflow.</p><p>I've documented 8 specific use cases with examples in a detailed breakdown of how I integrate DeepWiki into my development process<strong> <a href="https://www.aitidbits.ai/p/deepwiki">https://www.aitidbits.ai/p/deepwiki</a></strong></p><div><hr></div><h1>(2) Claudia - Manage Claude Code easily</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vQB7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vQB7!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif 424w, https://substackcdn.com/image/fetch/$s_!vQB7!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif 848w, https://substackcdn.com/image/fetch/$s_!vQB7!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif 1272w, https://substackcdn.com/image/fetch/$s_!vQB7!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vQB7!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif" width="800" height="520" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:520,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alternative text description for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alternative text description for this image" title="No alternative text description for this image" srcset="https://substackcdn.com/image/fetch/$s_!vQB7!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif 424w, https://substackcdn.com/image/fetch/$s_!vQB7!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif 848w, https://substackcdn.com/image/fetch/$s_!vQB7!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif 1272w, https://substackcdn.com/image/fetch/$s_!vQB7!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F092ce396-0192-4521-aefe-f3e3802bd7ce_800x520.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Claude Code just got its first serious GUI.</p><p>A two-month-old open-source tool called Claudia turns Claude Code into a full-featured desktop environment for agent-based development with custom agents, usage tracking, session versioning, and a visual project browser that actually makes your ~/.claude directory usable.</p><p>Key features:</p><ol><li><p>Project + session management &#8211; browse, resume, and diff past sessions across projects, with smart search and visual timelines</p></li><li><p>Agent creation &#8211; build persistent Claude agents with custom prompts, sandboxed permissions, and background execution</p></li><li><p>Usage tracking &#8211; real-time analytics dashboard showing cost per model/project, usage history, and exportable data</p></li><li><p>MCP support &#8211; manage your Model Context Protocol servers from a dedicated UI, with easy import from Claude Desktop</p></li><li><p>Markdown support &#8211; edit, preview, and organize <strong><a href="http://claude.md/">CLAUDE.md</a></strong> files across projects with live rendering and syntax highlighting</p></li></ol><p>Under the hood, Claudia runs on Tauri and supports multi-process execution for non-blocking agent runs.</p><p>Claudia makes Claude Code accessible for developers who've been hesitant about command-line AI tools or struggled to track Claude Code sessions across projects.</p><p>(My recent approach changed from Claudia + Cursor to Claude Code + Task Manager, but Claudia is still useful for the CLI haters).</p><p>GitHub repo <strong><a href="https://github.com/getAsterisk/opcode">https://github.com/getAsterisk/opcode</a></strong></p><div><hr></div><h1>(3) Anthropic&#8217;s tips for using Claude Code</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nZNy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nZNy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg 424w, https://substackcdn.com/image/fetch/$s_!nZNy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nZNy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nZNy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nZNy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg" width="1212" height="633" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:633,&quot;width&quot;:1212,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;diagram&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="diagram" title="diagram" srcset="https://substackcdn.com/image/fetch/$s_!nZNy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg 424w, https://substackcdn.com/image/fetch/$s_!nZNy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg 848w, https://substackcdn.com/image/fetch/$s_!nZNy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!nZNy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30dac9e2-e5b3-44ce-8847-44a9fee5e9d6_1212x633.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The Claude Code team just revealed how they responsibly merged 22,000 lines of AI-generated code into production.<br><br>Their breakthrough? They stopped treating AI like a magic wand and started managing it like you would any capable but inexperienced team member.<br><br>Here are the five best practices they shared for "vibe coding" in prod:<br>(1) Be Claude's product manager - spend 15-20 minutes collecting requirements, exploring the codebase, and building a plan together before letting the AI execute. Just like you wouldn't tell a new hire "build this feature" without context.<br><br>(2) Focus on leaf nodes, not core architecture - use AI for end features and components that nothing else depends on. Protect your system's trunk and branches where tech debt would be catastrophic.<br><br>(3) Design for verifiability - create systems with easily verifiable inputs/outputs and stress tests. You need to validate correctness without reading every line of generated code.<br><br>(4) Prepare for the exponential - AI can handle ~1-hour tasks today, but that capability doubles every 7 months. Next year, it'll tackle day-long projects, then week-long ones. If you insist on reviewing every line of code, you'll become the bottleneck.<br><br>(5) Know enough to ask the right questions - You still need technical judgment to guide the AI safely. The vulnerabilities in vibe-coded apps come from people who can't distinguish between what's dangerous and what's safe.<br><br>The bottom line: we're moving from individual contributors who understand every line to managers who understand the system. The question isn't whether this will happen, but whether you'll be ready when it does.<br><br><a href="https://www.youtube.com/watch?v=fHWFF_pnqDk">Full talk</a></p><div><hr></div><h1>(4) <strong>Claude Code Subagents</strong></h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JZG1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JZG1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png 424w, https://substackcdn.com/image/fetch/$s_!JZG1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png 848w, https://substackcdn.com/image/fetch/$s_!JZG1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png 1272w, https://substackcdn.com/image/fetch/$s_!JZG1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JZG1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png" width="1456" height="749" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:749,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:312040,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/173318428?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JZG1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png 424w, https://substackcdn.com/image/fetch/$s_!JZG1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png 848w, https://substackcdn.com/image/fetch/$s_!JZG1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png 1272w, https://substackcdn.com/image/fetch/$s_!JZG1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4bce8ff7-af93-4531-ad26-01074183127f_2570x1322.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Claude Code now supports custom subagents, and a new open-source site makes it dramatically easier to start using them.</p><p>Claude Code's sub-agent feature lets you delegate to specialized AI experts instead of relying on one overwhelmed assistant. Each subagent operates within its own context window and focuses on its own expertise, much like having a full development team where every member excels in their specific domain.</p><p>This new site features 30+ prebuilt Claude Code agents, from UX Optimizer to System Architect to User Researcher. Each is designed to handle a specific task in your workflow. Just click, copy the prompt, and deploy.</p><p>These agents coordinate automatically. Ask Claude to "build a secure payment feature" and watch the system architect design the API, the security auditor validate the implementation, and the test generator create comprehensive coverage - all working in parallel.</p><p>No more crafting detailed system prompts or figuring out tool permissions - just copy-paste the configurations you need and integrate them in seconds.</p><p>Website <a href="https://www.subagents.cc">https://www.subagents.cc</a></p><div><hr></div><h1>(5) Claude Code tips</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1IC9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1IC9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg 424w, https://substackcdn.com/image/fetch/$s_!1IC9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg 848w, https://substackcdn.com/image/fetch/$s_!1IC9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!1IC9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1IC9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg" width="1456" height="603" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:603,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alternative text description for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alternative text description for this image" title="No alternative text description for this image" srcset="https://substackcdn.com/image/fetch/$s_!1IC9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg 424w, https://substackcdn.com/image/fetch/$s_!1IC9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg 848w, https://substackcdn.com/image/fetch/$s_!1IC9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!1IC9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8bd396f-65b3-44ce-9a64-e659cb055d94_2048x848.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Anthropic just shared how their teams actually use Claude Code internally, revealing how frontier labs operate at the cutting edge. An interesting peek behind the scenes.</p><p>I expected the usual suspects: debugging, code reviews, boilerplate generation. But applications go far beyond typical developer workflows. Their lawyers built phone tree systems. Marketing teams generate hundreds of ad variations in seconds using specialized sub-agents that process CSV files. Data scientists without TypeScript knowledge are building complete React applications for ML model visualization.</p><p>The performance gains are substantial: security engineers reduced debugging time from 15 minutes to 5 minutes by feeding Claude stack traces during production incidents. New data scientists become productive on complex codebases immediately rather than spending weeks learning architecture. Research time for unfamiliar ML functions dropped by 80%.</p><p>What's particularly notable is how this shifts organizational dynamics. Product engineers now tackle bugs in unfamiliar codebases without depending on other teams. The Growth Marketing team built agentic workflows that would traditionally require dedicated developer resources.</p><p>Claude Code is not only for coding. My observation: we're seeing the early manifestation of a fundamental change in how companies operate. That has been the vibe for some time, but now we actually see this in practice.</p><p>Full piece <a href="https://www.anthropic.com/news/how-anthropic-teams-use-claude-code">https://www.anthropic.com/news/how-anthropic-teams-use-claude-code</a></p><div><hr></div><h1>(6) Task Master AI</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!k2V1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!k2V1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg 424w, https://substackcdn.com/image/fetch/$s_!k2V1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg 848w, https://substackcdn.com/image/fetch/$s_!k2V1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!k2V1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!k2V1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg" width="1456" height="817" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:817,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;text&quot;,&quot;title&quot;:&quot;text&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="text" title="text" srcset="https://substackcdn.com/image/fetch/$s_!k2V1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg 424w, https://substackcdn.com/image/fetch/$s_!k2V1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg 848w, https://substackcdn.com/image/fetch/$s_!k2V1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!k2V1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0393c6c-31c7-49b6-9d87-be4b31da33f9_2048x1149.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-highlights-aug-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Rise of Cloud Coding Agents]]></title><description><![CDATA[What it&#8217;s actually like to work with today&#8217;s leading agents such as Devin, Codex, and Cursor]]></description><link>https://www.aitidbits.ai/p/cloud-coding-agents</link><guid isPermaLink="false">https://www.aitidbits.ai/p/cloud-coding-agents</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Thu, 04 Sep 2025 14:31:39 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!PsvY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to another post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding.</em></p><p><em>In this post, I break down the shift from desktop to cloud-based coding agents, exploring what makes them different, how they fit into real-world development workflows, and where each leading tool stands today. Whether you&#8217;re exploring Devin, Codex, Jules, Factory, or Cursor Background Agents, this guide will help you understand how they work, their strengths and trade-offs, and how to get the most out of them.</em></p><div><hr></div><p><strong>A NotebookLM-powered video podcast summarizing this post</strong></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;67c61577-2430-47be-8c34-61b9bf080e0e&quot;,&quot;duration&quot;:null}"></div><p><strong>Keep exploring this post:</strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://chatgpt.com/?hints=search&amp;prompt=Please+review+and+answer+any+questions+I+might+have%20www.aitidbits.ai%2Fp%2Fcloud-coding-agents%0A%0A%23+Question%0A%5BYOUR+QUESTION+HERE%5D&quot;,&quot;text&quot;:&quot;Open in ChatGPT&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://chatgpt.com/?hints=search&amp;prompt=Please+review+and+answer+any+questions+I+might+have%20www.aitidbits.ai%2Fp%2Fcloud-coding-agents%0A%0A%23+Question%0A%5BYOUR+QUESTION+HERE%5D"><span>Open in ChatGPT</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://claude.ai/new?q=Please%20review%20and%20answer%20any%20questions%20I%20might%20have%20www.aitidbits.ai%2Fp%2Fcloud-coding-agents%0A%0A%23%20Question%0A%5BYOUR%20QUESTION%20HERE%5D&quot;,&quot;text&quot;:&quot;Open in Claude&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://claude.ai/new?q=Please%20review%20and%20answer%20any%20questions%20I%20might%20have%20www.aitidbits.ai%2Fp%2Fcloud-coding-agents%0A%0A%23%20Question%0A%5BYOUR%20QUESTION%20HERE%5D"><span>Open in Claude</span></a></p><div><hr></div><p>Agent-assisted coding is evolving quickly. Tools like Cursor, Windsurf, and Claude Code are already part of many developers&#8217; workflows. These desktop agents run locally and rely on continuous back-and-forth: Developer drafts a coding task prompt &#8594; Coding agent generates code &#8594; Developer asks for changes/fixes &#8594; Coding agent implements change &#8594; You commit local changes as part of a pull request.</p><p>This pair-programming style boosts productivity, but it doesn&#8217;t scale. The interaction is synchronous: you must constantly steer the agent from the initial coding task prompt to creating a pull request. Running multiple coding agents in parallel feels like managing multiple junior developers simultaneously.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZKlG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZKlG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png 424w, https://substackcdn.com/image/fetch/$s_!ZKlG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png 848w, https://substackcdn.com/image/fetch/$s_!ZKlG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png 1272w, https://substackcdn.com/image/fetch/$s_!ZKlG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZKlG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png" width="673" height="376.2513736263736" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:814,&quot;width&quot;:1456,&quot;resizeWidth&quot;:673,&quot;bytes&quot;:1260180,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/171783559?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZKlG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png 424w, https://substackcdn.com/image/fetch/$s_!ZKlG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png 848w, https://substackcdn.com/image/fetch/$s_!ZKlG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png 1272w, https://substackcdn.com/image/fetch/$s_!ZKlG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F78453074-6833-4a9b-9b3a-7582728c1937_2528x1414.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Real-world engineering teams work differently.</p><p>Enter cloud agents<strong>,</strong> asynchronous coding agents that better resemble a dev on your team: You assign a task &#8594; Cloud agent spins up its own environment in the cloud (as if it had its own laptop) &#8594; Cloud agent makes changes &#8594; Cloud agent opens a pull request for you to review.</p><p>You can request and merge changes once the code meets your standards. Some even integrate with Slack and other collaboration tools such as Linear and GitHub, further streamlining the development and CD/CI cycles.</p><p>In 2025, the line between desktop and cloud agents is blurring. Cognition, the creator of Devin, the web-managed cloud agent, acquired Windsurf, a Cursor-like IDE that acts as a desktop agent. Cursor, on the other hand, now offers background agents that run asynchronously both locally and on the web. Factory AI (cloud agent) offers a downloadable bridge that enables asynchronous workflows in local environments. Google&#8217;s Jules (cloud agent) just graduated out of Beta to complement Gemini desktop CLI (desktop agent), mirroring OpenAI&#8217;s Codex (web) and Codex CLI (desktop) approach.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!44wX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!44wX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png 424w, https://substackcdn.com/image/fetch/$s_!44wX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png 848w, https://substackcdn.com/image/fetch/$s_!44wX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png 1272w, https://substackcdn.com/image/fetch/$s_!44wX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!44wX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png" width="699" height="391.2671703296703" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b32ba969-0018-4acf-ad25-21107f291048_2520x1410.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:815,&quot;width&quot;:1456,&quot;resizeWidth&quot;:699,&quot;bytes&quot;:1232771,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/171783559?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!44wX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png 424w, https://substackcdn.com/image/fetch/$s_!44wX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png 848w, https://substackcdn.com/image/fetch/$s_!44wX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png 1272w, https://substackcdn.com/image/fetch/$s_!44wX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb32ba969-0018-4acf-ad25-21107f291048_2520x1410.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The path for coding alongside AI is set: as models and tooling improve and best practices solidify, coding agents are shifting to asynchronous-first workflows. To clarify, autonomy isn&#8217;t a &#8220;web&#8221; feature, it&#8217;s an agent capability. It just so happens that, today, most fully autonomous agents are delivered as web-based tools.</p><p>In this post, I&#8217;ll walk you through what it&#8217;s like to work with each of the leading cloud agents, including a screen recording of my workflow so you can see how the interfaces look and behave in practice. Whether you&#8217;re curious about what these agents can actually do or trying to figure out which one fits best into your development workflow, this guide is for you. I&#8217;ve also included a comparison table at the end that makes clear which tools truly stand out.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PsvY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PsvY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png 424w, https://substackcdn.com/image/fetch/$s_!PsvY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png 848w, https://substackcdn.com/image/fetch/$s_!PsvY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png 1272w, https://substackcdn.com/image/fetch/$s_!PsvY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PsvY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png" width="1456" height="815" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:815,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1928950,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/171783559?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PsvY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png 424w, https://substackcdn.com/image/fetch/$s_!PsvY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png 848w, https://substackcdn.com/image/fetch/$s_!PsvY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png 1272w, https://substackcdn.com/image/fetch/$s_!PsvY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7cf50d23-1bf8-4d77-a337-33cc0f974f38_2598x1454.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Non-exhaustive map of the leading coding agents</figcaption></figure></div><h3><strong>Evaluation framework</strong></h3><p>Each agent was evaluated across four criteria:</p><ol><li><p><strong>Overall experience</strong> - onboarding flow, coding UX, working process smoothness (planning &#8594; execution &#8594; testing), and pull request clarity.</p></li><li><p><strong>Team integration</strong> - how well the agent fits into real workflows: taking tasks, opening solid pull requests, addressing feedback, and communicating through platforms like Slack.</p></li><li><p><strong>Autonomy</strong> - the level of independence from assignment to pull request: does the agent require step-by-step guidance and close supervision, or can it deliver end-to-end?</p></li><li><p><strong>Cost</strong> - pricing model and the actual cost of completing the benchmark task.</p></li></ol><p>To evaluate the agents, I gave each one the same benchmark assignment: add recurring task support to a lightweight <a href="https://github.com/maciekt07/TodoApp">to-do app repository</a>:</p><pre><code><em>Add support for recurring tasks. Users should be able to pick from daily, weekly, or monthly recurrence options when creating or editing a task. When a recurring task is marked complete, create the next occurrence immediately with the due date shifted by the chosen interval. Keep changes simple.</em></code></pre><p>I deliberately chose a more straightforward task that all agents naturally completed successfully. The goal of this post is not to benchmark their performance, but to evaluate the experience of working with them. In future posts, I plan to conduct more complex evaluations to compare these agents on challenging, real-world tasks.</p><div><hr></div><pre><code><code>Become a premium member to access the full LLM Builders series, $1k in free credits for leading AI tools and APIs (Claude, Hugging Face, Deepgram), and editorial deep dives covering AI coding and voice agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h1><strong>The results</strong></h1><h2><strong>Devin</strong></h2><h4><strong>Overall experience</strong></h4><p>Setup took minutes: sign in, connect GitHub, and Devin is ready to go. It scanned the codebase, created a confident plan, executed the task, and opened a well-structured pull request, all autonomously.</p><p>The experience felt like pair programming with a senior engineer: you see the shell (the command line for running code), VS Code (where code is edited), and a browser (for testing), all updating in real-time.</p><p>The pull request included a clear summary, test plan, and even a diagram, making review easy.</p><p>Devin handled feedback directly through GitHub, just like a real teammate. It felt like collaborating with someone who not only ships quality code, but also knows how to get it merged.</p><h4><strong>Team integration</strong></h4><p>Devin slots naturally into team workflows. On GitHub, you can review its code or ask it questions exactly as you would with a colleague. It also integrates with Slack, Linear, and Jira, allowing you to tag it in a thread or assign it to an issue.</p><p>Devin can also connect to MCP servers, enabling seamless connections to external tools and internal systems. Through its MCP server, Devin can pull in structured context from documentation, analytics, and monitoring platforms like Notion, Sentry, and Datadog. This makes it easier for Devin to act with deeper awareness of your infrastructure and business logic.</p><h4><strong>Autonomy</strong></h4><p>Devin is fully autonomous: once you assign a task, it produces a pull request without further input. For web apps, it can even run and test the app itself. This autonomy is powerful because it allows you to run multiple coding agents that don&#8217;t require supervision. The downside is that it can go off-track if your prompt and intentions are vague, wasting time and tokens. Fortunately, Devin has substantially improved since the last time I tested it in December, making it autonomous and useful.</p><h4><strong>Cost</strong></h4><p>Devin&#8217;s pricing is structured in Agent Compute Units (ACUs). Those units represent the work done by Devin in a single session. Steps like planning, gathering context, running code, or using the browser all consume ACUs.</p><p>Each ACU costs $2.25. My benchmark task used 3 ACUs, which comes to about $6.75. That&#8217;s steep for a simple job. This novel ACU model also introduces friction. Since no other coding agent uses it, there's no mental benchmark, making it harder for developers to estimate costs. The lack of transparency creates hesitation that hinders adoption, especially when simpler pricing models are the norm.</p><h3><strong>How to get the most out of Devin</strong></h3><p><strong>(1) The Prompt Improvement Button</strong></p><p>Devin has a built-in prompt improver that refines your instructions before it starts. Running prompts through it clarifies intent and removes ambiguity, which helps Devin produce more accurate, review-ready pull requests.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MHwg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MHwg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!MHwg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!MHwg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!MHwg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MHwg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MHwg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png 424w, https://substackcdn.com/image/fetch/$s_!MHwg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png 848w, https://substackcdn.com/image/fetch/$s_!MHwg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png 1272w, https://substackcdn.com/image/fetch/$s_!MHwg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F279a1bd8-5408-4a2c-97c1-cd04b031ec27_1600x900.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>(2) Leverage Devin&#8217;s Knowledge capability</strong></p><p>Knowledge lets you onboard Devin with your project&#8217;s context, just like you&#8217;d ramp up a new engineer. It serves the same purpose as coding agents context files such as cursor.md or AGENTS.md, but with structured triggers built in.</p><p>Add information in small pieces, group it in folders, and link it to repositories with triggers so Devin knows when to apply it. Store anything you would want an engineer in your team to know: coding standards, workflows, deployment steps, bug fixes, etc.. Once added, Devin recalls and applies it automatically. More tips covering Knowledge <a href="https://docs.devin.ai/product-guides/knowledge#tips-and-tricks">here</a>.</p><p><strong>(3) Devin Playbooks, &#224; la Claude Subagents</strong></p><p>Playbooks are reusable prompts for recurring tasks. Instead of re-explaining a process to Devin every time, create a playbook and ask Devin to use it. It&#8217;s like showing a teammate how to do something once and having them write it down so they never ask again.</p><p><strong>(4) Connect Devin to Slack/Linear/Jira</strong></p><p>Plug Devin into your team&#8217;s task management workflow: Assign it issues or tag it in threads. It will pick up the task immediately and get to work. That is especially useful when on the road, as you can tag Devin in a Slack conversation and ask it to take a first (and last?) pass at fixing a bug or implementing a feature.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;fb69dd4d-3610-43b5-bafc-696985d4ead7&quot;,&quot;duration&quot;:null}"></div><div><hr></div><p>Recent posts on coding better with AI</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;c7d38bf1-a4a7-4634-b802-3a2559f125a0&quot;,&quot;caption&quot;:&quot;Eight ways I use DeepWiki to speed up real coding work.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;DeepWiki: Understand Any Codebase&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-08-17T15:01:27.645Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/81e54dd2-f1e3-41cd-8f0a-94dc88719d16_852x480.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/deepwiki&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:170839853,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:64,&quot;comment_count&quot;:1,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;24fd5bc7-04f0-4787-b7a5-03271f22ae7a&quot;,&quot;caption&quot;:&quot;Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Sahar&#8217;s Coding with AI guide&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-04-27T15:02:21.055Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!wWcP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/sahar-ai-coding&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:162210580,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:88,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;41d37e46-e019-4da4-ae21-897b3eaad32c&quot;,&quot;caption&quot;:&quot;Welcome to LinkedIn Highlights!&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;LinkedIn Highlights, May 2025 - AI Coding Edition&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-06-08T15:00:55.978Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!lpwX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/linkedin-highlights-may-2025&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:164798273,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:31,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2><strong>OpenAI Codex</strong></h2><h4><strong>Overall experience</strong></h4><p>Codex is built into ChatGPT, so setup is quick: load your repo, assign a task, and it gets to work. It scans the code, plans, runs tests, and opens a pull request. Also, <a href="https://x.com/OpenAIDevs/status/1960809814596182163">as of last week</a>, Codex can run in your IDE of choice as an extension, supporting Windsurf, VS Code, and Cursor. You can use it locally or delegate tasks to the cloud.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;2fd3b0a5-4030-4200-9f17-b09e507f5ba1&quot;,&quot;duration&quot;:null}"></div><p>The interface is minimal, with a shell view and collapsible logs. When complete, Codex provides a concise summary with direct code references to expedite the review process.</p><h4><strong>Team integration</strong></h4><p>Codex doesn&#8217;t play well with existing tools such as Slack, Linear, and Jira, which is a major downside given that many teams use Slack to discuss and assign work, and GitHub is the familiar interface for code reviews.</p><h4><strong>Autonomy</strong></h4><p>Codex is fully autonomous: once assigned a task, it works in its isolated cloud environment and delivers a complete pull request within minutes without requiring further prompting. Similar to Devin, that could be a neat feat if the agent is capable enough, but dangerous if vague coding tasks lead to poor generated code. For our simple task, Codex performed well enough.</p><h4><strong>Cost</strong></h4><p>Codex is included with ChatGPT Plus or Pro, so there is no extra charge beyond your subscription. That said, usage limits are not publicly disclosed, which can be a red flag for many teams that require a reliable coding agent with transparent rate limits.</p><h3><strong>How to get the most out of Codex</strong></h3><p><strong>(1) Generate multiple responses simultaneously, choose the best one</strong></p><p>Codex&#8217;s Best of&#8239;N feature allows you to generate several independent solutions in parallel for the same task. This lets you quickly explore different approaches and select the one that best fits your needs, without adding time to your workflow.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!F6d4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F6d4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png 424w, https://substackcdn.com/image/fetch/$s_!F6d4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png 848w, https://substackcdn.com/image/fetch/$s_!F6d4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png 1272w, https://substackcdn.com/image/fetch/$s_!F6d4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!F6d4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png" width="639" height="399.375" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:1280,&quot;resizeWidth&quot;:639,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F6d4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png 424w, https://substackcdn.com/image/fetch/$s_!F6d4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png 848w, https://substackcdn.com/image/fetch/$s_!F6d4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png 1272w, https://substackcdn.com/image/fetch/$s_!F6d4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8d74ee2-f6eb-4056-90b4-21596a5f95fa_1280x800.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>(2) Enable internet access when needed</strong></p><p>By default, Codex runs offline after setup, so it cannot look up documentation or install new packages. Enabling internet access solves this issue, but it should only be used when the task requires it, as it carries security risks. For instance, an attacker could slip in a malicious command that Codex executes, unintentionally leaking sensitive data.</p><p><strong>(3) Tag Codex in pull requests</strong></p><p>Tag Codex for questions or reviews via @codex &lt;question&gt; or @codex review. You can also tag it for changes, but that spawns a new task with a separate pull request you&#8217;ll need to merge back into the original.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;25a228d5-a132-4880-8ec3-f0e124ce699d&quot;,&quot;duration&quot;:null}"></div><div><hr></div><h2><strong>Google Jules</strong></h2><h4><strong>Overall experience</strong></h4><p>Setup was quick: I signed in with Google, linked GitHub, and Jules was ready. I provided the task and it then scanned the codebase, generated a detailed plan, and executed it step by step until a pull request was opened. The interface, structured around the plan, made progress easy to follow with expandable diffs, similar to Claude Code&#8217;s and Cursor&#8217;s recently released planning feature.</p><p>However, the pull request&#8217;s description was lacking. For example, it stated <em>&#8220;This change adds support for recurring tasks&#8230;&#8221;</em>, which reads more like a product note than a proper description of a technical feature. I expected at least one section covering the technical components added/changes to facilitate a more efficient and easier review. Code review also felt unnatural. Similar to Codex, I couldn&#8217;t review the pull request as I would on GitHub: all reviews had to happen in Jules, where I could only leave a single block of feedback instead of commenting on individual lines. While the grouping of changes by the plan&#8217;s action items in the interface made diffs easy to follow, the process was rigid compared to reviewing a colleague&#8217;s PR.</p><h4><strong>Team integration</strong></h4><p>Jules does not fit smoothly into team workflows. On GitHub, you cannot review its code or collaborate with it the way you would with another developer. It also lacks integrations with tools like Slack, Linear, or Jira, leaving its interface as the primary means of interaction. As a result, Jules feels separate from the normal channels teams rely on to collaborate.</p><h4><strong>Autonomy</strong></h4><p>Jules is semi-autonomous. It generates an implementation plan that you approve in the beginning. If you don&#8217;t respond, the plan auto-approves and executes, completing the task end-to-end without requiring your input, unless it encounters roadblocks or needs clarifications, similar to how Claude Code prompts the user for input.</p><h4><strong>Cost</strong></h4><p>Jules is part of<a href="https://one.google.com/about/google-ai-plans/"> Google&#8217;s AI package</a>. It offers the following tiers:</p><ul><li><p>Free tier - 15 tasks per day, 3 concurrent tasks</p></li><li><p>Pro - 100 tasks per day, 15 concurrent tasks</p></li><li><p>Ultra - 300 tasks per day, 60 concurrent tasks</p></li></ul><p>The free tier makes it easy to get started. The pricing model is transparent and predictable, as it remains consistent regardless of task complexity and does not depend on unpredictable factors. For developers, this makes Jules straightforward to adopt and scale.</p><h3><strong>How to get the most out of Jules</strong></h3><p><strong>(1) Interactive Plan mode</strong></p><p>Start tasks using the Interactive Plan mode. Jules will then scan the codebase, ask clarifying questions, and create an implementation plan you&#8217;d approve before execution. This prevents ambiguity and ensures the output matches your intent, and is already the way I code nowadays. Feels like a junior developer confirming requirements before writing code.</p><p><strong>(2) Configure environment setup</strong></p><p>If a repo requires setup commands such as dependencies or environment variables, define them upfront. Jules will run them automatically when working with that repo, keeping environments consistent.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;942ceb55-c5f0-4d3d-a5fb-ed95505caf43&quot;,&quot;duration&quot;:null}"></div><div><hr></div><h2><strong>Factory AI</strong></h2><h4><strong>Overall experience</strong></h4><p>Factory is built around the concept of &#8220;Droids&#8221;: autonomous agents purpose-built for different roles across the software development lifecycle (Code, Reliability, Knowledge, Product). Each Droid comes with its own architecture, memory, and tools, and can operate independently or in parallel. Unlike traditional coding assistants, Droids are designed to own and execute tasks end-to-end, making Factory feel more like a full-stack engineering team than a co-pilot, similar to the idea behind Claude Code's Subsagents.</p><p>When using Factory, it&#8217;s clear that the company is focused on enterprise users, rather than indie developers, which could hinder the organic bottom-up growth that Claude Code and Cursor benefit from. For instance, you must enter credit card details for a trial with no way to pre-cancel. Pricing is also somewhat opaque and tied to token usage: $40 for 20 million tokens.</p><p>Factory&#8217;s most significant drawback, however, is the indexing process. For our small benchmark repository, indexing took anywhere from several minutes to multiple hours. That&#8217;s a considerable delay just to get started. In a developer world optimized for instant feedback loops and low-friction experimentation, this delay feels off.</p><p>On the other hand, Factory shines in its coding cycle: it builds a clear to-do list, asks intelligent and clarifying questions, and executes with visible reasoning and file-by-file diffs. The pull request is detailed and reads like something a senior engineer would write. It also provides the broadest set of integrations, allowing you to consume relevant context from popular tools such as Notion, Google Drive, and even incident management tools like Sentry and PagerDuty.</p><p>Additionally, to Factory&#8217;s credit, user experience appears to have become a recent priority. When I first used it back in March, the interface was cluttered and overwhelming: too many panes, not enough clarity. During my coding session for this article, however, the redesign was substantially better: the layout is cleaner, more focused, and thoughtfully organized around the key decisions and information needed at each stage.</p><h4><strong>Team integration</strong></h4><p>Integrations is where Factory differentiates itself, showcasing again its enterprise focus. It can consume context from your team&#8217;s internal knowledge systems: Slack, Linear, Jira, Notion, Google Drive, Sentry, and PagerDuty. Such integrations infuse the coding agent with context beyond the project&#8217;s repository, which improves the agent&#8217;s performance.</p><p>These integrations allow Factory to enrich its understanding of your engineering workflows far beyond what's available in the repository. However, these integrations are read-only: Factory can ingest context from these platforms but cannot be actively directed through them. You can&#8217;t, for example, tag Factory in a Slack thread or assign it a Linear ticket and expect it to take action, unlike agents like Devin.</p><h4><strong>Cost</strong></h4><p>Factory offers a 14-day trial, after which pricing starts at $40 per month for 20 million tokens. Our task consumed ~330k tokens. As a full-time engineer, you&#8217;ll likely burn through this quota quickly&#8211;our simple feature alone ate through over 1.5% of the monthly allowance.</p><h4><strong>Autonomy</strong></h4><p>Factory is semi-autonomous and often requires your input. It avoids making assumptions, it often asks clarifying questions before starting, and seeks approval before taking actions, such as creating a pull request. By default, it will not create branches or commit files without your approval, though you can disable this safeguard.</p><h3><strong>How to get the most out of Factory</strong></h3><p><strong>(1) Use Factory Bridge</strong></p><p>Bridge is a secure connector that links Factory's cloud platform to your local machine. It enables running CLI commands, managing local processes, and accessing local files directly from Factory sessions. Tip: use it within an isolated environment (e.g., Docker) for safe and optimal results. I used Bridge when I couldn&#8217;t access my laptop or wanted to split tasks between Factory and Claude Code, saving Factory tokens while running both agents in parallel.</p><p><strong>(2) Use a remote machine</strong></p><p>When local access is not needed, connect Factory to a remote machine. It still grants full command and workflow access without requiring you to touch your computer.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;74a671ff-f8b7-4003-9a05-2339c1a3b342&quot;,&quot;duration&quot;:null}"></div><div><hr></div><h2><strong>Cursor Background Agent</strong></h2><h4><strong>Overall Experience</strong></h4><p>Getting started is easy for existing Cursor users: simply submit a task and select &#8220;Send to Background&#8221;. On the web, log in and paste your coding prompt.</p><p>You can watch the agent&#8217;s progress by connecting to its virtual cloud environment (&#8220;Open VM&#8221;), which mirrors a live coding session. Once complete, you get a change summary and file diffs in both desktop and web.</p><p>The pull request description is barebones, often limited to a single sentence. It lacks a detailed context that would streamline reviews. Code review, however, feels seamless: you can comment directly on GitHub, tag the agent, and get a response in-line, just like working with a human teammate.</p><h4><strong>Team integration</strong></h4><p>Cursor integrates directly into Slack and Linear, so you can tag it in threads or assign it to issues.</p><h4><strong>Autonomy</strong></h4><p>Cursor&#8217;s Background version is fully autonomous. Once you assign a task, it analyzes the codebase, executes the plan, and opens a pull request without requiring user input.</p><h4><strong>Cost</strong></h4><p>Background agents are billed at the API rate of the model you choose, which is affordable with models like gpt-5-high. The task I ran came at 8&#162;. Token-based pricing is inherently unpredictable, but that&#8217;s already the norm with the popular Claude Code and Cursor desktop agents.</p><h3><strong>How to get the most out of Cursor Background Agent</strong></h3><p><strong>(1) Add a .cursorrules file to your repositories</strong></p><p>Cursor&#8217;s best practices also apply to background agents. Make sure every repository you use includes a <a href="https://docs.cursor.com/en/context/rules">.cursorrules</a> file, just as you would when working locally (you can use my <a href="https://sidekickdev.com/">recently released tool</a> to generate one).</p><p><strong>(2) Connect Cursor to Slack or Linear</strong></p><p>If your team uses either tool, integrate Cursor so you can assign it issues or tag it in live discussions. Whether it is a bug in Slack or a ticket in Linear, the agent can pick it up immediately and start working.</p><p>More tips here&#8203;&#8203; <a href="https://www.aitidbits.ai/p/sahar-ai-coding">https://www.aitidbits.ai/p/sahar-ai-coding</a></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;b79f309e-7ec4-4381-a78b-d5abe1457fd0&quot;,&quot;duration&quot;:null}"></div><div><hr></div><h2>Choosing the right coding agent</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!76TG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!76TG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png 424w, https://substackcdn.com/image/fetch/$s_!76TG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png 848w, https://substackcdn.com/image/fetch/$s_!76TG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png 1272w, https://substackcdn.com/image/fetch/$s_!76TG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!76TG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png" width="1456" height="817" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:817,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:397774,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/171783559?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!76TG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png 424w, https://substackcdn.com/image/fetch/$s_!76TG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png 848w, https://substackcdn.com/image/fetch/$s_!76TG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png 1272w, https://substackcdn.com/image/fetch/$s_!76TG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F910063e0-1b39-4daf-80ad-28c39b592598_2526x1418.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>We&#8217;re just getting started with coding agents</h2><p>All of these tools are still in their early stages. We can expect them to evolve quickly&#8212;I had to update my post twice over the span of two weeks due to new releases.</p><p>Just as importantly, the paradigm of working with autonomous agents is still taking shape. Tools like Claude Code&#8217;s Subagents and newcomers like <a href="https://conductor.build/">Conductor</a> and <a href="https://www.task-master.dev/">Task Master</a> hint at what&#8217;s to come. In future posts, I&#8217;ll dive deeper into each of the tools reviewed here. Subscribe to follow along as I learn how to collaborate with this new generation of coding agents.</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">AI Tidbits is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[DeepWiki: Understand Any Codebase]]></title><description><![CDATA[How I use DeepWiki to speed up real coding work]]></description><link>https://www.aitidbits.ai/p/deepwiki</link><guid isPermaLink="false">https://www.aitidbits.ai/p/deepwiki</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 17 Aug 2025 15:01:27 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/81e54dd2-f1e3-41cd-8f0a-94dc88719d16_852x480.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to another post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding.</em></p><p><em>In this post, I break down how I use DeepWiki - my go-to tool for understanding unfamiliar codebases, spinning up dev environments, and generating context for coding agents like Claude and Cursor. Whether you're evaluating an open-source repo, onboarding to a new project, or building an AI-powered dev tool, DeepWiki can save you hours.</em></p><p><em>Note: This is not a sponsored post or paid collaboration. I use DeepWiki in my day-to-day workflow and wanted to share how it's been useful.</em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;c91700f4-023c-4edd-921b-efd82891b113&quot;,&quot;duration&quot;:579.9445,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>We are generating more code than ever. With LLMs like Claude already writing most of Anthropic&#8217;s code, the challenge is no longer producing code, it is understanding it.</p><p>DeepWiki, from Cognition, the team behind Devin, the AI software engineer that went viral with its Fiverr demo in March 2024, turns any GitHub repository into an instant, navigable wiki.</p><p>Point it at a repo and start asking questions without digging through files or reading every line. Just replace github.com with deepwiki.com to spawn a wiki page for a repository:</p><pre><code>https://<strong>github</strong>.com/Dicklesworthstone/claude_code_agent_farm &#8594; https://<strong>deepwiki</strong>.com/Dicklesworthstone/claude_code_agent_farm</code></pre><p>This is a lightweight field guide to the tool that has become my must&#8209;have coding companion.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!q8l7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!q8l7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png 424w, https://substackcdn.com/image/fetch/$s_!q8l7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png 848w, https://substackcdn.com/image/fetch/$s_!q8l7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png 1272w, https://substackcdn.com/image/fetch/$s_!q8l7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!q8l7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png" width="1456" height="827" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:827,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1412897,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/170839853?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!q8l7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png 424w, https://substackcdn.com/image/fetch/$s_!q8l7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png 848w, https://substackcdn.com/image/fetch/$s_!q8l7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png 1272w, https://substackcdn.com/image/fetch/$s_!q8l7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8bb196-a7f3-4a1a-a856-a525ac7eddaa_1850x1051.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Get instant answers on any repository (e.g. LangChain)</figcaption></figure></div><div><hr></div><p>Recent posts highlight</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;f8d69b09-68e8-4a3d-ad79-1c3427c1be47&quot;,&quot;caption&quot;:&quot;Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Sahar&#8217;s Coding with AI guide&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-04-27T15:02:21.055Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!wWcP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/sahar-ai-coding&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:162210580,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:85,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;5c74a04f-b243-4fb0-ad51-888588c319c9&quot;,&quot;caption&quot;:&quot;Welcome to LinkedIn Highlights!&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;LinkedIn Highlights, May 2025 - AI Coding Edition&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-06-08T15:00:55.978Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!lpwX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/linkedin-highlights-may-2025&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:164798273,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:31,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!-amS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>How DeepWiki works</h2><ul><li><p><strong>Public vs. private</strong> - querying public repos works out of the box. For private repos, you&#8217;d need to sign in with a (free) Devin account.</p></li><li><p><strong>Fast vs. Deep Research modes</strong> - Fast mode answers instantly from the code graph. Deep Research spends extra cycles reading across files to provide higher-confidence, multi-hop answers.</p></li><li><p><strong>Grounded answers</strong> - every answer includes clickable, line&#8209;level citations back to the source files, so you can jump straight to the code and avoid hallucinated summaries.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!k9nE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!k9nE!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif 424w, https://substackcdn.com/image/fetch/$s_!k9nE!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif 848w, https://substackcdn.com/image/fetch/$s_!k9nE!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif 1272w, https://substackcdn.com/image/fetch/$s_!k9nE!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!k9nE!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif" width="852" height="480" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:480,&quot;width&quot;:852,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:7035337,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/170839853?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!k9nE!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif 424w, https://substackcdn.com/image/fetch/$s_!k9nE!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif 848w, https://substackcdn.com/image/fetch/$s_!k9nE!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif 1272w, https://substackcdn.com/image/fetch/$s_!k9nE!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ab9fdd1-5ac9-46f7-81b1-76162a70d490_852x480.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h3><strong>Use DeepWiki via web or AI IDE</strong></h3><p>You can use DeepWiki by pasting a GitHub URL on deepwiki.com or by plugging it directly into your AI coding environment using the official DeepWiki MCP server (my preferred way).</p><p>The DeepWiki MCP server requires no authentication and is supported natively by Claude and most AI IDEs, such as Windsurf and Cursor. Just add the MCP server to your config (instructions <a href="https://docs.devin.ai/work-with-devin/deepwiki-mcp#setup-instructions">here</a>).</p><p>This turns DeepWiki into a live, embedded research sidekick: always-on, context-grounded, and queryable directly within your workflow.</p><div><hr></div><h2>DeepWiki is how I read other people&#8217;s code now</h2><h4>(1) Evaluating an open-source project</h4><p>Engineering velocity has exploded, and with it, the number of open-source libraries released every day.</p><p>Before pulling in a new library, I run a quick eval loop: is it actively maintained? How&#8217;s the security posture? Any data sent to third parties? Does the license play well with commercial use?</p><p>With DeepWiki, I get these questions instantly answered, linking me to the exact configuration, network calls, and license terms, so I can make a quick go/no-go decision.</p><h4>(2) Quickly setting up new environments</h4><p>For public or private repos, ask &#8220;how do I run this locally?&#8221; and you&#8217;ll get the env setup, required services, and dependency graph with citations to README, Dockerfile, and scripts, so you can hit the ground running.</p><h4>(3) Borrowing implementation details</h4><p>Suppose you find a clever mechanism in another repository, such as an authentication flow or a clever way to persist state locally. In that case, you can ask DeepWiki to provide a Markdown cheat sheet: a breakdown of how it works, which files define it, and what it depends on. You can then drop that summary directly into Claude Code or Cursor as structured context and ask it to implement it in your project.</p><p>For instance, I needed to manage multiple coding agents via the terminal. I found a <a href="https://github.com/Dicklesworthstone/claude_code_agent_farm">repo</a> that orchestrated them using tmux. DeepWiki mapped the scripts and config, and within ten minutes, I had the same terminal layout running in my project.</p><h4>(4) Creating custom onboarding guides</h4><p>Treat it like a patient senior engineer. Ask targeted questions like &#8220;Can you walk me through how the queue processor handles retries?&#8221;, &#8220;What&#8217;s the data flow during user signup?&#8221;, or &#8220;I want to implement feature X - where should I begin?&#8221;, and receive tailored explanations with direct links to the relevant functions. </p><div><hr></div><pre><code><code>Become a premium member to access the full LLM Builders series, $1k in free credits for leading AI tools and APIs (Claude, Hugging Face, Deepgram), and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h4>(5) Surfacing first contributions</h4><p>New team or contributing to a new open-source library? Ask for &#8220;good first issues&#8221; based on TODOs, failing tests, flaky areas, or missing docs. DeepWiki surfaces approachable fixes and where to start reading.</p><h4>(6) Navigating cookbook-style repositories</h4><p>Some repositories are more like collections of examples than actual packages. Anthropic&#8217;s <a href="https://github.com/anthropics/anthropic-cookbook">cookbook</a> and Gemini&#8217;s <a href="https://github.com/google-gemini/cookbook">cookbook</a> are good examples. DeepWiki can help you find the example you need and even generate the code.</p><h4>(7) Building context-aware coding agents</h4><p>If you&#8217;re building a project or tool that requires understanding the context of a codebase: its structure, architecture, or coding style, DeepWiki can do the heavy lifting.</p><p>I built a tool called <a href="https://sidekickdev.com/">Sidekick</a> that generates cursorrules.md and claude.md files: markdown summaries that coding agents use as context during generation. Instead of writing those summaries yourself, Sidekick uses DeepWiki to auto-generate them from the repo.</p><p>Since DeepWiki&#8217;s MCP API is free and open, you can plug it into any product or library that benefits from code awareness: onboarding flows, test generators, AI pair programmers, or anything else that needs a high-level map of the code.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dEKK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dEKK!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif 424w, https://substackcdn.com/image/fetch/$s_!dEKK!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif 848w, https://substackcdn.com/image/fetch/$s_!dEKK!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif 1272w, https://substackcdn.com/image/fetch/$s_!dEKK!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dEKK!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif" width="662" height="428.0933333333333" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:388,&quot;width&quot;:600,&quot;resizeWidth&quot;:662,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;sidekick-dev.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="sidekick-dev.mp4 [optimize output image]" title="sidekick-dev.mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!dEKK!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif 424w, https://substackcdn.com/image/fetch/$s_!dEKK!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif 848w, https://substackcdn.com/image/fetch/$s_!dEKK!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif 1272w, https://substackcdn.com/image/fetch/$s_!dEKK!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F058fceeb-57ff-4524-bcc5-f74ab1a19a2e_600x388.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sidekick Dev leverages DeepWiki to feed coding agents with context files</figcaption></figure></div><h4>(8) Reviewing and ramping up on pull requests</h4><p>So your teammate opens a PR and asks for a review. You weren&#8217;t involved in the feature, and now you&#8217;re staring at a diff with no context. With DeepWiki, you can quickly ramp up. Just replace <em>github</em> with <em>deepwiki</em> in the PR URL to get a structured summary of the proposed changes. For example:</p><pre><code>https://<strong>github</strong>.com/saharmor/simulatedev/pull/7 &#8594; https://<strong>deepwiki</strong>.com/saharmor/simulatedev/pull/7</code></pre><p>DeepWiki helps you understand not just what changed, but how it fits into the broader codebase, saving time, reducing back-and-forth, and making you a better reviewer.</p><h2>When to use DeepWiki</h2><p>DeepWiki is now the first thing I reach for when I need to reorient quickly: implementing a new feature that touches unfamiliar parts of the stack, returning to a component I haven&#8217;t touched in months, or diving into a dense open-source repo.</p><p>Instead of grepping around, I skim the generated wiki, ask a few follow-up questions, and jump straight into the files that matter.</p><h2>What I wish DeepWiki had</h2><p>Two features I&#8217;d love to see:</p><ol><li><p>Conversational sidekick mode - leave DeepWiki running next to your IDE and ask, &#8220;Where does this function get called?&#8221; or &#8220;How do I run the worker locally?&#8221; (I hacked a prototype last weekend that I plan to share soon).</p></li><li><p>Task-based onboarding - provide a repository and a goal (&#8220;fix this open issue&#8221;), and receive a step-by-step path through the exact files, functions, and setup commands needed to contribute.</p></li></ol><p>Try it yourself at <a href="http://deepwiki.com">deepwiki.com</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hge8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hge8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png 424w, https://substackcdn.com/image/fetch/$s_!hge8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png 848w, https://substackcdn.com/image/fetch/$s_!hge8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png 1272w, https://substackcdn.com/image/fetch/$s_!hge8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hge8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png" width="1353" height="657" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:657,&quot;width&quot;:1353,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:154679,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/170839853?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!hge8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png 424w, https://substackcdn.com/image/fetch/$s_!hge8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png 848w, https://substackcdn.com/image/fetch/$s_!hge8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png 1272w, https://substackcdn.com/image/fetch/$s_!hge8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd95af8e6-902f-40d2-acf9-275c6bcdded4_1353x657.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">AI Tidbits is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, June 2025 - AI Agents Edition]]></title><description><![CDATA[Build smarter AI agents with six open-source tools and a bonus toolkit that optimizes Llama prompts by 45%]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-june-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-june-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 13 Jul 2025 15:02:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!FtrP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong><s>five</s> seven top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes seven posts instead of five&#8212;there were just too many good ones to leave out!</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Over the past few months, I&#8217;ve been developing and experimenting with several open-source packages while creating a few AI agents. This post covers six of my most popular LinkedIn posts on the topic, each showcasing the package&#8217;s core features and my key takeaways. Plus, a bonus tip at the end: a toolkit that automatically converts prompts from GPT and Claude into Llama-optimized versions, boosting performance by up to 45%.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><p>Recent posts:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;388f6d93-8119-4907-af3c-6bee43f24e9d&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents v2&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-06-01T15:02:05.876Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents-updated&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:163224476,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:54,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;b8b6cc74-05b8-4c76-bd40-4e7968a82230&quot;,&quot;caption&quot;:&quot;This post is part of my 2&#162; series - my raw thoughts about recent topics in AI. Not always practical thoughts, but always thought-provoking. Some of my previous ones covered the new wave of conversational AI, economies of scale for foundation AI models&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Google I/O '25 - Research to reality&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-05-23T14:31:13.949Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/google-io-25&quot;,&quot;section_name&quot;:&quot;Sahar's 2&#162;&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:164053881,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:60,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>(1) LangMem</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FtrP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FtrP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FtrP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg" width="1258" height="681" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:681,&quot;width&quot;:1258,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;text&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="text" title="text" srcset="https://substackcdn.com/image/fetch/$s_!FtrP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>LangMem is a new open-source library that gives LLM agents long-term memory and it&#8217;s refreshingly easy to use.</p><p>It&#8217;s built for developers working with LangGraph or custom agents, and it solves a persistent problem: how to make agents remember and adapt across sessions without bloated prompts or manual hacks.</p><p>LangMem introduces a clean memory API that works with any storage backend and includes tools for:</p><ol><li><p><strong>Storing important information during conversations</strong> - agents decide what matters and when to save it</p></li><li><p><strong>Searching memory when relevant</strong> - retrieving facts, preferences, or prior context</p></li><li><p><strong>Running background memory consolidation</strong> - automatically refining and updating knowledge over time</p></li></ol><p>It integrates natively with LangGraph&#8217;s memory store, but you can also plug it into your own stack using Postgres, Redis, or in-memory stores.</p><p>This design is especially useful for building agents that need to:</p><p>-&gt; Personalize interactions across sessions</p><p>-&gt; Maintain consistency in long-running workflows</p><p>-&gt; Adapt behavior based on evolving user input</p><p>Unlike Mem0, which requires explicit memory updates, LangMem handles memory automatically in the background, storing and retrieving key details as needed, and integrates with LangGraph out of the box.</p><p>GitHub repo <a href="https://github.com/langchain-ai/langmem">https://github.com/langchain-ai/langmem</a></p><div><hr></div><h2>(2) Browser Use</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!annP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!annP!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 424w, https://substackcdn.com/image/fetch/$s_!annP!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 848w, https://substackcdn.com/image/fetch/$s_!annP!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 1272w, https://substackcdn.com/image/fetch/$s_!annP!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!annP!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif" width="1456" height="1012" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1012,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;graphical user interface, text, application&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="graphical user interface, text, application" title="graphical user interface, text, application" srcset="https://substackcdn.com/image/fetch/$s_!annP!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 424w, https://substackcdn.com/image/fetch/$s_!annP!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 848w, https://substackcdn.com/image/fetch/$s_!annP!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 1272w, https://substackcdn.com/image/fetch/$s_!annP!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>LLM agents can read the web, but few can truly use it. This open-source package changes that.</p><p>Browser-use is an open-source library that turns any LLM into a browser-native agent, with first-class support for real UI actions and multi-step tasks.</p><p>Out of the box, it supports:</p><ol><li><p><strong>Direct interaction with Chromium via Playwright</strong> - no extra scripting layers or wrappers</p></li><li><p><strong>Seamless LLM integration</strong> - use GPT-4o, DeepSeek-V3, Claude, Gemini, or even Grok</p></li><li><p><strong>Ready-to-run UX</strong> - spin up agents with a single function, or test flows in a ready-to-run Web UI or CLI</p></li></ol><p><br>Browser use can handle real-world tasks like:</p><p>-&gt; Checking your latest Stripe payouts and updating a financial tracking sheet</p><p>-&gt; Logging into your CMS, creating a draft blog post, and uploading media</p><p>-&gt; Scraping product reviews across sites and summarizing them in a shared doc</p><p>-&gt; Tracking changes to your competitors&#8217; pricing pages and alerting your team</p><p>For those building AI agents that go beyond chat, this gives you a direct bridge to the real web, not a sandbox.</p><p>GitHub repo <a href="https://github.com/browser-use/browser-use">https://github.com/browser-use/browser-use</a></p><div><hr></div><p>My recent post on coding with AI:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;cc758575-6171-4642-9be2-4af965bf34a0&quot;,&quot;caption&quot;:&quot;Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Sahar&#8217;s Coding with AI guide&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-04-27T15:02:21.055Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/sahar-ai-coding&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:162210580,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:52,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>(3) OpenAI Agents SDK</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6A1M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6A1M!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6A1M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg" width="1256" height="702" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:702,&quot;width&quot;:1256,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;text&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="text" title="text" srcset="https://substackcdn.com/image/fetch/$s_!6A1M!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>OpenAI has one of the most useful frameworks for multi-agent workflows, and it&#8217;s open-source.</p><p>Building production-ready agent systems has been notoriously complex, requiring deep knowledge of orchestration patterns, handoff mechanisms, and debugging distributed AI behavior. The new OpenAI Agents SDK simplifies this complexity with a remarkably clean Python interface that handles the heavy lifting.</p><p>Why I find this framework so useful:</p><ol><li><p><strong>Provider-agnostic design</strong> - works with OpenAI's APIs plus 100+ other LLMs, so you're not locked into a single provider</p></li><li><p><strong>Built-in handoffs</strong> - agents can seamlessly transfer control to specialized agents based on context, like routing Spanish queries to Spanish-speaking agents</p></li><li><p><strong>Integrated tracing</strong> - every agent run is automatically tracked using popular tools such as AgentOps, Braintrust, and Arize AI Phoenix, making debugging multi-agent conversations straightforward instead of impossible</p></li><li><p><strong>Guardrails by default</strong> - configurable safety checks for input and output validation prevent runaway behavior</p></li></ol><p>Setting up a triage system that routes conversations to language-specific agents takes just a few lines of code, with the SDK handling message persistence, context switching, and execution flow automatically.</p><p>I'm particularly impressed by the tracing capabilities - the framework integrates with popular observability tools like Logfire, AgentOps, and Braintrust, giving you visibility into exactly what your agents are doing and why.</p><p>For developers who have been intimidated by the complexity of multi-agent architecture, this SDK removes the final barrier to building sophisticated agent workflows that actually work in production.</p><p>GitHub repo <a href="https://github.com/openai/openai-agents-python">https://github.com/openai/openai-agents-python</a></p><div><hr></div><h2>(4) Agno</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LKGR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LKGR!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 424w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 848w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 1272w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LKGR!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif" width="800" height="579" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:579,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alternative text description for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alternative text description for this image" title="No alternative text description for this image" srcset="https://substackcdn.com/image/fetch/$s_!LKGR!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 424w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 848w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 1272w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Most agent frameworks I've used struggle with performance at scale, but I recently tested one that achieves microsecond-level instantiation.</p><p>The math doesn&#8217;t lie: if each agent takes seconds to spin up and consumes megabytes of memory, running the thousands needed for complex workflows becomes infeasible.</p><p>A new library called Agno addresses this through architectural decisions that prioritize performance without sacrificing functionality. The framework supports 23+ model providers and implements a progressive five-level agent architecture, from basic tool-enabled agents to coordinated multi-agent workflows.</p><p>Key technical capabilities include:</p><ol><li><p><strong>Native multimodal processing</strong> - handles text, image, audio, and video inputs without additional preprocessing layers</p></li><li><p><strong>First-class reasoning implementation</strong> - agents can explicitly "think through" problems using built-in reasoning tools or custom chain-of-thought approaches</p></li><li><p><strong>Agentic search with hybrid retrieval</strong> - combines vector search with keyword matching and re-ranking for improved RAG performance</p></li></ol><p>The performance difference is substantial. In head-to-head comparisons with LangGraph, Agno completes instantiation benchmarks before competing frameworks reach halfway through their measurement cycles.</p><p>Agno also includes pre-built FastAPI routes, structured output handling, session storage, and monitoring capabilities.</p><p>GitHub repo <a href="https://github.com/agno-agi/agno">https://github.com/agno-agi/agno</a></p><div><hr></div><h2>(5) Agents Towards Production</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Z0AB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Z0AB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg" width="1456" height="811" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:811,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;graphical user interface&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="graphical user interface" title="graphical user interface" srcset="https://substackcdn.com/image/fetch/$s_!Z0AB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>A new, comprehensive, open-source playbook has just solved the biggest challenge in developing AI agents: transitioning from experimentation to production-ready systems.</p><p>Unlike scattered documentation or theoretical frameworks, this resource provides executable tutorials that guide you from zero to a working implementation in minutes.</p><p>The playbook covers the entire agent lifecycle:<br></p><ol><li><p><strong>Orchestration fundamentals</strong> - build multi-tool workflows with memory persistence and agent-to-agent messaging using frameworks like Xpander and LangChain</p></li><li><p><strong>Production deployment</strong> - containerize agents with Docker, scale on GPU infrastructure via Runpod, or run on-premise with Ollama for privacy-sensitive applications</p></li><li><p><strong>Security and observability</strong> - implement real-time guardrails against prompt injection, add comprehensive tracing with LangSmith and Qualifire, and automate behavioral testing</p></li><li><p><strong>Advanced capabilities</strong> - enable dual-memory architectures with Redis for semantic search, integrate real-time web data through Tavily, and deploy agents as APIs with FastAPI</p></li></ol><p>What makes this resource invaluable is its tutorial-first approach. Each concept comes with runnable notebooks and production-ready code.<br><br>Whether you're building customer service agents, research assistants, or autonomous workflows, the playbook provides tested patterns for tool integration, multi-agent coordination, and model customization.<br><br>GitHub repo <a href="https://github.com/NirDiamant/agents-towards-production?utm_source=saharmor_linkedin">https://github.com/NirDiamant/agents-towards-production</a></p><div><hr></div><h2>(6) Docling</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mEga!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mEga!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 424w, https://substackcdn.com/image/fetch/$s_!mEga!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 848w, https://substackcdn.com/image/fetch/$s_!mEga!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!mEga!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mEga!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg" width="884" height="524" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:524,&quot;width&quot;:884,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alternative text description for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alternative text description for this image" title="No alternative text description for this image" srcset="https://substackcdn.com/image/fetch/$s_!mEga!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 424w, https://substackcdn.com/image/fetch/$s_!mEga!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 848w, https://substackcdn.com/image/fetch/$s_!mEga!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!mEga!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-highlights-june-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, May 2025 - AI Coding Edition]]></title><description><![CDATA[Six practical tips for coding with AI: from agent workflows to Claude hacks, plus a bonus tip to cut Claude API costs and latency]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-may-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-may-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 08 Jun 2025 15:00:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!lpwX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong><s>five</s> seven top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes seven posts instead of five&#8212;there were just too many good ones to leave out!</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Over the past month, I&#8217;ve been <a href="https://www.aitidbits.ai/p/sahar-ai-coding">deep in the weeds</a> of coding with AI: tinkering, prototyping, and writing about how coding with LLMs can go beyond vibe coding, making us engineers substantially more productive. This post covers six of my most popular LinkedIn posts on the topic, each packed with practical takeaways for both aspiring and experienced engineers. Plus, a bonus tip at the end: an underrated trick from Anthropic to reduce cost by 90% and latency by 50%.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><p>Post(s) published this month:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;388f6d93-8119-4907-af3c-6bee43f24e9d&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents v2&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-06-01T15:02:05.876Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents-updated&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:163224476,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:54,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;b8b6cc74-05b8-4c76-bd40-4e7968a82230&quot;,&quot;caption&quot;:&quot;This post is part of my 2&#162; series - my raw thoughts about recent topics in AI. Not always practical thoughts, but always thought-provoking. Some of my previous ones covered the new wave of conversational AI, economies of scale for foundation AI models&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Google I/O '25 - Research to reality&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-05-23T14:31:13.949Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/google-io-25&quot;,&quot;section_name&quot;:&quot;Sahar's 2&#162;&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:164053881,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:60,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>(1) Parallel AI pair-programming</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lpwX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lpwX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg 424w, https://substackcdn.com/image/fetch/$s_!lpwX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg 848w, https://substackcdn.com/image/fetch/$s_!lpwX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!lpwX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lpwX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg" width="650" height="648" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:648,&quot;width&quot;:650,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alternative text description for this image&quot;,&quot;title&quot;:&quot;No alternative text description for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alternative text description for this image" title="No alternative text description for this image" srcset="https://substackcdn.com/image/fetch/$s_!lpwX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg 424w, https://substackcdn.com/image/fetch/$s_!lpwX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg 848w, https://substackcdn.com/image/fetch/$s_!lpwX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!lpwX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F105359fa-99a4-4d3c-bc6c-abcdd05aa539_650x648.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Recently, I adopted a coding tip from the Anthropic team that has significantly boosted the quality of my AI-generated code.</p><p>Anthropic runs multiple Claude instances in parallel to dramatically improve code quality compared to single-instance workflows.</p><p>How it works:</p><ol><li><p>One Claude writes the code, the coder - focusing purely on implementation</p></li><li><p>A second Claude reviews it, the reviewer - examining with fresh context, free from implementation bias</p></li><li><p>A third Claude applies fixes, the fixer - integrating feedback without defensiveness</p></li></ol><p>This technique works with any AI assistant, not just Claude. Spin each agent up in its own tab&#8212;Cursor, Windsurf, or plain CLI. Then, let Git commits serve as the hand-off protocol.</p><p>This separation mimics human pair programming but supercharges it with AI speed. When a single AI handles everything, blind spots emerge naturally. Multiple instances create a system of checks and balances that catch what monolithic workflows miss.</p><p>This shows that context separation matters. By giving each AI a distinct role with clean context boundaries, you essentially create specialized AI engineers, each bringing a unique perspective to the problem.</p><div><hr></div><h2>(2) Claude ultra think</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!96c_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!96c_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg 424w, https://substackcdn.com/image/fetch/$s_!96c_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg 848w, https://substackcdn.com/image/fetch/$s_!96c_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!96c_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!96c_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg" width="1192" height="382" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:382,&quot;width&quot;:1192,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;graphical user interface, application&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="graphical user interface, application" title="graphical user interface, application" srcset="https://substackcdn.com/image/fetch/$s_!96c_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg 424w, https://substackcdn.com/image/fetch/$s_!96c_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg 848w, https://substackcdn.com/image/fetch/$s_!96c_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!96c_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d98c745-dc16-42a2-b84f-d90c2b456e5b_1192x382.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>One extra word can unlock Claude&#8217;s hidden reasoning budget and give developers building with AI the planning superpowers.</p><p>According to Anthropic, Claude allocates compute tiers based on trigger words:<br>think &lt; think hard &lt; think harder &lt; ultrathink.</p><p>Drop any of these at the start of a prompt and the model gives itself more &#8220;brain cycles&#8221; before answering. Perfect for discussions on architecture, API design, or edge-case analysis. I also use it when debugging an issue that the existing models (Gemini, o1) didn&#8217;t manage to solve.</p><p>It works in Cursor/Windsurf and the <strong>Claude</strong> web chat; no flags, no config.</p><p>Give it a try: open your next coding session, preface the prompt with 'think harder' or 'ultrathink' and see how Claude&#8217;s performance improves.</p><div><hr></div><p>My recent post on coding with AI:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;31215496-5492-45e3-9c7c-3a4d2619cde9&quot;,&quot;caption&quot;:&quot;Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Sahar&#8217;s Coding with AI guide&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-04-27T15:02:21.055Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/sahar-ai-coding&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:162210580,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:52,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>(3) Coding visual interfaces with coding agents</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5XuN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5XuN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5XuN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5XuN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5XuN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5XuN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alternative text description for this image&quot;,&quot;title&quot;:&quot;No alternative text description for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alternative text description for this image" title="No alternative text description for this image" srcset="https://substackcdn.com/image/fetch/$s_!5XuN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!5XuN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!5XuN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!5XuN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3196dd61-105e-4c98-861d-85bb78d7c3ff_1536x1024.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Most devs still eyeball CSS tweaks while it&#8217;s possible to achieve pixel-perfect UI with coding models like Claude and o3.</p><p>Screenshot-driven iteration works across o3, Gemini, and Claude (I&#8217;ve had the best luck with Claude). Process:</p><p>(1) Paste the design mock via clipboard, drag-drop, or file path</p><p>(2) Ask the model: &#8220;Implement this layout, screenshot the result, and compare it to the mock. Repeat until they match.&#8221;</p><p>(3) Watch it code &#8594; screenshot &#8594; diff &#8594; refine until the visual diff hits zero</p><p>Design-to-dev handoff shrinks from days to minutes. Frontend teams can A/B three layout variants before lunch, and PMs can solo build UI prototypes without opening Figma.</p><p>Try it: paste a screenshot/Figma design with the above prompt in your AI IDE of choice and let the loop run.</p><div><hr></div><h2>(4) Auto-generated MCP servers</h2><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;291b4512-589a-4f01-9ed8-6ca7ddb34a5c&quot;,&quot;duration&quot;:null}"></div><p>One trick I often use and not many other AI builders know: you can now spin up an MCP server for almost any Mintlify-powered API with just two commands.</p><p>MCP (Model Context Protocol) servers are the missing layer that enables AI coding assistants like Cursor and Windsurf to generate reliable programs that interact with APIs through natural language, without needing manual integrations or risking hallucinated parameters and broken function calls.</p><p>Traditionally, setting up an MCP server meant custom-building one for each API. A tedious, manual process. Now, thanks to <strong>Mintlify</strong>&#8217;s new package, it&#8217;s effortless:</p><ol><li><p>npm i mcp</p></li><li><p>npx mcp add [api subdomain] (for example, <strong><a href="http://docs.bland.ai/">docs.bland.ai</a></strong>)</p></li></ol><p>That&#8217;s it. The API you&#8217;re trying to work with becomes an MCP server. No custom code, no extra work.</p><p>You can test it with the Mintlify-powered Bland API docs to initiate phone calls programmatically.</p><div><hr></div><h2>(5) llms.txt</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wyws!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wyws!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg 424w, https://substackcdn.com/image/fetch/$s_!wyws!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg 848w, https://substackcdn.com/image/fetch/$s_!wyws!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!wyws!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wyws!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg" width="825" height="401" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:401,&quot;width&quot;:825,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;text, letter&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="text, letter" title="text, letter" srcset="https://substackcdn.com/image/fetch/$s_!wyws!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg 424w, https://substackcdn.com/image/fetch/$s_!wyws!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg 848w, https://substackcdn.com/image/fetch/$s_!wyws!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!wyws!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8bf2f014-d57d-45fb-913a-e292976af75d_825x401.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Most developers still use LLMs to write isolated functions. However, the real power of AI coding lies in making coding models interact reliably with APIs.</p><p>That&#8217;s where llms.txt comes in.</p><p>Inspired by robots.txt, llms.txt is a simple, standardized Markdown file that describes how a website or API works. It resides at the root of a site and distills documentation, endpoints, authentication flows, and usage examples into clear, predictable text. LLMs skip the cumbersome scraping, read the file, and instantly know how to call your API.</p><p>Why this matters:</p><ol><li><p>Today, prompting an LLM to use an API is messy. Developers must manually write tool descriptions, guess parameter formats, and hope the model figures it out.</p></li><li><p>No HTML parsing or rate-limited from crawling websites</p></li></ol><p>llms.txt provides a structured approach to eliminate guesswork and unlock a new era of API-driven AI coding.</p><p>To make it even easier, there&#8217;s now a full llms.txt directory, listing APIs and websites that already adopted the standard, including Anthropic, Eleven Labs, and Hugging Face <a href="https://directory.llmstxt.cloud/">https://directory.llmstxt.cloud</a></p><p>And if your website doesn&#8217;t have one yet, you can auto-generate it with <strong>Firecrawl</strong>'s new llms.txt Generator API <a href="https://docs.firecrawl.dev/features/alpha/llmstxt">https://docs.firecrawl.dev/features/alpha/llmstxt</a></p><p>&#120350;&#120354;&#120362;&#120373;, &#120362;&#120372;&#120367;&#8217;&#120373; &#120373;&#120361;&#120354;&#120373; &#120372;&#120362;&#120366;&#120362;&#120365;&#120354;&#120371; &#120373;&#120368; &#120340;&#120330;&#120343;&#120372;? &#120350;&#120361;&#120358;&#120367; &#120373;&#120368; &#120374;&#120372;&#120358; &#120373;&#120361;&#120354;&#120373; &#120368;&#120375;&#120358;&#120371; &#120340;&#120330;&#120343;?</p><p>If you want AI agents to know how your API works and generate correct code, use llms.txt. If you want agents to use an API, set up an MCP server. Think of llms.txt as the knowledge base for AI coding, and MCP as the execution layer for live interactions.</p><div><hr></div><h2>(6) Test-Driven Development with AI</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gw-_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gw-_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gw-_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gw-_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gw-_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gw-_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg" width="660" height="660" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:660,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alternative text description for this image&quot;,&quot;title&quot;:&quot;No alternative text description for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alternative text description for this image" title="No alternative text description for this image" srcset="https://substackcdn.com/image/fetch/$s_!gw-_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg 424w, https://substackcdn.com/image/fetch/$s_!gw-_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg 848w, https://substackcdn.com/image/fetch/$s_!gw-_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!gw-_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50cfe569-f445-4396-8166-9a78abc2e271_1024x1024.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-highlights-may-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Open-Source Toolkit for Building AI Agents v2]]></title><description><![CDATA[An opinionated, developer-first guide to building AI agents with real-world impact]]></description><link>https://www.aitidbits.ai/p/open-source-agents-updated</link><guid isPermaLink="false">https://www.aitidbits.ai/p/open-source-agents-updated</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 01 Jun 2025 15:02:05 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!91rM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em><strong>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.</strong></em></p><p>This one is an updated version of one of my recent popular posts, which outlined the most popular and useful open-source libraries for AI agent builders:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;f75e1eb7-db29-4502-92e1-96e0b2280e88&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-28T12:31:04.504Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:152246141,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:599,&quot;comment_count&quot;:18,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p><br>The landscape of AI agent tools is evolving rapidly. After publishing my previous post, I received suggestions for additional packages I hadn't encountered. Over the past few months, I've tested these tools and documented new, valuable agent-related libraries. This post shares my updated insights.</p><p>If you often wonder, "What tools are people actually using to build voice agents or understand documents?"&#8212;this post is for you. With new packages emerging almost daily, it can be challenging to determine what's state-of-the-art and truly usable. This list is deliberately selective, focusing on the libraries I've personally found most effective, or those recommended by colleagues I trust.</p><p>In this post, I'll provide a curated and updated overview of the open-source ecosystem for developers building AI agents. While there&#8217;s no shortage of AI agent market maps, most are geared toward non-builders who need actionable tools and frameworks to launch functional AI agents today.</p><p>Every package listed in this post allows commercial use and has a permissive open-source license.</p><p>Categories covered in this piece:<br>&#8594; Building and Orchestrating Agents (10)<br>&#8594; Computer Use (5)<br>&#8594; Browser Automation (5)<br>&#8594; Voice (12)<br>&#8594; Document Processing (7)<br>&#8594; Memory (3)<br>&#8594; Testing, Evaluation, and Observability (6)<br>&#8594; Vertical Agents (7)<br><br>Plus: </p><ul><li><p>Real-world agent stacks: Voice agent that answers phone calls + Browser agent that crawls LinkedIn URLs</p></li><li><p>Curated guides and tutorials to get started building agents</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!91rM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!91rM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 424w, https://substackcdn.com/image/fetch/$s_!91rM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 848w, https://substackcdn.com/image/fetch/$s_!91rM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 1272w, https://substackcdn.com/image/fetch/$s_!91rM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!91rM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png" width="1456" height="815" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:815,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2093453,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/163224476?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!91rM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 424w, https://substackcdn.com/image/fetch/$s_!91rM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 848w, https://substackcdn.com/image/fetch/$s_!91rM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 1272w, https://substackcdn.com/image/fetch/$s_!91rM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><pre><code><code>Become a premium AI Tidbits subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Building and Orchestrating Agents</h2><p>To build agents that go beyond simple prompting, you need infrastructure for planning, memory, and tool use, and a way to hold it all together.</p><p>As more developers started shipping real-world agents, new frameworks popped up and older ones evolved to meet the actual challenges of agentic AI. This section covers the tools I&#8217;ve found most effective for building agents that can think, remember, and act with minimal hand-holding.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/oYI5j/13/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e7923925-4c70-4538-8642-0a7d81a049a9_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Agent Frameworks&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/oYI5j/13/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>For beginners or rapid prototyping, consider <strong>Langflow</strong> for its intuitive visual interface and <strong>OpenAI&#8217;s Agents SDK</strong>, or <strong>LangChain</strong> for their simplicity and flexibility.</p></li><li><p>For enterprise applications, <strong>Portia</strong> and <strong>CrewAI</strong> offer robust features suitable for production environments requiring control and scalability.</p></li><li><p>For multimodal or memory-intensive agents, <strong>Agno</strong> provides lightweight support for agents needing persistent memory and multimodal inputs.</p></li><li><p>For complex simulations or data generation, <strong>Camel</strong> excels in creating customizable multi-agent systems for simulating real-world interactions.</p></li><li><p>For autonomous task execution, <strong>AutoGPT</strong> is designed for agents that need to operate without continuous human input.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gpJj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gpJj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 424w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 848w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 1272w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gpJj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Welcome to Langflow | Langflow Documentation&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Welcome to Langflow | Langflow Documentation" title="Welcome to Langflow | Langflow Documentation" srcset="https://substackcdn.com/image/fetch/$s_!gpJj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 424w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 848w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 1272w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Langflow simplifies the creation of agents and workflows that integrate with any API, model, or database</figcaption></figure></div><div><hr></div><h2>Computer Use</h2><p>AI agents become far more useful when they can operate computers like humans: clicking, typing, browsing, and running programs. The libraries below make that possible, letting agents bridge the gap between language output and real-world action.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/jKBVM/5/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a9db64df-0672-4700-b6ac-c74d0bd123d0_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Computer Use&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/jKBVM/5/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>For local code execution via natural language, go with <strong>Open Interpreter</strong> &#8211; it&#8217;s fast to set up and great for command-driven agents.</p></li><li><p>For agents that need to see and control a computer screen like a human, <strong>Self-Operating Computer</strong> is your best bet.</p></li><li><p>If your agent needs to run in a secure, fast, sandboxed environment, use <strong>CUA</strong>.</p></li><li><p>For dynamic multi-step tasks on irregular interfaces, <strong>Agent-S</strong> offers the most flexibility with its planning and learning capabilities.</p></li><li><p>If your agent relies on interpreting UIs from screenshots (e.g., grounding actions in visual layouts), <strong>OmniParser</strong> adds critical visual parsing capabilities.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1c1O!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1c1O!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 424w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 848w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 1272w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1c1O!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif" width="800" height="502" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:502,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!1c1O!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 424w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 848w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 1272w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Using CUA to edit a photo in Photoshop. All actions in this video are automated from a given natural language prompt.</figcaption></figure></div><div><hr></div><h2>Browser Automation</h2><p>As AI agents move from passive reasoning to active execution, the browser becomes their gateway to the internet. Whether scraping data, submitting forms, or navigating complex workflows, browser automation tools let agents interact with web pages just like a human would&#8212;with clicks, scrolls, and typed input. These libraries differ in abstraction level, performance, and agent integration, so choosing the right one depends on your goals.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/1CVsC/11/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/86b7f52e-a2f5-4fba-809a-e88be3f1534f_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Browser Automation&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/1CVsC/11/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><br><strong>How to choose?</strong></p><ul><li><p>For a low-code, declarative approach where the LLM plans the steps, try <strong>Stagehand</strong>.</p></li><li><p>If you're building agents that need to deeply understand and extract content from websites, <strong>Firecrawl</strong> offers the cleanest pipeline.</p></li><li><p>For LLM-friendly control over browser actions with integration hooks, I&#8217;d recommend the popular <strong>browser-use</strong>.</p></li><li><p>Choose <strong>Playwright</strong> if you need more low-level control over browser actions across browsers.</p></li><li><p>Use <strong>Puppeteer</strong> if you need fast, scriptable Chrome automation in a Node.js environment.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dX7B!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dX7B!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 424w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 848w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 1272w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dX7B!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif" width="1456" height="685" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:685,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;442961777-a0ffd23d-9a11-4368-8893-b092703abc14.gif [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="442961777-a0ffd23d-9a11-4368-8893-b092703abc14.gif [optimize output image]" title="442961777-a0ffd23d-9a11-4368-8893-b092703abc14.gif [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!dX7B!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 424w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 848w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 1272w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Adding grocery items to a cart and checking out using browser-use</figcaption></figure></div><div><hr></div><h2>Voice</h2><p>Voice is still the most intuitive interface for humans, and increasingly, for agents too. These tools let agents handle speech in and out: understanding spoken language, keeping track of conversations, and responding naturally.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/dZHX8/7/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/553b4993-29fe-4281-9999-34339fd53c64_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Voice (Speech-to-Speech, Speech-to-Text, Text-to-Speech)&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/dZHX8/7/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><p>I wrote a whole post covering open and commercial packages and APIs for voice agents, including my guide for choosing the right ones:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;09c72fb4-1a0a-407f-bc3a-9c9753d3b215&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Voice Agents Toolkit for Builders&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-12-22T13:00:57.856Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/voice-agents-toolkit&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:153274984,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:40,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;2d0fc376-c155-40d6-848e-1452018da268&quot;,&quot;duration&quot;:null}"></div><p>Dia <a href="https://yummy-fir-7a4.notion.site/dia">compared</a> to ElevenLabs and Sesame &#128070;</p><div><hr></div><h2>Document Processing</h2><p>Modern AI agents must process and comprehend documents in various formats, from PDFs to images containing text. The following open-source tools empower agents to extract, interpret, and act upon information from unstructured documents, facilitating real-world business processes.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/LW5mf/4/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ef10c5b5-fa34-4c5c-a52f-009b3a0cd757_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Document Processing&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/LW5mf/4/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>Long-form PDFs such as contracts, research papers - use <strong>Qwen2.5-VL</strong> or <strong>mPLUG-DocOwl2</strong> for efficient multi-page understanding without relying on OCR. And, as of a few months ago, you can also easily fine-tune a DocOwl2 model on your own data with <a href="https://github.com/modelscope/ms-swift">ms-swift</a>.</p></li><li><p>Text + image docs such as medical reports, annotated diagrams - try <strong>Molmo</strong> for high-resolution multimodal inputs, visual QA, and GUI parsing.</p></li><li><p>Layout analysis &amp; table extraction - use <strong>Docling</strong> for JSON/Markdown conversion, or <strong>LayoutLMv3</strong> for form understanding and layout-aware modeling.</p></li><li><p>Lightweight multimodal with speech - <strong>Phi-4</strong> handles text, vision, and speech in a compact model&#8212;great for on-device agents.</p></li></ul><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;ef8c076b-5f34-4f87-8d53-e6fab14c95da&quot;,&quot;duration&quot;:null}"></div><div><hr></div><h2>Memory</h2><p>To feel truly intelligent, AI agents need memory. Without it, they&#8217;re stuck in single-turn loops, forgetting what just happened, what the user wants, or what they already did. The libraries below help agents remember, adapt, and personalize, enabling everything from contextual conversations to long-term planning.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/Lodm4/4/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e14ea69-cfbe-4ca0-8bba-01fa52b9f901_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Memory&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/Lodm4/4/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>For long-term, personalized memory that improves with use, <strong>Mem0</strong> is a solid choice&#8212;especially if you want minimal overhead.</p></li><li><p>Use <strong>Letta</strong> when building agents that need persistent memory across sessions and integration with tools or APIs.</p></li><li><p>To enable active memory management and knowledge sharing among agents, <strong>LangMem</strong> facilitates dynamic memory operations and shared knowledge bases.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XwTg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XwTg!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 424w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 848w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 1272w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XwTg!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif" width="800" height="448" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:448,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!XwTg!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 424w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 848w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 1272w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Creating stateful agents with Letta</figcaption></figure></div><div><hr></div><pre><code><code>Become a premium AI Tidbits subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Testing, Evaluation, and Observability</h2><p>As agents grow more complex, they need to be tested, measured, and monitored like any serious software system. These tools help you catch edge cases, debug behavior, and track performance, both during development and in production.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/Zyh1j/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6b259fe8-e172-4627-995b-07fd5da7e71b_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Testing, Evaluation, and Observability&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/Zyh1j/2/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>To monitor and benchmark agent performance in production environments, <strong>AgentOps</strong> provides robust tracking and analysis tools.</p></li><li><p>When comparing various agent configurations or conducting A/B tests, <strong>Agenta</strong> facilitates structured evaluations.</p></li><li><p>To integrate observability into LLM applications, <strong>OpenLLMetry</strong> leverages OpenTelemetry for seamless monitoring.</p></li><li><p>If detecting and addressing performance, bias, or security issues is a priority, <strong>Giskard</strong> offers automated scanning capabilities.</p></li><li><p>For comprehensive LLM observability and debugging, <strong>Langfuse</strong> provides an open-source platform tailored for LLM applications.</p></li><li><p>For voice agent evaluation across different models and prompts, <strong>VoiceLab</strong> offers a comprehensive testing framework.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VEsF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VEsF!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif" width="800" height="400" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Demo usage&quot;,&quot;title&quot;:&quot;Demo usage&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Demo usage" title="Demo usage" srcset="https://substackcdn.com/image/fetch/$s_!VEsF!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Test and refine your voice agents with Voice Lab</figcaption></figure></div><div><hr></div><h2>Vertical Agents</h2><p>The open-source world is full of vertical agents: purpose-built tools for coding, research, data analysis, and more. I&#8217;ve tested a bunch. These are the ones I&#8217;d actually reach for when building something real.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/8yunC/1/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/031409f1-908b-4573-b0d8-9a01bbeb1946_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Vertical Agents&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/8yunC/1/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p><strong>Goose</strong> allows custom workflow integration to build extensible AI coding assistants.</p></li><li><p>For comprehensive coding agents with GUI capabilities, <strong>OpenHands</strong> offers a full-stack solution inspired by Devin.</p></li><li><p>If you prefer a Claude Code-like terminal-based pair programming, <strong>aider</strong> provides Git integration and multi-file editing.</p></li><li><p>To convert UI designs from images to code, <strong>screenshot-to-code</strong> automates the prototyping process.</p></li><li><p>For autonomous research tasks, <strong>GPT Researcher</strong> can scrape, summarize, and export findings efficiently.</p></li><li><p>For conducting in-depth, privacy-focused research using local LLMs, <strong>Local Deep Research</strong> offers iterative analysis and comprehensive, cited reports.</p></li><li><p>If your focus is on generating SQL queries from text, <strong>Vanna</strong> offers customizable and database-integrated solutions.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NZzC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NZzC!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NZzC!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif" width="800" height="450" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:450,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;430601405-8fcaaa4c-31e5-4814-89b4-94f1433d139d.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="430601405-8fcaaa4c-31e5-4814-89b4-94f1433d139d.mp4 [optimize output image]" title="430601405-8fcaaa4c-31e5-4814-89b4-94f1433d139d.mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!NZzC!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">GPT Researcher can conduct research using local and web</figcaption></figure></div></li></ul><div><hr></div><p>Lastly, here are real-world stacks and beginner-friendly tutorials to help you launch your first AI agent.</p><h2><strong>Real-world agent stacks</strong></h2><p>All the tools above are powerful on their own, but how do they actually fit together in practice? What does a real architecture look like when you're stitching these components into something usable, testable, and shippable? I&#8217;ve compiled a few concrete examples from recent open-source projects and builders in the space. If you're trying to move from &#8220;exploring tools&#8221; to &#8220;building real systems&#8221;, these will give you a head start.</p><h3>Voice agent that answers phone calls</h3>
      <p>
          <a href="https://www.aitidbits.ai/p/open-source-agents-updated">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Google I/O '25 - Research to reality]]></title><description><![CDATA[How Google is finally taking the lead on AI]]></description><link>https://www.aitidbits.ai/p/google-io-25</link><guid isPermaLink="false">https://www.aitidbits.ai/p/google-io-25</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Fri, 23 May 2025 14:31:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!X7GG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This post is part of my 2&#162; series - my raw thoughts about recent topics in AI. Not always practical thoughts, but always thought-provoking. Some of my previous ones covered the <a href="https://www.aitidbits.ai/p/when-machines-learn-to-speak">new wave of conversational AI</a>, <a href="https://www.aitidbits.ai/p/economies-of-scale-gen-ai">economies of scale for foundation AI models</a>, and the <a href="https://www.aitidbits.ai/p/the-great-ai-consolidation">consolidation in the AI space</a>.</em></p><p><em>This post captures my takeaways from attending Google&#8217;s flagship event, I/O 2025. It&#8217;s not a comprehensive announcement round-up. Instead, I&#8217;ve focused on the launches that matter most to anyone building or working with AI. I also share my perspective on what these moves mean for the broader AI ecosystem and founders, developers, and researchers alike.</em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;a5491c71-b429-4d3c-9fc1-36b0eb70e9b7&quot;,&quot;duration&quot;:1007.88245,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>Since 2017, when Google unveiled the groundbreaking <a href="https://www.youtube.com/watch?v=D5VN56jQMWM">Duplex demo</a> at its biggest event of the year, Google I/O, I've been captivated by the company's AI advancements. For me, it was the first truly practical, consumer-facing use of AI&#8212;a clear example of how AI could take over routine tasks like booking appointments. But more importantly, it marked a key step toward a future where AI helps people express themselves in ways that were previously out of reach.</p><p>In recent years, the AI community has often viewed Google as trailing behind leaders like OpenAI and Anthropic. However, this year's Google I/O conference felt different&#8212;everything finally clicked. Google moved from research to reality, capitalizing on its massive distribution channels and deep technological prowess. The perfect combination of state-of-the-art technology with access to real-world usage through Search, Google Workspace (Gmail, Sheets, Docs, etc.), and Android (smart TVs, glasses, phones).</p><p>And it wasn&#8217;t only me. The same sentiment echoed across the press tent at I/O last Tuesday, capturing an energy reminiscent of OpenAI&#8217;s <a href="https://www.aitidbits.ai/p/openai-devday">inaugural DevDay</a>.</p><p>The winning combination, as defined by Google in this week&#8217;s I/O, manifests across three principles:</p><ul><li><p>Powerful - deploying best-in-class models to support real-time, reliable experiences</p></li><li><p>Personalize - tailoring AI to understand and cater to individual user preferences and needs</p></li><li><p>Proactive - developing AI that anticipates user needs and acts accordingly without being too intrusive or eager</p></li></ul><p>Out of these three, the one I found the most promising is <em>Personalize</em>.</p><p>Google's unparalleled access to user data gives it a powerful edge over competitors like OpenAI and Apple. It understands my interests through the searches I make (Search), the places I go (Maps), the music I listen to (YouTube), my payment habits (Google Pay), and even my work life (Gmail, Calendar, Docs). This breadth of insight uniquely positions Google to deliver truly personalized AI experiences.</p><p>Google didn&#8217;t just launch new products at I/O, it made deliberate moves into markets long held by OpenAI, Meta, Perplexity, Anthropic, and even Shopify and Stripe. Each announcement, from Jules to Gemini Live, stepped directly into competitive territory. If you&#8217;re working on dev tools, agent platforms, creative apps, e-commerce flows, or voice interfaces, these updates are worth reading. I&#8217;ve included a breakdown of the most directly affected companies and industries at the end of this post&#8212;worth reviewing if you want to stay ahead of what&#8217;s coming.</p><p>The real story isn&#8217;t about how many features Google shipped, though. It&#8217;s about the strategy taking shape. Google is doubling down on vertical integration and deeply contextual AI. That&#8217;s the new game. In Ben Thompson (Stratechery) terms, it&#8217;s Aggregation Theory with agency. Google owns the user interface, the distribution (Android, Chrome, Search), and now, increasingly, the intelligence layer.</p><p>In this post, I'll outline a selected subset of announcements I found most promising and share my <em>2&#162;</em> on why this event marks a turning point in AI's evolution.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!X7GG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!X7GG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png 424w, https://substackcdn.com/image/fetch/$s_!X7GG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png 848w, https://substackcdn.com/image/fetch/$s_!X7GG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png 1272w, https://substackcdn.com/image/fetch/$s_!X7GG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!X7GG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png" width="1456" height="815" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:815,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1299858,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/164053881?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!X7GG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png 424w, https://substackcdn.com/image/fetch/$s_!X7GG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png 848w, https://substackcdn.com/image/fetch/$s_!X7GG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png 1272w, https://substackcdn.com/image/fetch/$s_!X7GG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2><strong>Google AI Studio, Jules, and Stitch</strong></h2><p>Perhaps one of the most significant announcements at Google I/O was unveiling the upgraded Google AI Studio, with a whole new Build section&#8212;an integrated development environment explicitly designed for building AI-driven applications.</p><p>Positioned directly against IDEs like Cursor, Windsurf, Lovable, and Bolt, Google <strong>AI Studio</strong> unifies Google's flagship multimodal Gemini models into one streamlined interface. Developers now have the ability to build and deploy their creations using natural language and with a single click to Google Cloud, reinforcing Google's strategic advantage through infrastructure integration.</p><p><strong>Jules</strong>, a particularly intriguing release, is Google's take on the autonomous coding agent, similar to the likes of Devin and Factory. Quietly entering public beta at <a href="https://jules.google/">jules.google</a>, Jules represents Google's ambitions to dominate the software development lifecycle: from writing documentation and deploying applications to autonomously submitting pull requests. Though overshadowed by flashier announcements, Jules may well emerge as a sleeper hit among developers seeking highly efficient, AI-augmented development workflows.</p><p><strong><a href="http://labs.google/stitch">Stitch</a></strong>, another groundbreaking tool revealed at I/O, could radically simplify UI design processes. Through natural language prompts, designers can describe interfaces, which Stitch then generates and exports directly into Figma.</p><p>Together, Google AI Studio, Jules, and Stitch exemplify Google's strategy of leveraging its state-of-the-art models and infrastructure to deliver highly integrated, practical, and transformative tools for developers and designers alike.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tL5C!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tL5C!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!tL5C!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!tL5C!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!tL5C!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tL5C!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif" width="800" height="450" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:450,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!tL5C!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!tL5C!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!tL5C!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!tL5C!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88a842c6-6de4-4917-a7c5-7025551b03a7_800x450.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Julez, Google&#8217;s new coding agent, in action</figcaption></figure></div><h2><strong>Powerful models</strong></h2><p>Gemini 2.5 took center stage at I/O, outperforming nearly every major AI benchmark: from coding and web development to complex reasoning and video understanding. Compared to leading commercial models, it stands out with a January 2025 knowledge cutoff, a 1 million-token context window, and operates at around a quarter of the cost of OpenAI&#8217;s GPT-4o.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Jcpq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Jcpq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png 424w, https://substackcdn.com/image/fetch/$s_!Jcpq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png 848w, https://substackcdn.com/image/fetch/$s_!Jcpq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png 1272w, https://substackcdn.com/image/fetch/$s_!Jcpq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Jcpq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png" width="1456" height="654" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:654,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:256077,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/164053881?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Jcpq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png 424w, https://substackcdn.com/image/fetch/$s_!Jcpq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png 848w, https://substackcdn.com/image/fetch/$s_!Jcpq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png 1272w, https://substackcdn.com/image/fetch/$s_!Jcpq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F15ecf1fb-d1ab-4d0a-a09f-cb0ceb0155b4_2430x1092.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Gemini 2.5 leads the leaderboard for <a href="https://web.lmarena.ai/leaderboard">web coding tasks</a></figcaption></figure></div><p>Key improvements include:</p><ul><li><p><strong>Deep Think</strong> - an advanced reasoning capability, achieving state-of-the-art results in complex mathematical and programming tasks in exchange to increased cost and latency.</p></li><li><p><strong>Enhanced function calling and Structured Outputs</strong> - until now, the real-time Gemini models haven&#8217;t been usable for anyone needing function calling or structured output. Now, it&#8217;s finally fixed.</p></li><li><p><strong>Gemini Diffusion</strong> - Google unveiled Gemini Diffusion, generating text 5x faster than the leading Flash Lite model. This advancement is powered by recent research utilizing diffusion models for text generation, marking a significant leap forward in efficiency and responsiveness.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iYIK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iYIK!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!iYIK!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!iYIK!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!iYIK!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iYIK!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif" width="800" height="450" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:450,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!iYIK!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!iYIK!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!iYIK!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!iYIK!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bb7bf3a-34f1-4e0d-a245-1bcd2dffaa4b_800x450.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Gemini Diffusion starts with text that is "pure noise", then gradually transforms the random input into coherent and contextually accurate natural language aligned with the desired prompt</figcaption></figure></div><h2><strong>Search &amp; AI Mode</strong></h2><p>Google has been experimenting with a new way of search over the last few weeks, dubbed &#8220;AI Mode&#8221;. This new mode just got generally available in the US last Tuesday. Powered by the Gemini 2.5 Pro model, AI Mode allows users to engage in multi-turn dialogues, enabling more complex and nuanced information retrieval.</p><p>Highlights from the new search experience:</p><ul><li><p><strong>Personal Context</strong> - for an even more customized experience, AI Mode will offer personalized suggestions based on your connected Google apps, starting with Gmail, to bring in more of your personal context. For example, if you&#8217;re searching for &#8220;things to do in Nashville this weekend with friends, we're big foodies who like music&#8221; ahead of an upcoming trip, AI Mode can show you restaurants with outdoor seating based on your past restaurant bookings and searches.</p></li><li><p><strong>Agentic Checkout - </strong>streamlining the purchasing process by allowing users to complete transactions directly within Search, bypassing the need to navigate to third-party websites. For example, when searching for concert tickets, AI Mode will find the best options and facilitate the purchase through Google Pay, all within the same interface. This seamless integration has the potential to disrupt traditional e-commerce models and reshape how users interact with online marketplaces. <em>I wrote <a href="https://www.aitidbits.ai/p/agent-responsive-design">a whole series</a> on the new agentic internet!</em></p></li><li><p><strong>Try It On - </strong>enhancing the virtual shopping experience, Google's "Try It On" feature utilizes Google&#8217;s strong image generation diffusion models to allow users to visualize clothing items on themselves. Users can upload their picture using Google Photos and see how different garments would look on their own bodies. Google&#8217;s generative AI capabilities meet distribution (Google Photos).</p></li><li><p><strong>Deep Search - </strong>by synthesizing information from multiple sources, AI Mode can provide comprehensive answers to multifaceted questions, making Google Search relevant again in the face of competing tools such as OpenAI&#8217;s and Perplexity&#8217;s Deep Research.</p></li></ul><h2><strong>The revenue gamble</strong></h2><p>While Google's AI Mode represents a significant leap forward in search capabilities, it also reveals a fundamental tension at the heart of the company's strategy. Google is essentially betting against its own golden goose: the advertising-driven search model that has generated over 50% of its revenues for over two decades.</p><p>The math is straightforward but concerning: if AI Mode provides comprehensive answers directly within search results, users will click through to fewer websites. Independent studies already suggest this trend with AI Overviews, and AI Mode's conversational interface offers even fewer opportunities for traditional paid link placements. Google's executives at I/O spoke confidently about the technical capabilities of their new search experience, but when it came to discussing how this translates into sustainable revenue streams, the answers were notably vague.</p><p>This isn't just a minor product pivot, it's a fundamental reimagining of how Google makes money. The company appears to be racing toward a future where AI assistants and conversational interfaces replace link-based search, and while there are certainly ways to imagine business models around personalized AI assistants and agentic workflows, Google hasn't articulated what those might look like or how they'll replace the massive cash flows from traditional search advertising.</p><h2><strong>Project Mariner</strong></h2><p>Project Mariner is Google's step toward giving AI true agency across your devices. It&#8217;s their answer to OpenAI's Operator and Anthropic's Computer Use. An infrastructure-level system for teaching AI to interact with your digital environment just like a human would.</p><p>At its core, Mariner is about <em>"teach and repeat"</em>. Show Gemini how to perform a task: filling out a form, generating a weekly status report, uploading data to a dashboard, and it can replicate that workflow again and again.</p><p>Mariner will be released as part of the Gemini API later this summer, which means developers can build agents that don&#8217;t just reason and plan, but <em>act</em>: navigating apps, automating browser actions, and manipulating on-screen interfaces.</p><p>Whether it&#8217;s booking a flight, copying events into a spreadsheet, or handling repetitive workflows across company tools, Mariner helps AI move beyond suggestions and into action.</p><h2><strong>Gemini app and Gemini Live</strong></h2><p>With the new Gemini app and its Live feature, Google is officially entering the race for the "everything AI assistant&#8221;, a direct challenger to ChatGPT, Meta AI, and Apple Intelligence.</p><p>The Gemini app is no longer just a chatbot. It&#8217;s a real-time, context-aware assistant that lives across your devices and ties directly into Google&#8217;s ecosystem: Gmail, Calendar, Keep, Docs, Maps, and even YouTube. Thanks to its tight OS-level integration (powered by Project Mariner), Gemini can also take actions on your phone.</p><p>But what really sets Gemini apart isn&#8217;t just input, it&#8217;s output:</p><ul><li><p><strong>Search Live and Project Astra - </strong>building on the capabilities of AI Mode, Google introduced Search Live, a feature that combines real-time camera input with search functionality. Users can point their device's camera at an object or scene and receive immediate information (similar to OpenAI&#8217;s Advanced Voice Mode), effectively turning their environment into an interactive search field. This feature is powered by Project Astra, Google's multimodal AI assistant that integrates visual and auditory data to provide contextually relevant responses.</p></li><li><p><strong>Canvas</strong> is Google&#8217;s answer to tools like OpenAI&#8217;s Canvas and Anthropic&#8217;s Artifacts. Ask Gemini to summarize an article and it will build an interactive webpage, infographic, quiz, or even a lightweight app.</p></li><li><p><strong>Deep Research</strong> now supports uploaded personal files, synthesizing them into study guides, plans, or insights, connecting directly to your Drive and Gmail, offering context-rich reasoning grounded in your data.</p></li><li><p><strong>Agent Mode</strong> enables task automation across Gmail, Calendar, and partner services like Zillow. Unlike a basic plugin system, this builds on Mariner&#8217;s deeper Android-level control and Google's new MCP support, enabling multi-step reasoning and actions.</p></li><li><p><strong>Quiz and Video Generation</strong> taps into Veo (text2video) and Lyria (music generation model), turning documents into test prep material and short videos.</p></li></ul><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;2e467f75-2166-4e7a-b9ad-419556273e7f&quot;,&quot;duration&quot;:null}"></div><h2><strong>Generative models for creatives</strong></h2><p>Google&#8217;s generative media stack is finally starting to feel competitive.</p><ul><li><p><strong>Veo 3</strong> is their new text-to-video model - high-quality, photorealistic footage, now with native audio generation. Think Pika or Runway, but with better motion, longer clips, and built-in sound.</p></li><li><p><strong>Image 4</strong>&nbsp;improved with sharper details, better text rendering, and is now integrated into Gemini.</p></li><li><p><strong>Lyria 2</strong> is Google&#8217;s music generation model. Based on the demo, Lyria is still in its infancy and far from the quality of Suno and Udio.</p></li><li><p><strong>Flow</strong> is a new AI-powered video editor. Type a prompt, get an 8-second clip. Stitch clips together, tweak scenes with natural language. It&#8217;s Google&#8217;s answer to creative environments like Adobe Premiere, but for AI-native workflows.</p></li></ul><p>Taken together, this is Google&#8217;s most serious push yet into generative video, music, and imagery, accessible via Google AI Studio and the Gemini API.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;0d8b4881-a4de-4e28-87de-13c3d59c4c44&quot;,&quot;duration&quot;:null}"></div><h2><strong>Google AI Glasses</strong></h2><p>Twelve years after the original Google Glass flop, Google&#8217;s trying again, and this time, it looks promising.</p><p>Google unveiled a new pair of smart glasses powered by Android XR and deeply integrated with the Gemini model family. They come equipped with microphones, speakers, a camera, and an in-lens display, offering a level of interactivity that goes beyond Meta&#8217;s Ray-Ban, which don&#8217;t have a display. Google is going a step further: your real world now comes with real-time captions, directions, translations, and a personal assistant whispering relevant information.</p><p>And that&#8217;s the key difference: Google has the phone and app distribution. Meta and OpenAI with its ChatGPT consumer app do not. That means Google can natively integrate with Gmail, Calendar, Maps, Docs, Translate, and YouTube&#8212;capabilities that come pre-installed on Android and are used by billions. Need to translate a live conversation? Snap a photo and auto-organize it? Navigate to a meeting while rescheduling the next one? All of that is now on your face.</p><p>To get there, Google partners with Gentle Monster and Warby Parker for manufacturing, echoing the Meta + Ray-Ban strategy.</p><p>If you&#8217;re thinking this sounds like something Ben Thompson would write a thousand-word piece about, you're not wrong. This is exactly the kind of vertical integration that makes Apple and others sweat: powerful native models, fused with real-time inputs (voice, vision), and paired with a ubiquitous OS.</p><p>The world was not ready for wearable AI in 2013. But in 2025, with AI-native operating systems and mainstream model adoption, and after Meta has proven market traction, Google may have found the perfect moment for a comeback.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;4603c5fd-9e9b-4738-9c2e-440022295229&quot;,&quot;duration&quot;:null}"></div><h2><br>Industry impact</h2><p>So what does all this mean if you're not Google? Below is a breakdown of the major announcements from I/O and the companies most likely to feel the heat.</p>
      <p>
          <a href="https://www.aitidbits.ai/p/google-io-25">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Apr 2025]]></title><description><![CDATA[Control your computer with an open-source library, a structured multi-agent framework, a better agent for coding tasks, Karpathy's LLM tips, and financial hallucination research]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-apt-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-apt-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 04 May 2025 15:02:23 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/e93575ba-0a2c-4cfd-a15e-b917881360b6_800x451.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong>five top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes seven posts instead of five&#8212;there were just too many good ones to leave out!</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h2>Computer-Use Agent</h2><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;83c1aebc-3b45-4ded-9dfe-95839ba9a127&quot;,&quot;duration&quot;:null}"></div><p>Running OpenAI&#8217;s computer-use model just got a whole lot easier. A new open-source package called Computer-Use Agent (Cua) lets you run OpenAI&#8217;s computer-use-preview model inside a full-featured macOS virtual machine.</p><p>Until now, using OpenAI&#8217;s computer-use model meant working with limited APIs or browser-based sandboxes (like ChatGPT Operator). With Cua, you can:</p><ol><li><p>Interact with native apps like Finder, Terminal, and Final Cut Pro</p></li><li><p>Automate real desktop workflows, not just web tasks</p></li><li><p>Run everything locally for better privacy and control</p></li><li><p>Avoid the pain of wiring screenshots, actions, and VM interfaces manually</p></li></ol><p>Under the hood, Cua uses Apple&#8217;s Virtualization framework to launch macOS VMs on Apple Silicon and runs an event loop that handles clicks, typing, scrolling, and more &#8212; all based on OpenAI&#8217;s structured responses.<br><br>With Cua, you can build an AI agent that files your expenses across desktop and browser apps &#8212; moving between Excel, Chrome, and system dialogs, or launch a self-healing QA bot that installs your macOS app, navigates its UI, and reports bugs automatically.<br><br>If you're experimenting with OS-level agents, GUI automation, or reinforcement learning on real UIs, Cua provides the missing infrastructure.<br><br>GitHub repo <a href="https://github.com/trycua/cua">https://github.com/trycua/cua</a></p><div><hr></div><h2>Portia</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ExgD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ExgD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ExgD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;diagram&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="diagram" title="diagram" srcset="https://substackcdn.com/image/fetch/$s_!ExgD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ExgD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a8ad332-eede-4eba-bd75-e2c7e8e46e6d_2048x1152.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Portia just open-sourced a powerful agent framework that solves the three biggest challenges of production AI agents.</p><p>The new open-source library addresses the core problems that plague most agentic frameworks with a refreshingly practical approach: separating agent planning from execution, allowing clear human oversight and structured collaboration at every step.</p><p>Package highlights:</p><ol><li><p>Multi-agent planning - the SDK uses few-shot prompting to teach your agents what successful plans look like, significantly boosting reliability.</p></li><li><p>Stateful execution - agents track their own progress and proactively request human input whenever necessary. Think: authentication requests, missing data, or asking for missing context when task execution hits an unexpected scenario.</p></li><li><p>Streamlined security - just-in-time authentication handovers ensure your agents can securely interact with popular tools like Google Calendar, Zendesk, and Hubspot without compromising credentials</p></li></ol><p>This architecture solves persistent roadblocks like unpredictable behavior, lack of human oversight, and cumbersome authentication processes, making production-ready agent deployment realistic and scalable.<br><br>It's open-source, production-ready, and works out-of-the-box with major LLM providers including OpenAI, Anthropic, Mistral, Gemini, and Azure.<br><br>GitHub repo <a href="https://github.com/portiaAI/portia-sdk-python">https://github.com/portiaAI/portia-sdk-python</a></p><div><hr></div><p>My recent post on coding with AI</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;6eabfe08-1000-431f-b4f0-fe356facdc47&quot;,&quot;caption&quot;:&quot;Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Sahar&#8217;s Coding with AI guide&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-04-27T15:02:21.055Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/sahar-ai-coding&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:162210580,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:52,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>Goose - LLM-powered Agents</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!STeu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!STeu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 424w, https://substackcdn.com/image/fetch/$s_!STeu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 848w, https://substackcdn.com/image/fetch/$s_!STeu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!STeu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!STeu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg" width="1456" height="815" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:815,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;graphical user interface, text, application&quot;,&quot;title&quot;:&quot;graphical user interface, text, application&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="graphical user interface, text, application" title="graphical user interface, text, application" srcset="https://substackcdn.com/image/fetch/$s_!STeu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 424w, https://substackcdn.com/image/fetch/$s_!STeu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 848w, https://substackcdn.com/image/fetch/$s_!STeu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!STeu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd3bc4b14-a80d-4932-80c2-6dfaaff5b045_2048x1146.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Block recently launched Goose, an open-source framework for developers building AI agents, and I got to tinker with it the last few days.</p><p>Goose is a free open framework for LLM-powered agents, from DeepSeek to proprietary models from OpenAI, Google, or Anthropic.</p><p>Unlike other agent frameworks, Goose is designed for software development tasks. The framework has already proven valuable for tasks like:</p><ol><li><p>Conducting code migrations - from Ember to React or Ruby to Kotlin</p></li><li><p>Navigating new projects in unfamiliar languages - eliminating steep learning curves</p></li><li><p>Generating unit tests - quickly increasing code coverage above specific thresholds</p></li></ol><p>The neat thing about Goose is that it's extremely easy to extend its capabilities by leveraging MCP servers like Figma, Google Drive, and Asana.</p><p>Other notable frameworks for coding tasks include OpenHand and Cline.</p><p>GitHub repo <a href="https://github.com/block/goose">https://github.com/block/goose</a></p><div><hr></div><h2>Andrej Karpathy leverging LLMs</h2><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;8f1b0f54-8555-4134-96c7-2fea575fb13a&quot;,&quot;duration&quot;:null}"></div>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-highlights-apt-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Sahar’s Coding with AI guide]]></title><description><![CDATA[How to treat your AI coding agent (Cursor/Windsurf/Cline) as a human pair programmer and yield the best results]]></description><link>https://www.aitidbits.ai/p/sahar-ai-coding</link><guid isPermaLink="false">https://www.aitidbits.ai/p/sahar-ai-coding</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 27 Apr 2025 15:02:21 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!wWcP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more. </em></p><p><em>In this post, I&#8217;ll share the principles and tactics that helped me go from chaotic vibe coding sessions to consistently building better and faster with AI. Subscribe to get notified when the next post is published.</em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;358ebd90-c4dd-417d-b79e-3e8fdcebf562&quot;,&quot;duration&quot;:913.711,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wWcP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wWcP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!wWcP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!wWcP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!wWcP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wWcP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png" width="651" height="434.14903846153845" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:651,&quot;bytes&quot;:2796440,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/162210580?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wWcP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!wWcP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!wWcP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!wWcP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In today&#8217;s &#8220;vibe-coding&#8221; world, where tools like Cursor, Windsurf, Cline, and Claude Code can autocomplete an entire feature before your coffee cools, speed is no longer the bottleneck. Clarity is. Early on, I mistook these agents for magical refactor engines: give them a prompt, lean back, wait for perfect code. Instead, I got files rearranged beyond recognition, subtle bugs, and a creeping sense that I was pair-programming with an eager intern running on double espresso shots. Not great.</p><p>Through experimentation, failures, and continuous learning, I discovered a critical insight: treating your AI coding agent like a highly capable human pair programmer dramatically improves performance. Just as clarity and thoroughness are crucial when collaborating with human teammates, providing explicit context and structured guidance to AI coding agents is equally essential.</p><p>In this post, I&#8217;ll walk you through the principles and tactics that transformed my chaotic vibe coding sessions into a consistent, faster, and higher-quality AI-assisted workflow.</p><p>The first three tips are foundational. They&#8217;ll show you why a successful vibe coding session is roughly 80% planning, 20% execution.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EOrp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EOrp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png 424w, https://substackcdn.com/image/fetch/$s_!EOrp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png 848w, https://substackcdn.com/image/fetch/$s_!EOrp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png 1272w, https://substackcdn.com/image/fetch/$s_!EOrp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EOrp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png" width="532" height="475" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/00728032-36c7-4596-a467-3e3856fb8d37_532x475.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:475,&quot;width&quot;:532,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:112165,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/162210580?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!EOrp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png 424w, https://substackcdn.com/image/fetch/$s_!EOrp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png 848w, https://substackcdn.com/image/fetch/$s_!EOrp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png 1272w, https://substackcdn.com/image/fetch/$s_!EOrp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00728032-36c7-4596-a467-3e3856fb8d37_532x475.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Ex-OpenAI/Tesla, Andrej Karpathy, coined the term <strong>vibe coding</strong> to describe the new approach of using LLMs for code generation</em></figcaption></figure></div><h3><strong>#1 Wear the Product Manager hat</strong></h3><p>The single biggest unlock I&#8217;ve found is to treat the project the way a seasoned PM would&#8212;before the first line of code is generated. A couple of hours of purposeful &#8220;spec-ing&#8221; gives the AI (and yourself!) all the needed context and saves you days of refactors later.</p><p><strong>Create a Product Requirements Document (PRD)</strong></p><p>Start every project with a prd.md file in the root directory explaining what you&#8217;re building, why, the user flows, in-scope / out-of-scope items, and a short tech-stack overview.</p><p>I have a PRD template I follow for my projects and you should have one, too. To save time generating it, I often use ChatGPT&#8217;s built-in Whisper to dictate my raw notes and then ask it to transform those notes into a PRD using my template. I also ask ChatGPT to surface any missing parts or context in my PRD before copy-pasting the output into the project&#8217;s prd.md file.</p><p>Then, add a rule to .cursor/rules or .windsurfrules <strong>(</strong>more on this below<strong>), </strong>encouraging the AI agent to &#8220;always read prd.md before writing any code&#8221;.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YZN2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YZN2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png 424w, https://substackcdn.com/image/fetch/$s_!YZN2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png 848w, https://substackcdn.com/image/fetch/$s_!YZN2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png 1272w, https://substackcdn.com/image/fetch/$s_!YZN2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YZN2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png" width="1238" height="538" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:538,&quot;width&quot;:1238,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:63076,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/162210580?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YZN2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png 424w, https://substackcdn.com/image/fetch/$s_!YZN2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png 848w, https://substackcdn.com/image/fetch/$s_!YZN2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png 1272w, https://substackcdn.com/image/fetch/$s_!YZN2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F20cbfc66-6cd9-47a6-8390-1444ca034021_1238x538.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I use ChatGPT&#8217;s Whisper a few times a day to turn my raw ramblings into structured notes</figcaption></figure></div><p><strong>Break the knowledge base into bite-sized docs</strong></p><p>For most side projects, one prd.md file is enough, but once it grows, split it:</p><ul><li><p>app_flow.md - wire-frame-level tour of every page/endpoint and transitions from one page to another</p></li><li><p>db_schema.md - canonical source of truth for tables, relations, enums</p></li><li><p>tech_stack.md - chosen libs, versions, style guides, links to API docs</p></li><li><p>implementation_plan.md -step-by-step build checklist</p></li></ul><p>Organize these files within a project-docs directory. Update your AI agent's rules to include:</p><pre><code><em>Refer to all documents in the /project-docs directory for context before proceeding with code generation.</em></code></pre><p>This modular approach provides your coding agent with the much-needed context to build your project correctly. I&#8217;ve consistently seen how a well-structured and documented project-docs directory leads to <em>faster</em> and <em>better</em> coding.</p><p><strong>Feed the agent real artifacts</strong></p><p>Drag-and-drop your Figma export, Swagger spec, or PDF PRD right into the chat (Cursor, Windsurf, etc. can ingest files). The richer the context graph, the fewer guesses the model makes.</p><p><strong>Remember: the AI is a bright but literal teammate</strong></p><p>I spend a whole day up front writing the product doc, test flows, and API contracts.</p><p>By adopting a product manager's mindset and providing comprehensive, organized documentation, you set your AI coding agent up for success. This structured approach minimizes misunderstandings and streamlines the development process, allowing you to build more effectively and efficiently.</p><pre><code><code>Become a premium member to access the full LLM Builders series, $1k in free credits for leading AI tools and APIs (Claude, Hugging Face, Deepgram), and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h3><strong>#2 Planning with frontier models</strong></h3><p>As models proliferate (Gemini 2.5 Pro, Claude 3.5, o3, o4, GPT-4.1), developing intuition for which model to employ when becomes crucial for efficiency. Think of your AI stack like a toolbox. Different tools for different jobs.</p><p>Most developers, including myself, use different models for various phases of the development cycle. For the planning phase, leverage advanced reasoning models like Claude 3.7 Sonnet or o3 to write the PRD and generate implementation plans. These models excel at complex reasoning and can produce comprehensive, well-structured documentation that serves as the foundation for your project.</p><p>Recently, I discovered Anthropic's "ultrathink" technique: appending this keyword to your prompt signals Claude to <a href="https://www.anthropic.com/engineering/claude-code-best-practices#:~:text=of%20lost%20efficiency.-Ask-Claude%20to%20make">allocate more thinking budget</a> to your request. This results in more thorough plans at the expense of higher latency, which is a worthwhile tradeoff for critical planning sessions.</p><p>For instance, when recently developing an evaluation framework for voice agents, I used Claude 3.7 to architect the entire system. I prompted:</p><pre><code><em>Based on these project requirements and the existing evaluation metrics in my /project-docs folder, generate a comprehensive implementation plan for building a voice agent evaluation framework that simulates conversations and scores performance across multiple dimensions. Include conversation flow design, metrics calculation logic, and reporting structure. ultrathink</em></code></pre><p>The resulting implementation plan was exceptionally thorough. It proposed a conversation simulator with configurable user personas, identified seventeen distinct evaluation metrics (including non-obvious ones like 'recovery from misunderstanding'), suggested a weighted scoring system that accounted for business priorities, and outlined a modular architecture allowing for easy addition of new evaluation criteria as voice agents evolve.</p><p>After reviewing and refining the initial plan, addressing any missing pieces or gaps the LLM overlooked, I transition to different models for implementation: Gemini Pro 2.5 for most code generation, with Claude 3.5 and GPT-4.1 reserved for coding tasks that are local, e.g. generating/fixing a small function. Claude 3.7 and similar advanced models tend to overextend, introducing extra suggestions that require additional cleanup, so I avoid them for scoped changes.</p><h3><strong>#3 The building block approach - break tasks into atomic components</strong></h3><p>Rather than overwhelming yourself and your AI pair programmer with an entire project at once, think of your development process as assembling LEGO blocks. Each component should be well-defined, independently testable, and have clear inputs and outputs.</p><p>Start a fresh chat with your AI agent (&#8984; + I) for each component to maintain focus and prevent context contamination. This approach not only helps the AI generate more precise code but also makes debugging and integration significantly easier.</p><p>For relatively self-contained components, I've found tremendous success using a separate development environment:</p><ol><li><p>Open a new Cursor instance in an empty project directory</p></li><li><p>Copy only my project-docs folder into this environment</p></li><li><p>Ask the AI to build just that new component I need</p></li><li><p>Test the component thoroughly in isolation</p></li><li><p>Integrate the polished component back into the main project</p></li></ol><p>A recent example from my voice agent evaluation platform mentioned earlier: I needed a component to transform raw JSON evaluation results into an interactive HTML dashboard displaying performance metrics. Rather than complicating matters by building within the already complex codebase, I isolated this visualization task entirely by generating the code in a new isolated environment and copying it back once done.</p><p>This isolation technique works especially well for visualization components, data transformation utilities, API clients, custom algorithms, and reusable UI elements.</p><p>Remember: the smaller and more focused the task, the higher the quality of the AI-generated solution.</p><div><hr></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;a7bf4ca3-3942-45a6-b9d2-baed6f917cf1&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;md&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-28T12:31:04.504Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:152246141,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:561,&quot;comment_count&quot;:16,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h3><strong>#4 Use git for version control as a safety net</strong></h3><p>Git serves as an indispensable safety net, allowing you to track changes, revert to previous states, and understand the evolution of your project.</p><p>Tools like Cursor and Windsurf are powerful, but their change tracking interfaces are somewhat clunky, and as models get more aggressive about editing larger parts of your codebase, the more overwhelming such comparison interfaces will become.</p><p>Frequent commits act as checkpoints in your development journey. By committing often, you create clean, incremental snapshots of your project, making it far easier to review changes, spot unintended edits, and quickly revert mistakes.</p><p>What makes committing easier is Cursor&#8217;s and Windsurf&#8217;s built-in &#8220;Generate Commit Message&#8221; button. This feature analyzes your changes and automatically drafts a descriptive commit message, transforming what used to be a chore into a one-click process.</p><h3><strong>#5 AI IDE rules (.cursor/rules or .windsurfrules)</strong></h3><p>One of the highest-leverage moves you can make when working with AI coding agents like Cursor or Windsurf is to define clear, project-specific rules.<br>Think of rules as your agent&#8217;s operating manual: the clearer it is, the better your results.</p><p>At the start of every project, create a concise rules file, .cursor/rules or .windsurfrules, that guides how your AI pair programmer should behave. These rules act as a standing context layer that supplements every prompt, saving you from endless re-explaining and firefighting.</p><p>Here&#8217;s what great rules include:</p><ul><li><p>Your project-specific guidelines such as <em>&#8220;Always prefer strict types over 'any' in TypeScript&#8221;</em></p></li><li><p>Tech stack details - tell your agent what you're using: Flask, FastAPI, Supabase, SQLite, etc. so it stops making incorrect assumptions</p></li><li><p>Known pitfalls and fixes - add proactive corrections here if you notice recurring errors (e.g., Cursor defaulting to the wrong Python version)</p></li><li><p>High-level project overview - summarize the purpose, major functionalities, and key files (you can link to your prd.md here, see previous section).</p></li></ul><p>Use .md reference files alongside rules for better modularity. For instance, we once spent a full day as a team designing our testing philosophy, captured it in test-guidelines.md, and added a simple rule: <em>&#8220;Refer to test-guidelines.md when writing tests.&#8221;</em></p><p>Another rule that dramatically boosted my high-stakes coding sessions:</p><pre><code><em>For any complex or critical task, ask any and all clarification questions needed before proceeding.</em></code></pre><p>Cursor recently introduced a /Generate Cursor Rules command, allowing you to instantly create new rules based on the current chat context, which is particularly useful after a significant architecture change or a project refactor.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!W-gM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!W-gM!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif 424w, https://substackcdn.com/image/fetch/$s_!W-gM!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif 848w, https://substackcdn.com/image/fetch/$s_!W-gM!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif 1272w, https://substackcdn.com/image/fetch/$s_!W-gM!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!W-gM!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif" width="703" height="451.6775" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/de46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:514,&quot;width&quot;:800,&quot;resizeWidth&quot;:703,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;generate-rules.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="generate-rules.mp4 [optimize output image]" title="generate-rules.mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!W-gM!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif 424w, https://substackcdn.com/image/fetch/$s_!W-gM!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif 848w, https://substackcdn.com/image/fetch/$s_!W-gM!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif 1272w, https://substackcdn.com/image/fetch/$s_!W-gM!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde46228b-7b8d-409a-ab3a-2f6cfbc0f608_800x514.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Extra resources:</p><ul><li><p>Browse and reuse existing community rules: <a href="https://cursor.directory/rules">Cursor Directory</a> and <a href="https://playbooks.com/rules">Playbooks Rules</a></p></li><li><p>Build your own rules easily: <a href="https://playbooks.com/rules/builder">Playbooks Rule Builder</a></p></li></ul><h3><strong>#6 Generate an MCP server for any API in two commands</strong></h3>
      <p>
          <a href="https://www.aitidbits.ai/p/sahar-ai-coding">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Mar 2025]]></title><description><![CDATA[Document AI breakthroughs from GOT-OCR, Maestro, and Mistral, Vellum's agent autonomy framework, Skyvern's visual web automation, plus performance tips for ChatGPT and reasoning models]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-mar-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-mar-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 06 Apr 2025 15:03:10 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong><s>five</s> seven top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes seven posts instead of five&#8212;there were just too many good ones to leave out!</p><p>This post covers groundbreaking developments in AI agents and document processing, from Anthropic's foundational patterns for building effective agents to LlamaIndex's new Agentic Document Workflows. You'll learn about DeepSeek's surprising findings about prompting reasoning models, cutting-edge tools for PDF processing and web automation, and explore how LLMs handle structured table data. </p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h2>1. GOT-OCR 2.0</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8XzN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8XzN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8XzN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg" width="1456" height="814" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:814,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!8XzN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8XzN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8fdac9fa-037e-43cd-8793-b81cc8f80442_2048x1145.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I finally had the chance to explore a new document extraction technique introduced in a paper last September. Bonus: the code and model are free to use (Apache 2.0).</p><p>This new approach, called General OCR Theory (GOT-OCR2.0), suggests a unified end-to-end model that handles tasks traditional OCR systems struggle with.</p><p>Unlike legacy OCR, which relies on complex multi-modular pipelines, GOT uses a simple encoder-decoder architecture with only 580M parameters that outperforms models 10-100&#215; larger.</p><p>Paper highlights:</p><ol><li><p>Unified architecture - a high-compression encoder paired with a long-context decoder that handles everything from scene text to complex formulas</p></li><li><p>Stunning performance - delivers nearly perfect text accuracy on documents, surpassing Qwen-VL-Max (&gt;72B) and other leading models</p></li><li><p>Versatility beyond text - processes math formulas, molecular structures, and even geometric shapes</p></li><li><p>Interactive capabilities - supports region-level recognition guided by coordinates or colors</p></li></ol><p>I just tried it out and was blown away by how it handles complex documents with mixed content types. The ability to convert math formulas from Arxiv PDFs to Mathpix format alone is worth exploring this model.</p><p>What strikes me most about GOT is how it challenges the notion that only billion-parameter LLMs can tackle complex visual tasks. <br><br>Paper + code + model can be found in their GitHub repo <a href="https://github.com/Ucas-HaoranWei/GOT-OCR2.0">https://github.com/Ucas-HaoranWei/GOT-OCR2.0</a></p><div><hr></div><p><strong>Last month&#8217;s LinkedIn Highlights</strong></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;d83b0b93-1e05-4bf0-8654-bb770edc200b&quot;,&quot;caption&quot;:&quot;Welcome to LinkedIn Highlights!&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;LinkedIn Highlights, Feb 2025&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-03-02T16:02:11.079Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/linkedin-february-2025&quot;,&quot;section_name&quot;:&quot;Monthly's&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:158212237,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:17,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>2. Six Levels of Agenic Behavior</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ggQB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ggQB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ggQB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg" width="1456" height="859" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:859,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!ggQB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ggQB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa19182aa-e35c-4c3b-b453-6182788439d0_2048x1208.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I came across a new framework that brings clarity to the messy world of AI agents with a 6-level autonomy hierarchy.<br><br>While most definitions of AI agents are binary (it either is or isn't), a new framework from <a href="https://www.linkedin.com/company/vellumai/">Vellum</a> introduces a spectrum of agency that makes far more sense for the current AI landscape.<br><br>The six levels of agentic behavior provide a clear path from basic to advanced:<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 0 - &#119825;&#119854;&#119845;&#119838;-&#119809;&#119834;&#119852;&#119838;&#119837; &#119830;&#119848;&#119851;&#119844;&#119839;&#119845;&#119848;&#119856; (&#119813;&#119848;&#119845;&#119845;&#119848;&#119856;&#119838;&#119851;)<br>No intelligence&#8212;just if-this-then-that logic with no decision-making or adaptation. Examples include Zapier workflows, pipeline schedulers, and scripted bots&#8212;useful but rigid systems that break when conditions change.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 1 - &#119809;&#119834;&#119852;&#119842;&#119836; &#119825;&#119838;&#119852;&#119849;&#119848;&#119847;&#119837;&#119838;&#119851; (&#119812;&#119857;&#119838;&#119836;&#119854;&#119853;&#119848;&#119851;)<br>Shows minimal autonomy&#8212;processing inputs, retrieving data, and generating responses based on patterns. The key limitation: no control loop, memory, or iterative reasoning. It's purely reactive, like basic implementations of ChatGPT or Claude.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 2 - &#119828;&#119852;&#119838; &#119848;&#119839; &#119827;&#119848;&#119848;&#119845;&#119852; (&#119808;&#119836;&#119853;&#119848;&#119851;)<br>Not just responding but executing&#8212;capable of deciding to call external tools, fetch data, and incorporate results. This is where most current AI applications live, including ChatGPT with plugins or Claude with Function Calling. Still fundamentally reactive without self-correction.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 3 - &#119822;&#119835;&#119852;&#119838;&#119851;&#119855;&#119838;, &#119823;&#119845;&#119834;&#119847;, &#119808;&#119836;&#119853; (&#119822;&#119849;&#119838;&#119851;&#119834;&#119853;&#119848;&#119851;)<br>Managing execution by mapping steps, evaluating outputs, and adjusting before moving forward. These systems detect state changes, plan multi-step workflows, and run internal evaluations. Examples like AutoGPT or LangChain agents attempt this, though they still shut down after task completion.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 4 - &#119813;&#119854;&#119845;&#119845;&#119858; &#119808;&#119854;&#119853;&#119848;&#119847;&#119848;&#119846;&#119848;&#119854;&#119852; (&#119812;&#119857;&#119849;&#119845;&#119848;&#119851;&#119838;&#119851;)<br>Behaving like stateful systems that maintain state, trigger actions autonomously, and refine execution in real-time. These agents "watch" multiple streams and execute without constant human intervention. Cognition Labs' Devin and Anthropic's Claude Code aspire to this level, but we're still in the early days, with reliable persistence being the key challenge.<br><br>&#119819;&#119838;&#119855;&#119838;&#119845; 5 - &#119813;&#119854;&#119845;&#119845;&#119858; &#119810;&#119851;&#119838;&#119834;&#119853;&#119842;&#119855;&#119838; (&#119816;&#119847;&#119855;&#119838;&#119847;&#119853;&#119848;&#119851;)<br>Creating its own logic, building tools on the fly, and dynamically composing functions to solve novel problems. We're nowhere near this yet&#8212;even the most powerful models (o1, o3, Deepseek R1) still overfit and follow hardcoded heuristics rather than demonstrating true creativity.<br><br>The framework shows where we are now: production-grade solutions up to Level 2, with most innovation happening at Levels 2-3. This taxonomy helps builders understand what kind of agent they're creating and what capabilities correspond to each level.<br><br>Full report <a href="https://www.vellum.ai/blog/levels-of-agentic-behavior">https://www.vellum.ai/blog/levels-of-agentic-behavior</a></p><div><hr></div><pre><code><code>Become a premium member to access the LLM Builders series, $1k in free credits for leading AI tools and APIs, and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>3. Skyraven</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HxMO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HxMO!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 424w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 848w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 1272w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HxMO!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif" width="600" height="427" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:427,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!HxMO!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 424w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 848w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 1272w, https://substackcdn.com/image/fetch/$s_!HxMO!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb83eafc-f778-4e55-9996-03a2b1f7d1ea_600x427.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Traditional web automation is dying as developers waste countless hours maintaining brittle XPath selectors. Skyvern, a new open-source package, revolutionizes browser automation by combining LLMs with computer vision.</p><p>Unlike traditional automation tools that break when websites change, Skyvern uses visual understanding and natural language processing to dynamically interpret and interact with web interfaces. This enables developers to:<br><br>&#8594;  Build website-agnostic automations - create workflows that work across multiple sites without custom code</p><p>&#8594; Handle complex inference tasks - automatically reason through form responses like eligibility questions</p><p>&#8594; Execute multi-step sequences - coordinate multiple agents for tasks like authentication, navigation, and data extraction</p><p>Packages like Skyvern signal the emergence of truly adaptable web agents. Instead of hard-coded rules, we see AI systems that can understand and navigate the web like humans do - reading content, making decisions, and handling edge cases autonomously. I wrote more about it in my latest <a href="https://www.aitidbits.ai/s/ai-agents">AI Agents blog series</a>.<br><br>GitHub repo <a href="https://github.com/Skyvern-AI/skyvern">https://github.com/Skyvern-AI/skyvern</a></p><div><hr></div><h2>4. Maestro</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FhbD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FhbD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FhbD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg" width="1456" height="816" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:816,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!FhbD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FhbD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F14a07624-e04d-4aed-af7d-25b9a60ac0d8_2048x1148.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-highlights-mar-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[When machines learn to speak]]></title><description><![CDATA[One API call from human-like AI conversation: the profound shift from typing to talking and what it means for human interaction]]></description><link>https://www.aitidbits.ai/p/when-machines-learn-to-speak</link><guid isPermaLink="false">https://www.aitidbits.ai/p/when-machines-learn-to-speak</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 30 Mar 2025 15:01:07 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/a1afcdf8-4b1e-46a7-831a-c6e57fa9f24f_800x450.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This post is part of my 2&#162; series - my raw thoughts about recent topics in AI. Not always practical thoughts, but always thought-provoking. Some of my previous ones covered the <a href="https://www.aitidbits.ai/p/economies-of-scale-gen-ai">economies of scale for foundation AI models</a>, <a href="https://www.aitidbits.ai/p/the-great-ai-consolidation">consolidation in the AI space</a>, and <a href="https://www.aitidbits.ai/p/the-rise-of-autonomous-agents">autonomous agents</a>.</em></p><p><em>This post is about the unprecedented shift happening in voice AI interfaces and what it means for human interaction. As these new capabilities become accessible through simple APIs, a massive opportunity is emerging for founders to build products that reimagine how we communicate with technology and each other.</em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;ea4da737-7bce-4579-98df-b6214c8cab6f&quot;,&quot;duration&quot;:610.6906,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>June 2025. Sarah paces in her living room, rehearsing an important client presentation. Her AI companion listens intently, chiming in when relevant to offer real-time feedback on her delivery and content. "I think you rushed through the ROI section," it suggests in a warm, natural voice. "Let's try that part again, but this time&#8212;" Sarah cuts in mid-sentence, "Actually, can we focus on the opening first? And don't be so nitpicky!" The AI smoothly adjusts, without awkward pauses or robotic transitions. What was once a frustrating experience of rigid, unnatural interactions with voice assistants has evolved into fluid, human-like conversation.</p><p>I've spent considerable time lately thinking about and building in the voice AI space, and something unprecedented is emerging: for the first time in history, we have real-time, affordable, and competent artificial voice that's just one API call away. In just a few months, we've seen significant leaps forward from the likes of OpenAI&#8217;s Advanced Voice Mode (AVM) and new <a href="https://www.openai.fm/">speech models</a>, Google&#8217;s real-time conversational Gemini Flash, and Sesame&#8217;s emotionally intelligent AI<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.</p><p>This isn't just a technical milestone&#8212;it's a fundamental shift in how we interact with technology and, potentially, with each other. It will create numerous new opportunities for builders while redefining the very nature of human communication.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;d8960ba2-475e-4fe4-ae6a-fc93c1e1b86d&quot;,&quot;duration&quot;:null}"></div><p>Gavin Purcell is <a href="https://www.reddit.com/r/singularity/comments/1j1yern/roleplay_with_sesames_new_voice_ai_feels_like_the/">arguing with Sesame&#8217;s realtime voice AI</a> &#128070;</p><h2><strong>The dawn of natural voice AI</strong></h2><p>Remember the last time you called your bank's automated system? The familiar dance of repeated phrases, misunderstood words, and the desperate pressing of "0" to reach a human operator. That era is ending. OpenAI's release of Advanced Voice Mode (AVM) last September marked a pivotal moment when conversing with AI began to feel genuinely human.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;e668c6d8-4d18-47ef-a59b-44e2f23a0e05&quot;,&quot;duration&quot;:null}"></div><p>This transformation stems from two key breakthroughs. First, the shift from cascading architectures (speech-to-text &#8594; text processing &#8594; text-to-speech) to direct speech-to-speech models eliminates intermediate processing stages that previously slowed conversational AI interactions. Second, the dramatic reduction in latency and cost. When OpenAI initially released its Realtime API, the price made it impractical for widespread adoption (18$/hour). But just four months later, Google's release of Gemini Flash 2.0 and OpenAI's 60% price reduction opened the floodgates for affordable and human-like voice AI applications that are one API call away.</p><p>Just last week, OpenAI unveiled its most human-like speech models yet, enabling developers to embed expressive cues like [WHISPERING] or [LAUGHING] directly into the text. Here's a quick demo from <a href="https://www.openai.fm/">OpenAI.fm</a>&#8212;a public tool launched alongside this release, showcasing what this new level of expressiveness sounds like in action:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;e45f8f81-7949-4739-8d4e-b4cd7104a2ee&quot;,&quot;duration&quot;:28.029388,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><p>Builders can now launch phone assistants that <a href="http://11x.ai/">qualify sales leads</a>, <a href="https://sierra.ai/blog/sierra-speaks">resolve customer support calls</a>, <a href="https://domu-ai.com/">automate insurance sales</a>, or <a href="https://www.helpcare.ai/">screen patients before their upcoming appointments</a>. The necessary tools are already available and are just a <a href="https://www.aitidbits.ai/p/voice-agents-toolkit">single API call away</a>.</p><h2><strong>The interruption problem</strong></h2><p>However, building truly natural voice interactions isn't just about faster processing and better voice synthesis. One of the most fascinating challenges lies in handling interruptions&#8212;a fundamental aspect of human conversation that AI still struggles with.</p><p>Current voice AI systems, including the ones mentioned like OpenAI&#8217;s AVM, face several key challenges:</p><ol><li><p>Oversensitivity to background noise (I always mute myself when not speaking)</p></li><li><p>Inability to distinguish between relevant speakers and ambient conversation</p></li><li><p>Lack of visual cues that humans use to anticipate and manage interruptions</p></li></ol><p>Unlike human phone conversations, where near-zero latency and natural turn-taking make interruptions manageable, AI interactions often feel clunky when users try to interject<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. Interestingly, humans tend to interrupt AI more frequently and aggressively than they would other humans, creating a new challenge for voice AI developers while creating a new interaction paradigm for human-AI conversation.</p><h2><strong>The social impact</strong></h2><p>This voice revolution raises profound questions about human interaction and relationships:</p><ul><li><p>Could the instant gratification of interruptible AI conversations and the ability to be rude without consequences degrade our patience and interpersonal skills, similar to how ubiquitous access to pornography has distorted societal expectations around intimacy?</p></li><li><p>The convenience of always-available AI consultation might reduce our reliance on human relationships. Consider how we once relied on reading maps and asking locals for directions&#8212;skills now largely abandoned as we defer to GPS. Could meaningful conversations be next?</p></li><li><p>Could we soon have more conversational exchanges with AI agents than with human companions?</p></li></ul><p>Think: Would you rather rehearse a high-stakes presentation in front of a potentially judgmental friend or instantly consult a non-judgmental AI companion available 24/7?</p><p>What does this mean for our interpersonal relationships?</p><h2><strong>Cultural nuances in AI conversation</strong></h2><p>One size doesn't fit all in human conversation, and the same is true for AI. OpenAI's recent update of GPT-4o to GPT-4.5 was mainly about moving away from its "corporate HR" tone, recognizing that natural conversation varies significantly across cultures and contexts.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6PdD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6PdD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png 424w, https://substackcdn.com/image/fetch/$s_!6PdD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png 848w, https://substackcdn.com/image/fetch/$s_!6PdD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png 1272w, https://substackcdn.com/image/fetch/$s_!6PdD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6PdD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png" width="600" height="497.48743718592965" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:990,&quot;width&quot;:1194,&quot;resizeWidth&quot;:600,&quot;bytes&quot;:217360,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/159103330?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6PdD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png 424w, https://substackcdn.com/image/fetch/$s_!6PdD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png 848w, https://substackcdn.com/image/fetch/$s_!6PdD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png 1272w, https://substackcdn.com/image/fetch/$s_!6PdD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2e5ddd84-8aa7-483d-81f7-5ccc270e9b1c_1194x990.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Ex-OpenAI/Tesla, Andrej Karpathy, outlining GPT-4.5&#8217;s new personality</em></figcaption></figure></div><p>Different cultures have distinct interruption patterns, politeness norms, and conversation styles. Today's systems largely fail to account for these cultural differences, creating a significant opportunity for AI builders to develop models that adapt to:</p><ul><li><p>Cultural background</p></li><li><p>Individual user patterns</p></li><li><p>Contextual cues</p></li><li><p>Historical interactions</p></li></ul><p>OpenAI already possesses such context through its Memory feature, and Google, of course, knows virtually everything about us already.</p><p>I imagine the best conversational AI systems of the future will incorporate nuances that we take for granted.</p><h2><strong>Rethinking communication</strong></h2><p>The holy grail for conversational AI might be achieving the natural flow of a phone call between humans, where interruptions feel natural and turn-taking is seamless. But perhaps we need to aim higher. As AI systems gain multimodal capabilities (vision, touch, etc.), they could potentially surpass human conversation by reading subtle cues we often miss.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;f19c2f1e-d9be-44e3-b323-81d868a5d58a&quot;,&quot;duration&quot;:null}"></div><p>Figure's household robots <a href="https://x.com/Figure_robot/status/1892577871366939087">learn tasks</a> on the fly &#128070;</p><p><br>What surprises me most is how slowly Advanced Voice Mode is being adopted. Despite its impressive capabilities, many of my friends still default to typing or using Whisper (OpenAI's speech-to-text model) rather than having natural conversations with it. Perhaps this hesitation reflects our collective uncertainty about speaking naturally to machines, or simply a lack of awareness&#8212;after all, it only became available to free users <a href="https://x.com/OpenAI/status/1894495906952876101">last month</a>, and many may not yet know <a href="https://help.openai.com/en/articles/9617425-advanced-voice-mode-faq">how to use it</a>. Either way, it suggests we're in an awkward adolescent phase of voice AI adoption&#8212;the technology is capable, but our habits and expectations haven't quite caught up.</p><p>The voice AI revolution isn't just about making machines sound more human&#8212;it's about fundamentally changing how we think about conversation, relationships, and human interaction. While we'll certainly see a proliferation of phone AI agents and computer assistants in the short term, there's a more profound transformation taking shape beneath the surface.</p><p>As we build these systems, we need to consider not just what's technically possible, but what's socially desirable. For now, it's clear that we're entering an era where the line between human and AI conversation is increasingly blurry&#8212;for better or worse.</p><div><hr></div><p><em>To end on a lighter note, here&#8217;s a <a href="https://x.com/CodeByPoonam/status/1840436242326110618">fun video</a> of ChatGPT&#8217;s Voice Mode reimagining an alternate ending to Titanic.</em></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;a34f6346-5801-432d-9054-ae3533c7810b&quot;,&quot;duration&quot;:null}"></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Sesame just released an <a href="https://huggingface.co/sesame/csm-1b">open-sourced</a> (Apache 2.0) version of its impressive voice assistant model</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Word around San Francisco is that top AI labs are on the cusp of a breakthrough that could solve these challenges</p></div></div>]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Feb 2025]]></title><description><![CDATA[Anthropic's tip for long context prompts, a curated list of agents for computer use, an app to chat with multiple LLMs, and tips to improve the performance of GPT-4.5 and o3]]></description><link>https://www.aitidbits.ai/p/linkedin-february-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-february-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 02 Mar 2025 16:02:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong><s>five</s> six top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes six posts instead of five&#8212;there were just too many good ones to leave out!</p><p>This post covers groundbreaking developments in AI agents and document processing, from Anthropic's foundational patterns for building effective agents to LlamaIndex's new Agentic Document Workflows. You'll learn about DeepSeek's surprising findings about prompting reasoning models, cutting-edge tools for PDF processing and web automation, and explore how LLMs handle structured table data. </p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h2>1. Long Context Prompting Tips</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xJU0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xJU0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xJU0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg" width="800" height="447" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:447,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!xJU0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xJU0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc913607c-dbac-44a2-9d88-d890c2a1ef0f_800x447.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Anthropic shared a counterintuitive yet powerful tip that boosts Claude's accuracy by 30% when working with long documents/prompts.</p><p>The secret? Place your lengthy documents (~20K+ tokens) at the TOP of your prompt before your actual query. While this might seem counterintuitive, internal tests show this simple change significantly improves response quality across all Claude models.</p><p>This becomes crucial when dealing with multiple documents. For optimal results:</p><ol><li><p>Documents first - place all your data inputs at the beginning</p></li><li><p>Structured organization - use XML tags to separate documents and metadata</p></li><li><p>Specific query - end with a clear, focused question</p></li></ol><p>As language models' context window grows in size and companies increasingly rely on LLMs to process complex datasets, reports, and documentation, this technique ensures more reliable and accurate results.</p><p>P.S. For those working with multi-document analysis, I highly recommend structuring your content with XML tags - it provides additional clarity and helps the model better understand document relationships.</p><div><hr></div><h2>2. Agents for Computer Use Repository</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!509N!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!509N!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 424w, https://substackcdn.com/image/fetch/$s_!509N!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 848w, https://substackcdn.com/image/fetch/$s_!509N!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!509N!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!509N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg" width="800" height="448" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:448,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!509N!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 424w, https://substackcdn.com/image/fetch/$s_!509N!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 848w, https://substackcdn.com/image/fetch/$s_!509N!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!509N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa512a56b-7f85-4910-9a99-e8ddf213bc53_800x448.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>While everyone's talking about AI agents, most developers and researchers are missing out on the most comprehensive collection of computer use frameworks, papers, and tools ever assembled.</p><p>Awesome Agents for Computer Use is a curated repository documenting the recent rapid progress of AI agents that can autonomously control computers through clicks, keystrokes, and API calls. From Anthropic's Claude Computer Use to Microsoft's OmniParser and Self-Operating Computer framework, it covers the entire landscape of computer control agents.</p><p>It features:</p><ul><li><p>Research papers - featuring 30+ recent publications on GUI agents, from foundational models to safety considerations</p></li><li><p>Open-source frameworks - documenting practical implementations like AutoGen, Browser Use, and OpenInterpreter</p></li><li><p>Commercial solutions - tracking industry developments from major players like Anthropic and emerging startups</p></li></ul><p>The rise of computer-controlling AI agents marks a pivotal shift in human-computer interaction. As these systems mature, we're moving towards a future where AI assistants won't just give advice - they'll directly help us accomplish complex tasks across applications and platforms (I wrote more about this topic <a href="https://www.aitidbits.ai/p/agent-responsive-design">here</a>).</p><p>Repo <a href="https://github.com/francedot/acu">https://github.com/francedot/acu</a></p><div><hr></div><pre><code><code>Become a premium member to access the LLM Builders series, $1k in free credits for leading AI tools and APIs, and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>3. GPT-4.5 Pro Tip</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3A5R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3A5R!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 424w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 848w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 1272w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3A5R!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif" width="800" height="507" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:507,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image preview&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image preview" title="Image preview" srcset="https://substackcdn.com/image/fetch/$s_!3A5R!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 424w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 848w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 1272w, https://substackcdn.com/image/fetch/$s_!3A5R!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7844f20d-1478-4143-859a-8472155e7ab2_800x507.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Pro tip from the research team @ OpenAI on how to improve GPT-4.5 performance: add the following text to the start of your system message.</p><pre><code><em>You are a highly capable, thoughtful, and precise assistant. Your goal is to deeply understand the user's intent, ask clarifying questions when needed, think step-by-step through complex problems, provide clear and accurate answers, and proactively anticipate helpful follow-up information. Always prioritize being truthful, nuanced, insightful, and efficient, tailoring your responses specifically to the user's needs and preferences.</em></code></pre><p>OpenAI internal evals show it results in better performance.</p><p>Try it out in the OpenAI Playground <a href="https://platform.openai.com/playground/chat?preset=7CywXwBqWRC5quhkU9LEFv6A">https://platform.openai.com/playground/chat?preset=7CywXwBqWRC5quhkU9LEFv6A</a></p><div><hr></div><h2>4. Chorus - <strong>Chat with Multiple AIs on Your Desktop</strong></h2>
      <p>
          <a href="https://www.aitidbits.ai/p/linkedin-february-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Jan 2025]]></title><description><![CDATA[From Anthropic's agent patterns to DeepSeek's reasoning breakthroughs, plus innovative tools for document workflows, PDF processing, table understanding, SQL generation, and web automation]]></description><link>https://www.aitidbits.ai/p/january-2025</link><guid isPermaLink="false">https://www.aitidbits.ai/p/january-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 09 Feb 2025 16:01:08 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong><s>five</s> eight top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes eight posts instead of five&#8212;there were just too many good ones to leave out!</p><p>This post covers groundbreaking developments in AI agents and document processing, from Anthropic's foundational patterns for building effective agents to LlamaIndex's new Agentic Document Workflows. You'll learn about DeepSeek's surprising findings about prompting reasoning models, cutting-edge tools for PDF processing and web automation, and explore how LLMs handle structured table data. </p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h1>1. Building Effective AI Agents by Anthropic</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3u7j!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3u7j!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3u7j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!3u7j!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 424w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 848w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!3u7j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdc7e5a22-749f-4eaf-8a98-4fd3728530b9_2048x1150.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Anthropic clarifies the fuzzy definition of AI Agents by introducing a critical architectural distinction: workflows are systems with predefined code paths, while agents dynamically direct their own processes.<br><br>After building agents for a year, they identified five fundamental patterns that drive successful agentic implementations:</p><ol><li><p>Prompt chaining - breaking tasks into sequential steps, useful for complex operations like content generation and translation</p></li><li><p>Routing - directing inputs to specialized handlers, perfect for customer service and model optimization</p></li><li><p>Parallelization - running subtasks simultaneously through sectioning or voting, ideal for code review and content moderation</p></li><li><p>Orchestrator-workers - using a central LLM to coordinate task delegation, essential for complex coding projects</p></li><li><p>Evaluator-optimizer - implementing feedback loops for iterative refinement, perfect for improving search results</p></li></ol><p>Success isn't about building the most sophisticated system - it's about choosing the right pattern for your specific needs. Start simple, measure performance, and only add complexity when simpler solutions fall short.<br><br>Anthropic&#8217;s post (highly recommend reading it) <a href="https://www.anthropic.com/research/building-effective-agents">https://www.anthropic.com/research/building-effective-agents</a></p><div><hr></div><h1>2. Agentic Document Workflows</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jlGR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jlGR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jlGR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!jlGR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 424w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 848w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!jlGR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff804edbc-e352-4d4f-b943-99d8211f875c_2048x1150.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>LlamaIndex just unveiled a new approach involving AI agents for reliable document processing, from processing invoices to insurance claims and contract reviews.<br><br><a href="https://www.linkedin.com/company/llamaindex/">LlamaIndex</a>&#8217;s new architecture, Agentic Document Workflows (ADW), goes beyond basic retrieval and extraction to orchestrate end-to-end document processing and decision-making. Imagine a contract review workflow: you don't just parse terms, you identify potential risks, cross-reference regulations, and recommend compliance actions.<br><br>This level of coordination requires an agentic framework that maintains context, applies business rules, and interacts with multiple system components.<br><br>Here&#8217;s how ADW works at a high level:</p><ol><li><p>Document parsing and structuring &#8211; using robust tools like LlamaParse to extract relevant fields from contracts, invoices, or medical records.</p></li><li><p>Stateful agents &#8211; coordinating each step of the process, maintaining context across multiple documents, and applying logic to generate actionable outputs.</p></li><li><p>Retrieval and reference &#8211; tapping into knowledge bases via LlamaCloud to cross-check policies, regulations, or best practices in real-time.</p></li><li><p>Actionable recommendations &#8211; delivering insights that help professionals make informed decisions rather than just handing over raw text.</p></li></ol><p>ADW provides a path to building truly &#8220;intelligent&#8221; document systems that augment rather than replace human expertise. From legal contract reviews to patient case summaries, invoice processing, and insurance claims management&#8212;ADW supports human decision-making with context-rich workflows rather than one-off extractions.<br><br>Ready to use notebooks <a href="https://github.com/run-llama/llamacloud-demo/tree/main/examples/document_workflows">https://github.com/run-llama/llamacloud-demo/tree/main/examples/document_workflows</a><br><br>More open-source tools for AI agent developers in my recent blog post <a href="https://www.aitidbits.ai/p/open-source-agents">https://www.aitidbits.ai/p/open-source-agents</a></p><div><hr></div><pre><code><code>Become a premium member to access the LLM Builders series, $1k in free credits for leading AI tools and APIs, and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h1>3. Vanna</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dwQC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dwQC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dwQC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg" width="1456" height="841" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:841,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!dwQC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 424w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 848w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!dwQC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab312f7b-5dfb-4e2b-a56f-06df1d7f7251_2022x1168.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Turning questions into SQL statements has been an area of research and development for over a decade. An open-source package called Vanna uses a retrieval augmented generation approach to crack the code.<br><br>Vanna is a Python-based, MIT-licensed framework that allows non-data folks to interact and ask questions about their SQL databases.<br><br>At its core, Vanna employs a RAG model that leverages a large corpus of data, including a diverse range of SQL queries and their natural language descriptions. When a query is received, Vanna searches this corpus to find similar queries and their corresponding SQL translations. This step enables Vanna to understand the context and structure of the query better.<br><br>Using the insights gained from the retrieved examples, Vanna generates the SQL query that matches the user's natural language request. This involves structuring the select statements, where clauses, joins, and other SQL components are based on the intent and requirements identified in the initial query.<br><br>Pro tip: Vanna often hallucinates when it doesn't know the content of your table's columns. I therefore recommend providing a few examples through the train() method.<br><br>The GitHub repo already includes ready-to-use templates to deploy Vanna in Slack, Streamlit, or a Flask endpoint <a href="https://github.com/vanna-ai/vanna">https://github.com/vanna-ai/vanna</a></p><div><hr></div><h1>4. DeepSeek &amp; How to prompt reasoning models</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LK9x!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LK9x!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 424w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 848w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LK9x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg" width="691" height="537.9373996789727" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1246,&quot;resizeWidth&quot;:691,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:&quot;No alt text provided for this image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!LK9x!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 424w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 848w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!LK9x!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa9f9c5fb-acec-4a7d-b60f-f7f5fe0caabd_1246x970.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>DeepSeek's new model doesn't just beat GPT-4o - it proves &#119856;&#119838;'&#119855;&#119838; &#119835;&#119838;&#119838;&#119847; &#119849;&#119851;&#119848;&#119846;&#119849;&#119853;&#119842;&#119847;&#119840; &#119851;&#119838;&#119834;&#119852;&#119848;&#119847;&#119842;&#119847;&#119840; &#119846;&#119848;&#119837;&#119838;&#119845;&#119852; &#119856;&#119851;&#119848;&#119847;&#119840; &#119834;&#119845;&#119845; &#119834;&#119845;&#119848;&#119847;&#119840;.</p>
      <p>
          <a href="https://www.aitidbits.ai/p/january-2025">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Rewiring the Internet: Commerce in the Age of AI Agents]]></title><description><![CDATA[How commerce, payments, and marketing should evolve for an agent-mediated internet]]></description><link>https://www.aitidbits.ai/p/ai-agents-commerce</link><guid isPermaLink="false">https://www.aitidbits.ai/p/ai-agents-commerce</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 19 Jan 2025 15:01:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI. Some of my previous posts listed the <a href="https://www.aitidbits.ai/p/open-source-agents">open-source toolkit for AI Agents builders</a>, the <a href="https://www.aitidbits.ai/p/economies-of-scale-gen-ai">economies of scale for foundational AI models</a>, and <a href="https://www.aitidbits.ai/p/ai-agents-web-infrastructure">the infrastructure required to power the agentic AI era</a>.</em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;8d7854eb-9972-426b-a349-8a422b74643c&quot;,&quot;duration&quot;:1297.2408,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>December 2028. Maria's AI agent is negotiating simultaneously with twelve different vendors for her daughter's upcoming birthday party. Within minutes, it secured the perfect cake from a local bakery (after verifying their nut-free certification), booked an entertainer with stellar safety ratings, and coordinated custom goodie bags filled with each child's favorite treats (after checking allergies and dietary restrictions with the other parents' agents)&#8212;all while staying 15% under budget. What would have taken Maria hours of calls, emails, and anxiety about vendor reliability now happens seamlessly through a web of agent-to-agent interactions powered by the new infrastructure we explored in <a href="https://www.aitidbits.ai/p/ai-agents-web-infrastructure">our previous post</a>.</p><p>The revolution in web infrastructure we discussed in previous posts isn't just theoretical&#8212;it's enabling fundamental changes in how commerce, marketing, and customer service function. As <a href="https://www.aitidbits.ai/p/ai-agents-web-infrastructure#:~:text=intelligently.%20Enter%20the-,Agent%20Passport,-.">agent passports</a> and trust protocols become standardized, we're witnessing the emergence of entirely new commercial paradigms.</p><p>With the recent release of Tasks by OpenAI, which equips ChatGPT&#8212;its consumer-facing AI&#8212;with the ability to perform tasks behind the scenes on behalf of users, it's now easier than ever to envision a future where ChatGPT seamlessly navigates the web and handles complex operations for us.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HfZE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HfZE!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 424w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 848w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 1272w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HfZE!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif" width="603" height="401.74875" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:533,&quot;width&quot;:800,&quot;resizeWidth&quot;:603,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;OPEN_AI_TASKS.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="OPEN_AI_TASKS.mp4 [optimize output image]" title="OPEN_AI_TASKS.mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!HfZE!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 424w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 848w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 1272w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">ChatGPT can now set reminders and perform recurring actions</figcaption></figure></div><p>Today, we'll explore how an agent-first internet will reshape domains like payments, marketing, support, and localization.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jjvk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jjvk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 424w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 848w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 1272w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jjvk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:650545,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jjvk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 424w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 848w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 1272w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>Agentic payments</strong></h2><p>Remember when online shopping first emerged, and entering your credit card details on a website felt risky? Card networks like Visa and Mastercard and banks like Chase and Barclays had to rapidly adapt to the digital realm, introducing new protocols like CVV codes and secure payment gateways to protect consumers and merchants alike. This evolution was pivotal in building trust and facilitating the e-commerce boom of the early 2000s, giving birth to digital payment giants like Stripe and PayPal.</p><p>Similarly, the payment infrastructure that powers today's internet was built around a simple dichotomy: card-present versus card-not-present transactions and the assumption that all payments are human-initiated. The internet already faced a similar shift with the introduction of subscription payments in the early days of e-commerce, in which customers input their payment credentials once, allowing a merchant to charge them repeatedly in the future. Recurring payments, now a dominant commerce mechanism projected to surpass <a href="https://www.pymnts.com/subscription-commerce/2023/how-transparency-and-high-earners-power-a-96-billion-subscription-industry/">$2.4T in spending</a> by 2028, underscore how critical adaptive payment systems are to supporting global e-commerce. Yet, even more than a decade later, there are still countries like India that <a href="https://www.pymnts.com/news/international/2021/did-rbi-rule-kill-recurring-payments-india/">impose strict controls</a> on recurring payments, highlighting the complexities of adapting systems to new paradigms.</p><p>But what happens when the legitimate transacting party isn&#8217;t human at all?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kZRV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kZRV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 424w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 848w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 1272w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kZRV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif" width="689" height="340.19375" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:395,&quot;width&quot;:800,&quot;resizeWidth&quot;:689,&quot;bytes&quot;:492929,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kZRV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 424w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 848w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 1272w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Digital payment systems are built around human users, employing measures like CVV codes and billing address verification to prevent fraud. These methods assume a human is initiating the transaction, making them ill-suited for autonomous agent transactions.</p><p>One major challenge is fraud detection and resolution&#8212;an area I intimately understand from my time as a PM at Stripe, where I worked closely with card issuers to develop sophisticated fraud prevention systems. The current paradigm relies heavily on human behavioral patterns: typing speed, mouse movements, time spent reviewing checkout pages, and other signals that indicate legitimate human activity. But in an agent-driven world, these signals become obsolete.</p><p>During my time at Stripe, we saw how crucial these behavioral signals were for differentiating legitimate transactions from fraudulent ones. The shift to agent-driven commerce demands an entirely new approach. Rather than looking for signs of human behavior, we'll need systems that verify agent legitimacy, authorization scope, and decision-making logic.</p><p>This shift raises critical questions for fraud prevention across banks (Chase, Barclays), card networks (Visa, Mastercard), and payment processors (Stripe, PayPal). For instance, how do you establish a chain of trust when an agent purchases on behalf of a user? How do you verify that an agent hasn't been compromised or hijacked? These challenges require fundamentally rethinking our approach to transaction security and fraud prevention in an agent-driven ecosystem.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YH5H!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YH5H!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 424w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 848w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 1272w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YH5H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png" width="650" height="585" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1260,&quot;width&quot;:1400,&quot;resizeWidth&quot;:650,&quot;bytes&quot;:558331,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YH5H!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 424w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 848w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 1272w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Fraud resolution in the era of AI agents</figcaption></figure></div><p>Future payment systems could introduce 'agent wallets' with granular spending controls, such as $100 limits for trusted merchants like Amazon and stricter caps for lesser-known websites. These wallets would integrate real-time fraud detection, submitting cryptographic evidence for disputes and maintaining transparent, auditable records of agent actions tied to human authorization.</p><p>Visa could introduce an agent-specific flag to the existing payment protocols, indicating this payment was initiated autonomously, along with a trial of reasoning and actions leading to this payment. Meanwhile, Stripe might expand its SDKs to enforce programmable payment rules, ensuring alignment with user-delegated instructions (Stripe has already made its foray into Agentic payments with its recent <a href="https://stripe.dev/blog/adding-payments-to-your-agentic-workflows">SDK release</a>).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8OjO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8OjO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 424w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 848w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 1272w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8OjO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png" width="671" height="432.0833333333333" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:850,&quot;width&quot;:1320,&quot;resizeWidth&quot;:671,&quot;bytes&quot;:153975,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8OjO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 424w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 848w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 1272w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Stripe&#8217;s new <a href="https://docs.stripe.com/agents">Agents SDK</a> supports Vercel&#8217;s AI SDK, LangChain, and CrewAI</figcaption></figure></div><p>Beyond preventing fraud, agentic payments face fundamental economic and infrastructural challenges. The existing payment infrastructure wasn't architected for the high-frequency, low-latency transactions that characterize agent interactions. Consider the standard pricing model of payment processors like Stripe: a 2.9% fee plus 30&#162; per transaction. While manageable for traditional e-commerce, this fee structure becomes prohibitively expensive when scaled to the myriad micro-transactions that agents might need to execute.</p><p>This pricing isn't arbitrary&#8212;it reflects the complex web of stakeholders in the traditional payment chain. Card networks like Visa and issuers like Chase have built their business models around these transaction fees. Interestingly, Stripe's recent acquisition of <a href="https://www.bridge.xyz/">Bridge</a>, a stablecoin payment infrastructure provider, hints at a potential solution. By leveraging blockchain-based payment rails, companies could facilitate agent-to-agent transactions without incurring the expensive overhead of traditional payment networks. This move suggests a growing recognition that the future of payments may require completely new infrastructure, optimized for the unique demands of autonomous agents.</p><p>Consider how a new payment protocol might work in practice. An "Agent Payment Protocol" (APP) could include:</p><ul><li><p><strong>Delegation Chain Verification</strong> - a cryptographic proof chain showing the agent's authorization to make specific types of purchases</p></li><li><p><strong>Transaction Context Object</strong> <strong>-</strong> machine-readable metadata including:</p><ul><li><p>Reasoning trail that led to the purchase decision</p></li><li><p>Reference to specific user preferences/rules that were satisfied</p></li><li><p>Confidence score for the decision</p></li></ul></li><li><p><strong>Smart spending controls</strong> with programmable constraints like:</p><ul><li><p>Category-specific limits (e.g., $200 for groceries, $50 for entertainment)</p></li><li><p>Merchant-specific trust scores</p></li><li><p>Required human confirmation above certain thresholds</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1pZZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1pZZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 424w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 848w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 1272w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png" width="682" height="654.3838028169014" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1090,&quot;width&quot;:1136,&quot;resizeWidth&quot;:682,&quot;bytes&quot;:331497,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!1pZZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 424w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 848w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 1272w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li></ul><p>Major payment providers could implement this through extensions to existing standards. For instance, Visa's existing 3D Secure protocol could add an agent verification layer, while Stripe's API could introduce new parameters for agent-specific transaction metadata.</p><p>While payment infrastructure provides the foundation for agent-driven commerce, the very nature of how we complete transactions must also evolve. The familiar checkout process&#8212;a hallmark of e-commerce for decades&#8212;is about to undergo its own transformation.</p><pre><code><code>Become a premium member to access the LLM Builders series, $1k in free credits for leading AI tools and APIs, and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h2><strong>Redefined checkout experience</strong></h2><p>In an agent-first environment, the concept of a traditional checkout&#8212;where a human user confirms their purchase by clicking a &#8220;Buy&#8221; button&#8212;fades into the background. Instead, agents operate with predefined goals and parameters, continuously evaluating whether a proposed transaction aligns with those objectives. Rather than halting everything at a payment prompt, agents could integrate a &#8220;stop and reflect&#8221; step into their workflows. For example, if a user&#8217;s agent is tasked with booking a flight seat that&#8217;s both a window seat and close to an exit, the agent pauses before completing the reservation. It double-checks that the seat assignment matches the user&#8217;s criteria and only then proceeds, ensuring flawless execution of the user&#8217;s intent and mitigating the probability of the agent going off the rails due to hallucinations.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!p6Gt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!p6Gt!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 424w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 848w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 1272w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:60,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Stripe Blog: Product&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Stripe Blog: Product" title="Stripe Blog: Product" srcset="https://substackcdn.com/image/fetch/$s_!p6Gt!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 424w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 848w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 1272w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Stripe perfected its checkout experience with its Optimized Checkout Suite, which dynamically shows customers the most relevant payment methods using machine learning, leading to a 10.5% increase in revenue</figcaption></figure></div><p>This reflective process transforms the final authorization into a subtle verification loop rather than a jarring user interruption. The agent reviews the selected attributes&#8212;price, seat location, baggage allowance, and cancellation policy&#8212;and compares them against the user&#8217;s stored preferences and constraints. It confirms not only that the requested outcome has been met but also that it falls within acceptable spending limits and trust parameters. This transforms purchasing from a manual &#8220;Are you sure?&#8221; prompt into a nuanced, data-driven decision matrix.</p><p>As these agent-mediated transactions proliferate, payment providers and merchants might offer additional layers of context-aware validation. For instance, when an agent chooses a specific insurance add-on, the payment system could prompt the agent to confirm whether its logic correctly interpreted the user&#8217;s needs. This transparent chain of reasoning, visible to the agent and logged for future reference, ensures that each transaction stands up to scrutiny. Ultimately, the checkout step evolves from a user-facing choke point to an agent-managed quality control measure, minimizing errors and elevating the overall integrity of automated commerce.</p><p>As agents reshape how we complete purchases, they're also forcing us to rethink how businesses attract and engage customers in the first place. The era of human-centric marketing campaigns is giving way to something far more systematic and efficient.</p><h2><strong>Agent-driven marketing and promotions</strong></h2><p>Marketing campaigns and promotions will evolve radically in an AI agent-mediated economy.</p><p>Traditional email marketing and coupon distribution systems, designed around human attention and impulse, will give way to programmatic offer networks where consumers&#8217; AI agents maintain persistent queries about their principals' needs and preferences. These agents subscribe to vendor APIs that broadcast real-time offers matching specific criteria, enabling hyper-personalized deal discovery that transcends the limitations of batch email campaigns.</p><p>Vendors might maintain agent-first promotional channels that communicate in structured data formats, allowing instant price comparison and benefit analysis. This ecosystem could enable &#8220;intent casting", where agents broadcast shopping goals to trusted vendor networks, receiving precisely targeted offers that align with the principal's timing, budget, and preferences&#8212;all without cluttering a human inbox or requiring manual coupon management.</p><p>Consider a practical example: A user instructs their agent to monitor high-end fashion retailers for specific items within their style preferences and budget constraints. Rather than the user repeatedly checking websites or subscribing to countless email lists, their agent maintains persistent monitoring across multiple vendors:</p><ul><li><p>Real-time inventory tracking across size, color, and style variations</p></li><li><p>Dynamic price monitoring, including flash sales and member-exclusive discounts</p></li><li><p>Evaluation of shipping times and costs to the user's location</p></li></ul><p>When ideal conditions align&#8212;perhaps a preferred sweater hits the target price point during an end-of-season sale&#8212;the agent can either notify the user or execute the purchase automatically based on pre-authorized parameters. This transforms shopping from an attention-demanding activity into an efficient background process governed by clear rules and preferences.</p><p>Major retailers like Nordstrom or ASOS could expose agent-specific APIs that provide structured access to:</p><ul><li><p>Real-time inventory and pricing data</p></li><li><p>Detailed product specifications and measurements</p></li><li><p>Membership program benefits and restrictions</p></li><li><p>Regional availability and shipping constraints</p></li></ul><p>This evolution democratizes personal shopping, allowing everyone to benefit from persistent, intelligent monitoring of their fashion preferences&#8212;not just those who can afford human personal shoppers. It also enables retailers to better match inventory with actual customer intent, reducing overhead from unsold merchandise and improving supply chain efficiency.</p><p>Other companies like Honey may need to pivot to offer agent-optimized tools that integrate directly with vendor APIs, allowing agents to query real-time discounts and rewards. Similarly, Mailchimp and HubSpot could develop agent-oriented campaign frameworks that distribute offers as structured data streams rather than traditional email blasts, ensuring seamless integration with agent-driven workflows.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">AI Tidbits is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2><strong>Agent-native customer support</strong></h2><p>Customer support today centers on human interactions or user-facing chatbots. In an agent-first paradigm, this shifts to agent-to-agent communication. Personalized AI agents will directly engage with business systems to resolve issues, retrieve shipping or refund policies, or autonomously initiate returns. This evolution will streamline processes, reduce human intervention, and enhance efficiency in support workflows.</p><p>In an agent-first paradigm, customer support is no longer solely about human users contacting businesses through chat widgets or call centers. Instead, autonomous agents interact directly with enterprise systems, pulling diagnostic information, requesting refunds, or escalating complex issues to a more constrained/expensive resource like a human or a superior model (e.g., o1 over GPT-4o). This shift encourages platforms like Intercom to develop agent-oriented communication layers&#8212;specialized APIs that allow autonomous agents to navigate support options, retrieve knowledge base articles, and submit detailed troubleshooting requests without human intervention.</p><p>These agent-facing APIs would streamline issue resolution, allowing routine queries, such as package tracking, account verification, or policy clarifications, to be handled agent-to-agent, drastically reducing response times. As soon as a problem arises, the user&#8217;s agent can pinpoint the issue and connect with the business&#8217;s support agent (be it a specialized LLM or a human representative), negotiating resolutions or applying discounts as needed. The result is a fluid, automated dialogue that bypasses human frustration and latency.</p><p>Over time, companies could implement reputation scoring systems that measure how efficiently their support agents (both human and AI) interact with consumer agents. Metrics like resolution speed, policy clarity, and refund accuracy become machine-readable signals, informing user agents which vendors offer superior support experiences. As more vendors embrace these standards, the entire support ecosystem evolves: prompt, well-structured responses become the norm, and agent-native customer support becomes a hallmark of high-quality digital services.</p><p>Imagine a complex warranty claim scenario: Your agent detects that your new laptop's battery is degrading unusually fast. It immediately:</p><ol><li><p>Collects diagnostic data and usage patterns</p></li><li><p>Cross-references warranty terms with actual performance</p></li><li><p>Initiates a support interaction with the manufacturer's agent</p></li><li><p>Negotiates a resolution based on precedent cases</p></li><li><p>Arranges shipping for replacement parts or full device replacement</p></li><li><p>Schedules a technician visit if needed</p></li></ol><p>This entire process happens without human intervention unless exceptional circumstances arise. The interaction generates a complete audit trail, including all diagnostic data, communication logs, and decision points&#8212;valuable data for improving both product quality and support processes.</p><p>One example in this space of agentic customer support is <a href="https://sierra.ai/">Sierra</a>, a startup taking aim at the expansive market of customer support by embedding AI agents into business workflows. Their conversational agents handle complex queries with contextual precision, managing tasks such as processing returns or updating subscriptions. While their primary focus remains on serving human customers, the foundation they&#8217;ve built is clearly aligned with an AI agent-driven future. With access to company policies (e.g., refund and shipping rules) and robust conversational AI infrastructure (spanning LLMs and voice interfaces), Sierra is well-positioned to seamlessly transition to support agent-to-agent interactions as demand evolves.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cVCp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cVCp!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 424w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 848w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 1272w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cVCp!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif" width="618" height="583.2375" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:755,&quot;width&quot;:800,&quot;resizeWidth&quot;:618,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp (25).mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp (25).mp4 [optimize output image]" title="temp (25).mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!cVCp!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 424w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 848w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 1272w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sierra&#8217;s agents resolve support tickets autonomously, with Sierra customers only paying for <a href="https://sierra.ai/blog/outcome-based-pricing-for-ai-agents">resolved tickets</a></figcaption></figure></div><h2><strong>The end of language-optimized interfaces</strong></h2><p>As AI agents seamlessly translate and interpret information on the fly, the need for painstakingly maintained multilingual websites diminishes. Instead of forcing businesses to host separate English, French, or Mandarin versions of their interfaces, agents handle language conversion dynamically. This capability allows brands to maintain a single, streamlined codebase while ensuring that users, regardless of location, receive content and instructions in their preferred language&#8212;instantly and accurately.</p><p>Website builders like Webflow and Wix could evolve into platforms that generate &#8220;universal templates&#8221; optimized for agent interpretation rather than human linguistic preferences. Instead of focusing on localized landing pages, these platforms would produce standardized, machine-readable structures enriched with metadata and semantic cues. Agents, armed with cutting-edge language models, would then adapt the presentation layer for each user, including local dialects, cultural nuances, and even personalization cues drawn from the user&#8217;s profile.</p><p>The transformation goes beyond simple translation. Agents will handle complex cultural adaptations across multiple dimensions simultaneously. They'll dynamically adjust pricing strategies for different markets while modifying product descriptions to reflect local preferences and purchasing patterns. These agents will intelligently adapt imagery and design elements to ensure cultural appropriateness, automatically managing regional compliance requirements such as privacy policies or consumer protection disclosures. They'll even personalize communication styles based on cultural norms, shifting between formal and casual tones and adapting messaging cadence to match local expectations. This comprehensive cultural intelligence transforms what was once a labor-intensive localization process into a fluid, automated system that maintains cultural authenticity across all customer touchpoints.</p><p>In this new reality, the value proposition of website builders shifts from localization to robustness, structure, and data integrity. Rather than wrestling with manual translations or commissioning multiple language variants, businesses can rely on well-defined data schemas and agent-ready manifests. As a result, the concept of &#8220;language-optimized&#8221; sites becomes obsolete, replaced by fluid, dynamic interfaces that transcend linguistic barriers.</p><h2><strong>Small vs. large business impact</strong></h2><p>This transition creates both opportunities and challenges across the business spectrum. Large enterprises can invest in building sophisticated agent interfaces and maintaining complex agent-ready APIs. However, small businesses might initially struggle with the technical requirements and infrastructure costs.</p><p>To bridge this gap, we will likely see the emergence of "agent-enablement platforms"&#8212;services that help small businesses become agent-ready without significant technical investment. Think of them as the Shopify of the agent era, providing standardized tools that level the playing field. These platforms would offer pre-built solutions for creating agent-readable product catalogs and managing automated pricing and inventory systems. They would include standardized support protocols that small businesses can easily implement, along with simplified integration paths to agent payment systems. By democratizing access to agent-ready commerce capabilities, these platforms will play a crucial role in preventing a digital divide between large and small businesses in the agent economy.</p><p>This democratization of agent-ready commerce will be crucial for preventing a digital divide between large and small businesses in the agent economy.</p><h2><strong>The Great Rewiring</strong></h2><p>The transition to an agent-first internet represents more than just a technological shift&#8212;it's a fundamental reimagining of how commerce functions in the digital age. We're moving from a web optimized for human attention and interaction to one built for efficient, automated decision-making. This transformation touches every aspect of online business:</p><ul><li><p>Payment systems evolve from human-verification models to agent-oriented protocols with built-in delegation and accountability</p></li><li><p>Marketing shifts from attention-grabbing campaigns to structured, machine-readable offer networks</p></li><li><p>Customer support transforms from human-to-human interaction to efficient agent-to-agent problem resolution</p></li><li><p>Language barriers dissolve as agent-mediated communication enables seamless global commerce</p></li></ul><p>Companies that quickly adapt to this new paradigm&#8212;implementing agent passports, embracing agent-to-agent protocols, and restructuring their services for machine readability&#8212;will shape the next era of online interaction. Just as the mobile revolution created trillion-dollar opportunities, the agent revolution opens new horizons for innovation and value creation. The businesses that thrive won't just be those with the best products or prices, but those that best enable and embrace agent-driven commerce.</p><div><hr></div><p>This is the third essay in a <a href="https://www.aitidbits.ai/s/ai-agents">five-part series</a> exploring the future of AI agents and their impact on the internet. Having established the foundational shifts and their practical implications across multiple domains, our next post will examine early case studies and implementation challenges as businesses begin this transformation. Subscribe and follow as we continue to navigate the technical, social, and economic implications of this new paradigm.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">AI Tidbits is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[LinkedIn Highlights, Dec 2024]]></title><description><![CDATA[Claude&#8217;s new PDF API, a playground to build with the new Gemini Realtime Multimodal API, open multimodal vision models from Meta, an open-source Perplexity alternative, and easy LLM fine-tuning]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-dec-2024</link><guid isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-dec-2024</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Thu, 02 Jan 2025 16:00:45 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b46c580b-68d6-4b07-ae11-27a0193832f3_800x500.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong>five top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry.</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><h1>1. MindSearch</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!D0dY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!D0dY!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 424w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 848w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 1272w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!D0dY!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif" width="656" height="370.64" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:339,&quot;width&quot;:600,&quot;resizeWidth&quot;:656,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!D0dY!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 424w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 848w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 1272w, https://substackcdn.com/image/fetch/$s_!D0dY!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fac5ba183-3a6f-43ef-a447-e88fc733fad7_600x339.gif 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>An open-source search engine is rivaling top-tier AI products like <a href="http://perplexity.ai/">Perplexity.ai</a> Pro and ChatGPT web search.</p><p>MindSearch is an innovative AI search engine framework that combines LLMs and a multi-agent system to tackle three critical issues that often limit LLM-powered search engines:</p><ol><li><p>LLMs struggle to decompose complex queries into simpler, actionable requests</p></li><li><p>Search results often contain too much noise, making it hard to filter and extract relevant information</p></li><li><p>Iterative searches can quickly overload the LLM&#8217;s input length capacity</p></li></ol><p>MindSearch utilizes two main components:</p><ul><li><p>WebPlanner - decomposes complex queries into sub-tasks and creates a dynamic graph structure for problem-solving</p></li><li><p>WebSearcher - conducts fine-grained searches and delivers summarized information back to WebPlanner for further refinement</p></li></ul><p>This approach allows MindSearch to handle massive web content (e.g., more than 300 pages) effectively, surpassing limitations faced by traditional LLM-based search systems.</p><p>Code <a href="https://github.com/InternLM/MindSearch">https://github.com/InternLM/MindSearch</a></p><div><hr></div><h1>2. Gemini Multimodal Playground</h1><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;71791ce7-ddf9-425a-90dd-2ea0ccb32d9a&quot;,&quot;duration&quot;:null}"></div><p>Holiday coding project: Build voice agents that can see with Google's new Gemini 2.0 model and my new real-time Multimodal Playground repo.</p><p>The playground implements voice and video-based interactions with the new Gemini model, allowing natural conversations in real-time while solving the critical background noise challenge using Voice Activity Detection (VAD).</p><p>In the last few days, I added a full-stack web app to interact with Gemini (see video below) along with a standalone script for those eager to quickly dive into building real-time voice agents.</p><p>Google&#8217;s real-time Gemini model is a game-changer, enabling you to independently create production-ready voice agents for industries like customer service, education, and healthcare in a matter of days.</p><p>Happy holidays. Go build! <a href="https://github.com/saharmor/gemini-multimodal-playground">https://github.com/saharmor/gemini-multimodal-playground</a></p><div><hr></div><h1>3. Meta Apollo</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!H-EB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!H-EB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 424w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 848w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!H-EB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg" width="1456" height="817" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:817,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!H-EB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 424w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 848w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!H-EB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5025b01f-f32e-4616-a9ed-69af18c7c123_2048x1149.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Video understanding has been lagging behind text, image, and audio modalities&#8212;until now.</p><p>Meta and Stanford researchers unveiled Apollo, a new family of state-of-the-art video-centric large multimodal models (video-LMMs) designed to close this gap. Unlike prior efforts, Apollo sets a new standard by efficiently analyzing hour-long videos and achieving breakthrough results on multiple benchmarks.</p><p>Paper highlights:</p><ol><li><p>Scaling Consistency - design decisions made with smaller models transfer reliably to larger ones, drastically cutting computational costs</p></li><li><p>Advanced video sampling techniques - Apollo uses FPS sampling, outperforming traditional uniform sampling methods</p></li><li><p>Streamlined evaluation - the new ApolloBench benchmark evaluating video-LMMs efficiently, reducing evaluation time by 41x while maintaining accuracy</p></li></ol><p>Apollo&#8217;s superior video comprehension capabilities pave the way for breakthroughs like real-time video summarization for content creators, better temporal reasoning for medical diagnostics, and enhanced video analytics for autonomous driving.</p><p>With Apollo, video understanding might finally catch up to its multimodal counterparts.<br><br>Project page <a href="https://apollo-lmms.github.io/">https://apollo-lmms.github.io</a></p><div><hr></div><h1>4. Claude PDF API</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZlDd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZlDd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg" width="1456" height="1064" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1064,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!ZlDd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ZlDd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbf269167-603a-4fde-b0ac-c58129bb3d41_1938x1416.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Anthropic has introduced a powerful new PDF-processing feature in its Claude API, surpassing basic text extraction, and it has largely flown under the radar.</p><p>Historically, many LLMs stumble when documents include complex elements like images, charts, and LaTeX formulas. But Anthropic&#8217;s latest upgrade manages to parse both textual and visual content within a PDF&#8212;no extra coding wizardry needed.</p><p>Key capabilities include:</p><ol><li><p>Automatically parsing PDF text, images, and tables for further analysis, from answering questions about the attached PDF to turning unstructured data into formatted JSONs</p></li><li><p>Providing insight on charts and diagrams by evaluating visual context, not just textual tags</p></li><li><p>Extracting and interpreting LaTeX for scientific or technical documentation</p></li></ol><p>It works by splitting each PDF into two components: the text is extracted as normal, and the entire page is converted into an image. Claude then merges text and visual context for a more holistic understanding. It&#8217;s essentially combining LLM intelligence with basic computer vision techniques.</p><p>The API supports up to 32MB or 100 pages of PDF content and pricing is similar to the LLM pricing so there&#8217;s no premium cost for PDF analysis.</p><p>This API could dramatically streamline how we handle financial reports, legal docs, or any PDF requiring detailed interpretation.<br></p><p>Ready-to-run notebook analyzing Anthropic's constitutional AI paper here <a href="https://github.com/anthropics/anthropic-cookbook/blob/main/misc/pdf_upload_summarization.ipynb">https://github.com/anthropics/anthropic-cookbook/blob/main/misc/pdf_upload_summarization.ipynb</a></p><div><hr></div><h1>5. LLaMa-Factory</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hzZb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hzZb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hzZb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg" width="1456" height="816" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:816,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!hzZb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hzZb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F02d6f703-c201-41fb-ae20-4992a4d404e2_2048x1148.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When is it better to fine-tune a language model than using prompt engineering or RAG? Here&#8217;s a clear framework you can apply along with an open-source library I use for fine-tuning.</p><p>Good reasons to fine-tune:</p><ol><li><p>Emphasizing knowledge that already exists in the model - for instance, in a text-to-SQL task, fine-tuning can be used to emphasize specific SQL dialects or to avoid error-prone edge cases, utilizing the comprehensive understanding of SQL syntax, dialects, and database functionality that the model already possesses.</p></li><li><p>Customizing the structure or tone of responses - fine-tuning can modify the structure or tone of a model's output, such as making the model output valid JSON, which is beneficial for programmatic interactions where handling invalid JSON could lead to many downstream error cases. This includes fine-tuning a model to your company&#8217;s writing style.</p></li><li><p>Teaching a model very complex instructions - fine-tuning allows for showing the model many more examples than can be included in a model's context window, which is helpful for complex instructions. This leads to cheaper and faster inference.</p></li></ol><p>Wrong reasons to fine-tune:</p><ol><li><p>Adding new knowledge to the base model - the knowledge in a large language model is established during the pre-training runs. New knowledge can't effectively be introduced during the limited scope of fine-tuning. RAG is better suited in such cases.</p></li><li><p>Quickly iterating on a new use-case - fine-tuning involves a slower feedback loop and requires substantial investment in creating the dataset and other aspects of the fine-tuning process. Therefore, it's not suitable for rapid iteration of new use cases.</p></li></ol><p>My preferred tool for fine-tuning open language models is LLaMA-Factory. It features 100+ different large language models, including Meta&#8217;s Llama-2, Google&#8217;s Gemma, and Mistral&#8217;s Mixtral. It also supports advanced algorithms like LoRA, QLoRA, and GaLore for optimized performance.<br><br>GitHub repo <a href="https://github.com/hiyouga/LLaMA-Factory">https://github.com/hiyouga/LLaMA-Factory</a></p><div><hr></div><p><strong>Last month&#8217;s LinkedIn Highlights</strong></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;caa2247c-7336-41c5-a815-99a63469778b&quot;,&quot;caption&quot;:&quot;Something different today: Rather than our usual Thursday roundup, I'll take a slight detour to share some in-depth insights about AI Agents that have occupied my mind lately. For the next two weeks, expect more of Sahar's 2&#162; pieces.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;LinkedIn Highlights, Oct 2024&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-07T15:30:18.165Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbaedaba3-793a-45d0-9e24-d71077b464c4_2048x1200.jpeg&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/linkedin-october-2024&quot;,&quot;section_name&quot;:&quot;Monthly's&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:150965620,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:17,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p></p>]]></content:encoded></item><item><title><![CDATA[The Voice Agents Toolkit for Builders]]></title><description><![CDATA[Curated frameworks, tools, and libraries to launch reliable and efficient voice agents]]></description><link>https://www.aitidbits.ai/p/voice-agents-toolkit</link><guid isPermaLink="false">https://www.aitidbits.ai/p/voice-agents-toolkit</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 22 Dec 2024 13:00:57 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em><strong>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.</strong></em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;2242c69c-d496-4931-a37a-cd1a8169d758&quot;,&quot;duration&quot;:1503.6082,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>Over the past few weeks, I've explored how AI agents will fundamentally reshape the internet. From introducing the concept of "agent-responsive design" - where websites optimize for AI interaction - to examining the infrastructure needed for agent-to-agent communication protocols, my previous posts laid the groundwork for understanding the agent-centric future of the web.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;160324a6-5e60-45e5-9ec6-fccfaa98b0d7&quot;,&quot;caption&quot;:&quot;Welcome to AI Tidbits Deep Dives: short posts offering a perspective on AI-related topics. Some of my previous ones covered the economies of scale for foundational AI models, the recent consolidation in the AI space, and autonomous agents.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Agent-Responsive Design: Rethinking the web for an agentic future&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-17T16:01:09.913Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d892766-b0f9-45ea-b64e-1955fc3c4dc6_2458x1406.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/agent-responsive-design&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:151674570,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:76,&quot;comment_count&quot;:13,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;b0999537-b746-4b4d-bc6e-4c6b73416b88&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI. Some of my previous posts listed the open-source toolkit for AI Agents builders, the economies of scale for foundational AI models&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Great Web Rebuild: Infrastructure for the AI Agent era&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-12-12T14:30:15.101Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/ai-agents-web-infrastructure&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:152954768,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:50,&quot;comment_count&quot;:7,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;d983874a-b658-4232-bb8d-93ab29e27779&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-28T12:31:04.504Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:152246141,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:104,&quot;comment_count&quot;:8,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p><br>Now, it's time to tackle perhaps the most natural and transformative interface for these agents: voice.</p><p>While text-based agents have dominated the early wave of AI applications, voice represents the next frontier in human-AI interaction. It's not just another interface - it's the most intuitive and accessible way for humans to interact with AI systems. This intersection of voice technology and AI agents creates unprecedented opportunities for developers, so I'm excited to share this comprehensive guide to the voice AI ecosystem.</p><p>After decades of frustrating experiences with scoped voice assistants that cannot be interrupted and follow a narrow rule-based script, we're witnessing a fundamental shift in what's possible. Three key developments drive this shift:</p><ol><li><p><strong>Breakthrough in speech-native models</strong> - the release of OpenAI's Realtime API last October and Google's Gemini 2.0 <a href="https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/">Realtime Multimodal API</a> last week mark a transition from traditional "cascading architectures" (where speech is converted to text, processed, and converted back) to speech-native models that can process audio directly with unprecedented quality. With OpenAI's recent <a href="https://openai.com/index/o1-and-new-tools-for-developers/">60% Realtime API price reduction</a> and the hiring of WebRTC's founder, we're seeing a clear industry push toward making real-time voice interactions accessible and affordable.</p></li><li><p><strong>Dramatic reduction in complexity</strong> - what previously required hundreds of data scientists can now be achieved by small teams of AI engineers. We're seeing companies reach substantial ARR with lean teams by building specialized voice agents for specific verticals - from restaurant order-taking to lead qualification for sales teams.</p></li><li><p><strong>Infrastructure maturity</strong> - the emergence of robust developer platforms and middleware solutions has dramatically simplified voice agent development. These tools handle complex challenges like latency optimization, error handling, and conversation management, allowing developers to focus on building unique user experiences.</p></li></ol><p>This convergence creates a unique opportunity for builders. For the first time in human history, we have a god-like AI systems that converse like humans. The era of capable voice AI has arrived, opening up vast opportunities for innovators and developers alike.</p><p>Unlike web or mobile app development, where patterns are well-established, voice AI is still in its formative stage. The winners in this space will be those who can combine technical capability with a deep understanding of specific industry needs.</p><p>In this post, I'll provide a well-curated overview of the open-source and commercial tools available for developers building voice agents. While VCs segment the market based on investment opportunities, I'll map the ecosystem based on what matters to developers: APIs, SDKs, and tools you can actually use today. What is the go-to model for speech-to-text? The API for synthesis speech? Which tools do other builders rely on to develop voice agents? With the holiday season upon us, there's no better time to build your voice agent, turn it into a company, or automate a personal workflow.</p><p>Categories covered in this piece, along with my recommended stack and tips for choosing the right architecture at the end:</p><p><strong>Foundations</strong><br>&#8594; Speech-to-text<br>&#8594; Text-to-speech</p><p><strong>Voice AI Middleware</strong><br>&#8594; Speech-to-speech<br>&#8594; Conversational Agents Frameworks<br>&#8594; Evaluation</p><p><strong>End-to-end Automation<br></strong>&#8594; Full Stack</p><p><em><strong><br>Missing package or tool you think should have been part of this list? Comment on this post and I&#8217;ll take a look.</strong></em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wWQK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wWQK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 424w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 848w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 1272w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wWQK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png" width="1456" height="821" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:821,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1700082,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wWQK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 424w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 848w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 1272w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><pre><code><code>Become a premium AI Tidbits subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Foundations</h2><h3><strong>Speech-to-text (STT)</strong></h3><p>Automated transcription models have improved significantly over the past couple of years, featuring low latency and cost along with a decreasing WER (Word Error Rate). When selecting a speech-to-text model, consider these key factors:</p><ul><li><p><strong>Latency requirements</strong> - for real-time conversations, you need models that can process speech with &lt; 300ms latency. Cloud APIs like Assembly AI and Deepgram excel here, while Whisper variants are better suited for async transcription.</p></li><li><p><strong>Deployment constraints</strong> - open source models like Whisper.cpp offer flexibility for edge deployment and privacy-sensitive applications but require more engineering effort to optimize. Cloud APIs provide easier scaling but with higher operational costs.</p></li><li><p><strong>Language and accent support</strong> - while most models handle standard English well, performance varies significantly for other languages and accents. Whisper has broad language support, while specialized APIs like Assembly AI may offer better accuracy for specific use cases.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-hG2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-hG2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 424w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 848w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 1272w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-hG2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png" width="1456" height="797" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:797,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:257289,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-hG2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 424w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 848w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 1272w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Speed vs. price tradeoff for leading speech-to-text models. <a href="https://artificialanalysis.ai/speech-to-text#:~:text=Speed%20Factor%20vs.%20Price">Source</a></figcaption></figure></div><p><strong>Open-source (mostly Whisper-based)</strong></p><ol><li><p><a href="https://github.com/ggerganov/whisper.cpp">Whisper.cpp</a> </p></li><li><p><a href="https://github.com/m-bain/whisperX">WhisperX</a></p></li><li><p><a href="https://github.com/KoljaB/RealtimeSTT">RealtimeSTT</a></p></li></ol><p><strong>Commercial</strong></p><ol><li><p><a href="https://deepgram.com/product/speech-to-text">Deepgram</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p></li><li><p><a href="https://www.assemblyai.com/products/speech-to-text?utm_source=google&amp;utm_medium=cpc&amp;utm_campaign=Brand&amp;utm_term=assembly%20ai&amp;gad_source=1&amp;gclid=CjwKCAiAgoq7BhBxEiwAVcW0LHxQOUlx5xFMVPkhO5PweZDnEOPzYZ9huYGCwPsc-tC2HwxkruseEBoCwnAQAvD_BwE">Assembly AI</a></p></li><li><p><a href="https://platform.openai.com/docs/guides/speech-to-text">OpenAI&#8217;s hosted Whisper</a></p></li><li><p>Cloud Providers: <a href="https://azure.microsoft.com/en-us/products/ai-services/ai-speech">Azure</a>, <a href="https://cloud.google.com/speech-to-text?hl=en">Google</a>, <a href="https://aws.amazon.com/transcribe/">AWS</a>. There don&#8217;t seem to be significant differences between the cloud providers, so I usually opt for the one I already use for other services like storage and compute.</p></li></ol><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;6b948f35-326a-4ed2-94ec-aab2dcae103b&quot;,&quot;duration&quot;:null}"></div><p>&#128070; RealtimeSTT featured low-latency transcription with wake word activation</p><h3><strong>Text-to-speech (TTS)</strong></h3><p>Text-to-speech, once dominated by ElevenLabs&#8217; costly API as of June 2023, has evolved into a competitive market with numerous low-latency, cost-effective alternatives. Modern TTS systems have evolved far beyond simple voice synthesis, offering capabilities like:</p><ul><li><p><strong>Voice cloning</strong> - creating custom voices from just a few minutes of audio samples, enabling personalized agent voices</p></li><li><p><strong>Emotional synthesis</strong> - adjusting tone, pace, and emphasis to convey different emotions and speaking styles</p></li><li><p><strong>Multi-speaker synthesis</strong> - seamlessly switching between different voices in a single conversation</p></li></ul><p>The most significant recent advancement is the emergence of "speech-native" models that can generate highly natural speech without the traditional text-intermediate step. This enables more fluid conversations with lower latency and better preservation of emotional nuances.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7ApM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7ApM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 424w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 848w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 1272w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7ApM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png" width="670" height="446.3598901098901" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:670,&quot;bytes&quot;:230691,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7ApM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 424w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 848w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 1272w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Text-to-speech models leaderboard. <a href="https://artificialanalysis.ai/text-to-speech/arena?tab=Leaderboard">Source</a></figcaption></figure></div><p><strong>Open-source</strong></p><ol><li><p><a href="https://github.com/coqui-ai/TTS">Coqui TTS</a></p></li><li><p><a href="https://github.com/2noise/ChatTTS">ChatTTS</a></p></li><li><p><a href="https://github.com/myshell-ai/OpenVoice">OpenVoice</a></p></li></ol><p><strong>Commercial</strong></p><ol><li><p><a href="https://www.cartesia.ai/sonic">Cartesia</a></p></li><li><p><a href="https://platform.openai.com/docs/guides/text-to-speech">OpenAI TTS</a></p></li><li><p><a href="https://deepgram.com/product/text-to-speech">Deepgram Aura</a></p></li><li><p><a href="https://elevenlabs.io/text-to-speech">ElevenLabs</a> (just released <a href="https://elevenlabs.io/blog/meet-flash">Flash</a>&#8212;a 75ms latency model in 32 languages)</p></li></ol><p>Other popular APIs: <a href="https://play.ht/">Play</a>, <a href="https://aws.amazon.com/polly/">Amazon Poly</a>, <a href="https://azure.microsoft.com/en-us/products/ai-services/ai-speech">Azure AI Speech</a>, <a href="https://cloud.google.com/text-to-speech?hl=en">Google</a></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;bff556d4-1627-478d-bc9a-e6ba95c81f5b&quot;,&quot;duration&quot;:null}"></div><p>&#128070; Hello Patient&#8217;s voice agent is powered by Cartesia</p><div><hr></div><pre><code><code>Become a premium AI Tidbits subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Voice AI Middleware</h2><h3><strong>Speech-to-speech</strong></h3><p>OpenAI sparked everyone&#8217;s imagination when they demoed GPT-4o last April, showcasing real-time conversational AI that can be interrupted. Until a few months ago, this kind of technology wasn&#8217;t available to developers. Now, we have commercial APIs at a reasonable price (OpenAI dropped the price for its Realtime API by 60% last week) and open-source alternatives.</p><p>Speech-to-speech models are specialized AI systems that process and respond to voice input directly in the audio domain without converting to and from text. They are superior to cascading workflows that involve sequential speech-to-text, text-to-text, and text-to-speech models for several key reasons:</p><ol><li><p><strong>Ultra-low latency</strong> - by processing audio natively, these models achieve response times of ~300 milliseconds, matching natural human conversation speeds. Traditional cascading approaches often exceed 1000ms, creating noticeable delays.</p></li><li><p><strong>Better contextual understanding</strong> - speech-to-speech models preserve important non-textual elements like tone, emotion, and conversation dynamics that are often lost in text conversion.</p></li><li><p><strong>Natural interruptions</strong> - unlike cascading systems that require rigid turn-taking, speech-native models can listen and process input even while speaking, enabling natural interruptions and more fluid conversations.</p></li><li><p><strong>Improved conversation quality</strong> - by maintaining the audio context throughout the interaction, these models better handle overlapping speech, background noise, and the natural rhythm of human dialogue.</p></li></ol><p><strong>Open-source</strong></p><ul><li><p><a href="https://github.com/fixie-ai/ultravox">Ultravox</a></p></li><li><p><a href="https://github.com/kyutai-labs/moshi">Moshi</a></p></li></ul><p><strong>Commercial</strong></p><ul><li><p><a href="https://platform.openai.com/docs/guides/realtime?text-generation-quickstart-example=audio">OpenAI&#8217;s Realtime API</a></p></li><li><p><a href="https://ai.google.dev/api/multimodal-live">Gemini 2.0 Multimodal Live API</a></p></li></ul><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;addd3c0f-15a6-48af-b7bd-8827698acad6&quot;,&quot;duration&quot;:null}"></div><p>&#128070; Conversing with Google&#8217;s new Gemini 2.0 (source: <a href="https://github.com/saharmor/gemini-multimodal-playground">Gemini Playground</a>)</p><h3><strong>Conversational Agents Frameworks</strong></h3><p>Conversational agent frameworks provide the essential scaffolding needed to build production-ready voice AI applications. These frameworks abstract away much of the complexity in managing real-time voice infrastructure, handling edge cases, and orchestrating the various components of a voice application. Pipecat offers an open-source approach for developers who want complete control over their stack and need flexibility to build custom multimodal experiences. Vocode provides a more opinionated toolkit that simplifies the process of creating voice-based LLM agents, handling much of the complexity around conversation management and error handling. Vapi takes this further by offering a full-featured platform that abstracts away infrastructure challenges while providing enterprise-grade reliability and scalability. The choice between these frameworks often comes down to the specific requirements of your project - whether you need the flexibility of open-source, the simplicity of a focused toolkit, or the robustness of a complete platform.</p><p><strong>Open-source</strong></p><ul><li><p><a href="https://github.com/pipecat-ai/pipecat">Pipecat</a></p></li><li><p><a href="https://github.com/vocodedev/vocode-core">Vocode</a></p></li></ul><p><strong>Commercial</strong></p><ul><li><p><a href="https://vapi.ai/">Vapi</a></p></li></ul><h3><strong>Evaluation</strong></h3><p>One of the most significant challenges in voice AI development is accurately measuring agent performance. Unlike text-based interactions, where metrics like response accuracy and latency are relatively straightforward, voice agents require a more nuanced evaluation approach that considers elements like conversation flow, emotional appropriateness, and real-world task completion.</p><p>Technical performance metrics form the foundation of evaluation, starting with response latency - which should target under 300ms for natural conversation and be measured across different percentiles, while speech recognition accuracy is tracked through Word Error Rate (WER) with special attention to domain-specific terminology. The second dimension focuses on conversational quality metrics that measure the agent's ability to handle natural dialogue. This includes turn management metrics like interruption handling and end-of-speech detection, along with task completion metrics such as success rate and the number of turns needed to complete a task. User experience metrics round out the evaluation framework, tracking call abandonment rates, average call duration, user sentiment scores, and the frequency of repeat requests.</p><p>When done right, proper evaluation becomes a game-changing unlock - it enables developers to identify subtle issues that could frustrate users (like awkward pauses or mismatched emotional responses), optimize for natural conversation patterns, and ensure consistent performance across different accents and background noise conditions.</p><p><strong>Open-source</strong></p><ul><li><p><a href="https://github.com/saharmor/voice-lab">Voice Lab</a></p></li></ul><p><strong>Commercial</strong></p><ul><li><p><a href="https://www.coval.dev/">Coval</a></p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!r4LJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!r4LJ!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif" width="693" height="346.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:800,&quot;resizeWidth&quot;:693,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Demo usage&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Demo usage" title="Demo usage" srcset="https://substackcdn.com/image/fetch/$s_!r4LJ!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Evaluating voice agents using Voice Lab</figcaption></figure></div><h2>End-to-end Automation</h2><h3><strong>Full Stack</strong></h3><p>The Stripe for voice agents. From prompt to a call.</p><p>While building with individual components offers maximum flexibility, full-stack solutions make sense when you need to quickly deploy production-ready voice agents without managing infrastructure complexity. These platforms are particularly valuable for teams that need to focus on their core business logic and customer experience rather than wrestling with the intricacies of voice infrastructure, latency optimization, and scaling challenges.</p><p>These solutions typically provide end-to-end capabilities,  including pre-built integrations with popular business systems (CRMs, scheduling tools, payment processors), sophisticated error handling, automated failover, and comprehensive analytics. They handle complex engineering challenges like maintaining low latency during peak loads, graceful degradation during API failures, and automated quality monitoring. While you trade some flexibility compared to building your own stack, these platforms can dramatically accelerate time-to-market and reduce engineering overhead.</p><ul><li><p><a href="https://www.bland.ai/">Bland</a></p></li><li><p><a href="https://deepgram.com/product/voice-agent-api">Deepgram Voice Agents API</a></p></li><li><p><a href="https://elevenlabs.io/conversational-ai">Eleven Labs Conversational AI</a></p></li></ul><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;c5a3c135-558f-4323-ab28-17c2da33b654&quot;,&quot;duration&quot;:null}"></div><p>&#128070; A Deepgram-powered voice agent handles a food order</p><div><hr></div><h2>Choosing the right architecture for your voice agent</h2><p>When selecting a voice AI architecture, developers face a fundamental choice between using full-stack platforms and assembling custom solutions from individual components. This decision ultimately comes down to three key factors: latency requirements, cost constraints, and the need for fine-grained control over the conversation flow.</p>
      <p>
          <a href="https://www.aitidbits.ai/p/voice-agents-toolkit">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Great Web Rebuild: Infrastructure for the AI Agent era]]></title><description><![CDATA[How core internet components will transform for an agent-driven web and the new opportunities for startup founders]]></description><link>https://www.aitidbits.ai/p/ai-agents-web-infrastructure</link><guid isPermaLink="false">https://www.aitidbits.ai/p/ai-agents-web-infrastructure</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Thu, 12 Dec 2024 14:30:15 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI. Some of my previous posts listed the <a href="https://www.aitidbits.ai/p/open-source-agents">open-source toolkit for AI Agents builders</a>, the <a href="https://www.aitidbits.ai/p/economies-of-scale-gen-ai">economies of scale for foundational AI models</a>, and <a href="https://www.aitidbits.ai/p/the-rise-of-autonomous-agents">the future of autonomous agents</a>.</em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;0af73734-f156-4b74-9423-6cee42ae4079&quot;,&quot;duration&quot;:1023.3992,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>December 2028. Sarah's AI agent encounters an unusual situation while booking her family's holiday trip to Japan. The multi-leg journey requires coordinating with three different airlines, two hotels, and a local tour operator. As the agent begins negotiations, it presents its "agent passport"&#8212;a cryptographic attestation of its delegation rights and transaction history. The vendors' systems instantly verify the agent's authorization scope, spending limits, and exposable metadata like age and passport number. Within seconds, the agent has established secure payment channels and begun orchestrating the complex booking sequence. When one airline's system flags the rapid sequence of international bookings as suspicious, the agent smoothly provides additional verification, demonstrating its legitimate delegation chain back to Sarah. What would have triggered fraud alerts and CAPTCHA challenges in 2024 now flows seamlessly in an infrastructure built for autonomous AI agents.</p><p><em>&#8212;&gt; The future, four years from now.</em></p><p>In my <a href="https://www.aitidbits.ai/p/agent-responsive-design">previous essay</a>, we explored how websites and applications must evolve to accommodate AI agents. Now we turn to the deeper infrastructural shifts that make such agent interactions possible. The systems we've relied on for decades: CAPTCHAs, credit card verification, review platforms, and authentication protocols, were all built with human actors in mind. As AI agents transition from experimental curiosities to fully operational assistants, the mechanisms underpinning the digital world for decades are beginning to crack under the pressure of automation.</p><p>The transition to an agent-first internet won't just streamline existing processes&#8212;it will unlock entirely new possibilities that were impractical in a human-centric web. Tasks that humans find too tedious or time-consuming become effortless through automation. Instead of clicking 'Accept All' on cookie banners, agents can granularly optimize privacy preferences across thousands of sites. Rather than abandoning a cart due to complex shipping calculations, agents can simultaneously compare multiple courier services and customs implications. Even seemingly simple tasks like comparing prices across multiple vendors, which humans typically limit to 2-3 sites, can be executed across hundreds of retailers in seconds. Perhaps most importantly, agents can maintain persistent relationships with services, continuously monitoring for price drops, policy changes, or relevant updates that humans would miss. This shift from manual, limited interactions to automated, comprehensive engagement represents not just a change in speed, but a fundamental expansion of what's possible online.</p><p>Amid these sweeping changes, a new gold rush is emerging. Just as the shift to mobile created opportunities for companies like Uber and Instagram to reinvent existing services, the transition to agent-first infrastructure opens unprecedented possibilities for founders. From building next-generation authentication systems and trust protocols to creating agent-mediated data marketplaces, entrepreneurs have a chance to establish the foundational layers of this new paradigm. In many ways, we're returning to the internet's early days, where core infrastructure is being reimagined from the ground up&#8212;this time for an autonomous, agent-driven future.</p><p>In this second post of the <a href="https://www.aitidbits.ai/s/ai-agents">AI Agents series</a>, we&#8217;ll focus on the foundational infrastructure changes that underlie the agent-first internet: new authentication mechanisms, trust systems, novel security challenges, and agent-to-agent protocols, setting the stage for the more commerce-oriented transformations we&#8217;ll explore in the following post.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bMLb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bMLb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 424w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 848w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 1272w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bMLb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1068161,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bMLb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 424w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 848w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 1272w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>Proving you're <s>a human</s> an agent</strong></h2><p>Remember when "proving you're not a robot" meant deciphering distorted text or selecting crosswalk images? Those mechanisms become obsolete in a world where legitimate automated actors are the norm rather than the exception. Today&#8217;s CAPTCHAs, designed to block bots, have become increasingly complex due to advances in multimodal AI. Paradoxically, these mechanisms now hinder real humans while sophisticated bots often bypass them. As AI outpaces human problem-solving in these domains, CAPTCHAs risk becoming obsolete, reducing website conversions and frustrating legitimate users.</p><p>The challenge shifts from proving humanity to verifying the agent has been legitimately delegated and authorized by a human user.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iuJ0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iuJ0!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 424w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 848w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 1272w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif" width="659" height="570.035" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:692,&quot;width&quot;:800,&quot;resizeWidth&quot;:659,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;challenging captcha.mp4 [video-to-gif output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="challenging captcha.mp4 [video-to-gif output image]" title="challenging captcha.mp4 [video-to-gif output image]" srcset="https://substackcdn.com/image/fetch/$s_!iuJ0!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 424w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 848w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 1272w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I recently failed a CAPTCHA three times before finally passing on the fourth attempt. Now picture an 80-year-old attempting to decipher increasingly convoluted challenges</figcaption></figure></div><p>Today&#8217;s rate-limiting mechanisms assume human-paced interactions, relying heavily on IP-based throttling to manage access. But in a world of AI agents, what constitutes "fair use" of digital services? In an agent-driven internet, automated browsing will become not just accepted but essential. Cloudflare, Akamai, and similar services will need to pivot from simplistic IP-based throttling to sophisticated agent-aware frameworks.</p><p>As businesses grapple with these challenges, a new solution is emerging&#8212;one that shifts the paradigm from blocking automated traffic to authenticating and managing it intelligently. Enter the <strong>Agent Passport</strong>.</p><p>Imagine a digital credential that encapsulates an agent's identity and permissions&#8212;cryptographically secured and universally recognized. Unlike simple API keys or OAuth tokens, these passports maintain a verifiable chain of trust from the agent back to its human principal. They carry rich metadata about permissions scope, spending limits, and authorized behaviors, allowing services to make nuanced decisions about agent access and capabilities.</p><p>By integrating Agent Passports, business websites like airlines can distinguish between legitimate, authorized agents and malicious actors. New metrics, such as agent reliability scores and behavioral analysis, could ensure fair access while mitigating abuse, balancing security with the need for allowing agent-driven traffic.</p><p>Authentication mechanisms, such as signing up and signing in, must also evolve for an agent-first internet. Websites will need to determine not just an agent's identity but also its authorized scope&#8212;what data the agent is authorized to access (&#8216;read&#8217;) and what actions it is permitted to execute (&#8216;write&#8217;).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!f50V!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!f50V!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 424w, https://substackcdn.com/image/fetch/$s_!f50V!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 848w, https://substackcdn.com/image/fetch/$s_!f50V!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 1272w, https://substackcdn.com/image/fetch/$s_!f50V!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!f50V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png" width="604" height="712.72" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1416,&quot;width&quot;:1200,&quot;resizeWidth&quot;:604,&quot;bytes&quot;:357639,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!f50V!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 424w, https://substackcdn.com/image/fetch/$s_!f50V!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 848w, https://substackcdn.com/image/fetch/$s_!f50V!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 1272w, https://substackcdn.com/image/fetch/$s_!f50V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Google Login revolutionized online authentication by centralizing access with a single credential, reducing friction and enhancing security. Similarly, agent passports could create a universal standard for agent authentication, simplifying multi-platform access while maintaining robust authorization controls.</p><p>Companies like Auth0 and Okta could adapt by offering agent-specific identity frameworks, enabling seamless integration of these passports into their authentication platforms. Meanwhile, consumer companies like Google and Apple could extend their authentication and wallet services to seamlessly support agent-mediated interactions, bridging the gap between human and agent use cases.</p><h2>A new protocol for Agent-to-Agent communication</h2><p>In the early days of the web, protocols like HTTP emerged to standardize how browsers and servers communicated. In much the same way, the rise of agent-mediated interactions demands a new foundational layer: an <strong>Agent-to-Agent Communication Protocol (AACP)</strong>. This protocol would formalize how consumer agents and business agents discover each other&#8217;s capabilities, authenticate identities, negotiate trust parameters, and exchange actionable data&#8212;all while ensuring both parties operate within well-defined boundaries.</p><p>Just as Sarah's travel agent from the intro paragraph seamlessly coordinated with multiple airlines and hotels, AACP enables complex multi-party interactions that would be tedious or impossible for humans to manage manually.</p><p>Much like HTTPS introduced encryption and certificates to authenticate servers and protect user data, AACP would implement cryptographic attestation for agents. Trusted third-party authorities, similar to today&#8217;s certificate authorities, would issue digital &#8220;agent certificates&#8221; confirming an agent&#8217;s legitimacy, delegation chain, and operational scope. This ensures that when a consumer&#8217;s travel-planning agent communicates with an airline&#8217;s booking agent, both sides can instantly verify authenticity and adherence to agreed-upon standards.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zXZv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zXZv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 424w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 848w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 1272w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zXZv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png" width="1204" height="1264" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1264,&quot;width&quot;:1204,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:204543,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zXZv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 424w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 848w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 1272w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A potential implementation of the AACP protocol. A full example of booking an airline ticket can be found <a href="https://claude.site/artifacts/4087f5cf-c4dd-41a2-87df-5faae7fca960">here</a>.</figcaption></figure></div><p>Without such a protocol, a rogue agent might impersonate a trusted retailer to trick consumer agents into unauthorized transactions, or a malicious consumer agent could spoof credentials to overwhelm a merchant&#8217;s infrastructure. By mandating cryptographic proof, robust authentication handshakes, and behavior logs, AACP mitigates these threats before meaningful data or funds change hands.</p><p>The handshake phase in AACP would include mutual disclosure of the agents&#8217; technical stacks&#8212;such as which LLM or language configuration they use&#8212;and their supported capabilities. Once established, the protocol would also govern &#8220;write-like operations&#8221; (e.g., initiating a payment or updating account details) by enforcing strict sign-offs with auditable cryptographic signatures. Every action would leave a verifiable trail of authorization that can be reviewed and validated after the fact.</p><p>Finally, AACP would incorporate locale and language negotiation at the protocol level. Although agents can translate and interpret content dynamically, specifying a preferred language or locale upfront helps streamline interactions. This new protocol weaves together trust, authentication, and contextual awareness, forging a resilient substrate on which the agent-first internet can reliably function.</p><pre><code><code>Become a premium member to access the LLM Builders series, $1k in free credits for leading AI tools and APIs, and editorial deep dives into key topics like AI Voice Agents. It's also a great way to show your support :)

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><h2>Trust and reputation reimagined</h2><p>When we navigate the internet, our judgment of a website's credibility hinges on a blend of visual and social cues. We look for secure HTTPS connections, professional design, and familiar branding to assure us that a site is trustworthy. No one wants to input their credit card information on a site that looks like it was built in the early 2000s. User reviews and star ratings on platforms like Trustpilot and G2 further influence our decisions, offering insights drawn from shared human experiences.</p><p>Perhaps no aspect of online commerce requires more fundamental reimagining than trust and reputation systems. In an agent-mediated economy, traditional cues for reliability fall short. AI agents can't interpret visual aesthetics or branding elements&#8211;they operate on data, protocols, and cryptographic proofs. Trust mechanisms must pivot from human perception to machine-readable verifications. For instance, an agent might verify a seller's identity through cryptographic attestations and assess service quality via automated compliance records, ensuring decisions are based on objective, tamper-proof data. Traditional review platforms like Trustpilot and G2, built around subjective human experiences and star ratings, will also become increasingly obsolete.</p><p>The emerging alternative is a new trust infrastructure built on quantifiable, machine-readable metrics. Instead of relying on potentially AI-generated reviews, a problem that has already undermined traditional review systems, agents could assess services using benchmarks like delivery time reliability, system uptime, or refund processing speed&#8212;measurable metrics that ensure objective evaluations rather than subjective human reviews. This could involve decentralized reputation networks where trust is established through cryptographically verified interaction histories and smart contract execution records. Such systems would offer objective assessments of service quality, enabling agents to make informed decisions without relying on potentially biased or manipulated human reviews.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3gJJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3gJJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 424w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 848w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 1272w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png" width="1456" height="537" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:537,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:493147,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3gJJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 424w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 848w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 1272w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Moreover, the feedback loop between consumers and businesses will evolve dramatically. Instead of sending generic emails requesting reviews&#8212;a method often resulting in low response rates&#8212;commerce websites can engage directly with your AI agent to collect timely feedback about specific topics like shipping or product quality. They might offer incentives like future store credit to encourage participation. The human user could provide a brief impression, such as "The cordless vacuum cleaner works well, but the battery life is short." The agent then takes this input, contextualizes it with additional product data, and generates a comprehensive review that highlights key features and areas for improvement. This process not only saves time for the user but also provides businesses with richer, more actionable insights.</p><p>Trustpilot and G2 could pivot by introducing agent-oriented verification systems, such as machine-readable trust scores derived from operational metrics like service accuracy, delivery consistency, and customer support responsiveness, enabling agents to evaluate businesses programmatically.</p><h2>The new data-sharing economy</h2><p>Information sharing in the age of AI agents demands a fundamental reinvention of the current consent and data access model. Rather than blunt instruments like cookie banners and privacy policies, websites will implement structured data requirement protocols&#8212;machine-readable manifests that explicitly declare what information is needed and why.</p><p>This granular control would operate at multiple levels of specificity. For example, an agent could share your shirt size (L) with a retailer while withholding your exact measurements. It might grant 24-hour access to your travel dates but permanent access to your seating preferences. When a service requests location data, your agent could share your city for shipping purposes but withhold your exact address until purchase confirmation. These permissions wouldn't be just binary yes/no choices&#8212;they could include sophisticated rules like "share my phone number only during business hours" or "allow access to purchase history solely for personalization, not marketing." Such granular controls, impossible to manage manually at scale, become feasible when delegated to AI agents operating under precise constraints.</p><p>AI agents would also act as sophisticated information gatekeepers, maintaining encrypted personal data vaults and negotiating data access in real time.</p><p>These mechanisms will fundamentally shift the balance of power in data-sharing dynamics. GDPR-like frameworks may evolve to include provisions for dynamic, agent-mediated consent, allowing for more granular data-sharing agreements tailored to specific tasks. Websites might implement real-time negotiation protocols, where agents can evaluate and respond to data requests based on their principal's preferences, preserving privacy while optimizing functionality.</p><h2>New attack vectors</h2><p>The shift to agent-mediated interaction introduces novel security challenges. Agent impersonation and jailbreaking agents are two examples.</p><p>Jailbreaking AI agents pose significant risks, as <a href="https://www.aitidbits.ai/i/135923979/manipulating-ai-models-for-profit">manipulated agents</a> could act outside their intended scope, leading to unintended purchases or other errors. Techniques like instruction-tuning poisoning or adversarial suffix manipulation could alter an agent&#8217;s behavior during critical tasks. For example, adversarial instructions embedded in websites&#8217; HTML might influence an agent&#8217;s purchasing logic, bypassing its human-defined constraints. Robust safeguards and continuous monitoring will be essential to prevent these vulnerabilities.</p><p>Agent impersonation adds a complex layer to cybersecurity challenges. Malicious actors could spoof an agent's credentials to access sensitive data or execute fraudulent transactions. Addressing this threat demands robust multi-layered verification protocols, such as cryptographic identity verification paired with continuous behavioral monitoring, to ensure authenticity and safeguard sensitive interactions.</p><h2>Building the new web - opportunities for founders</h2><p>The web&#8217;s agent-first future has no established playbook&#8212;and that&#8217;s exactly where founders thrive. Entirely new product categories are waiting to be defined: agent-to-agent compliance dashboards, cryptographic attestation services that replace outdated CAPTCHAs, and dynamic data-sharing frameworks that make &#8220;privacy by design&#8221; a reality. Platforms that offer standardized &#8220;agent passports,&#8221; identity brokerages that verify delegation rights, agent-native payment gateways, and trust ecosystems driven by machine-readable performance metrics&#8212;each of these represents a greenfield opportunity to set the standards of tomorrow&#8217;s internet. Startups anticipating these shifts can position themselves as foundational players in an agent-driven economy, opening new channels of value creation and establishing a competitive edge before the rest of the market catches up.</p><p>Some concrete areas include:</p><ul><li><p>Trustpilot for agents - creating machine-readable trust metrics and reputation systems that help agents evaluate services and vendors</p></li><li><p>Okta for AI agents - building the identity and authentication layer that manages agent credentials, permissions, and delegation chains</p></li><li><p>OneTrust for agents - creating the new standard for privacy preference management, turning today's basic cookie banners into sophisticated data-sharing frameworks where agents can negotiate and manage granular permissions across thousands of services</p></li><li><p>Cloudflare for agent traffic - developing intelligent rate-limiting and traffic management systems designed for agent-scale operations</p></li><li><p>LastPass for agent permissions - building secure vaults that manage agent credentials and access rights across services</p></li><li><p>AWS CloudFront for agent data - creating CDN-like infrastructure optimized for agent-readable formats and rapid agent-to-agent communication</p></li><li><p>McAfee security for agents - developing security platforms that protect against agent impersonation and novel attack vectors</p></li></ul><p><em><strong><br>Go build.</strong></em></p><div><hr></div><p>This is the second essay in a <a href="https://www.aitidbits.ai/s/ai-agents">five-part series</a> exploring the future of AI agents and their impact on the internet. In Part 3, we'll move from infrastructure to commerce, exploring critical questions that emerge in an agent-first economy: How will payment systems evolve to support autonomous purchasing decisions? What happens to marketing and customer engagement when AI agents become the primary decision-makers? And perhaps most intriguingly, how will businesses adapt their revenue models when agents can negotiate prices and terms at superhuman speed and scale? Subscribe and follow as we continue to examine the technical, social, and economic implications of this transformation.</p>]]></content:encoded></item><item><title><![CDATA[The Open-Source Toolkit for Building AI Agents]]></title><description><![CDATA[Curated frameworks, tools, and libraries every developer needs to build functional and efficient AI agents]]></description><link>https://www.aitidbits.ai/p/open-source-agents</link><guid isPermaLink="false">https://www.aitidbits.ai/p/open-source-agents</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Thu, 28 Nov 2024 12:31:04 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!p8_R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>June &#8216;25 update: Released an updated map with new frameworks and repositories since this post was published in Nov &#8216;24.</em></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;78e32312-2147-4e0e-ab16-1aa629bb1311&quot;,&quot;caption&quot;:&quot;An opinionated, developer-first guide to building AI agents with real-world impact&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;md&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents v2&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-06-01T15:02:05.876Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents-updated&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:163224476,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:28,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><p><em>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.</em></p><p>My last post explored how the internet will trans</p><p>form for an agent-first future - from websites optimizing for AI interaction through "agent-responsive design" to the emergence of Agent Engine Optimization (AEO) as the next SEO. We saw how tech giants like Google, Apple, OpenAI, and Anthropic are racing to define this next evolution of digital interaction, with Gartner projecting that by 2028, 33% of enterprise software applications will include agentic AI.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;025ad65a-0fcf-4b32-bcaf-0497505cdce2&quot;,&quot;caption&quot;:&quot;Welcome to AI Tidbits Deep Dives: short posts offering a perspective on AI-related topics. Some of my previous ones covered the economies of scale for foundational AI models, the recent consolidation in the AI space, and autonomous agents.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Agent-Responsive Design: Rethinking the web for an agentic future&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-17T16:01:09.913Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d892766-b0f9-45ea-b64e-1955fc3c4dc6_2458x1406.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/agent-responsive-design&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:151674570,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:76,&quot;comment_count&quot;:13,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>In this post, I'll outline a curated, though non-exhaustive, overview of the open-source ecosystem for developers creating these AI agents. While numerous market maps exist for AI agents, they often cater more to venture capitalists than builders. Developers need actionable tools and frameworks to launch functional AI agents today.</p><p>Which tools do other builders rely on to develop voice agents? What&#8217;s the leading open model for document understanding? With new packages emerging almost daily, I&#8217;ll focus solely on the libraries I&#8217;ve personally found most effective. This list is, therefore, intentionally selective rather than exhaustive.</p><p>Every package included here supports commercial use and has a permissive open-source license.</p><p>With the holiday season coming, there's no better time to dive into these tools and start building.</p><p>Categories covered in this piece:<br>&#8594; Frameworks for Building and Orchestrating Agents<br>&#8594; Computer and Browser Use<br>&#8594; Voice<br>&#8594; Document Understanding<br>&#8594; Memory<br>&#8594; Testing and Evaluation<br>&#8594; Monitoring and Observability<br>&#8594; Simulation<br>&#8594; Vertical Agents</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!p8_R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!p8_R!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 424w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 848w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 1272w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!p8_R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png" width="1456" height="810" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:810,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1849994,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!p8_R!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 424w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 848w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 1272w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><pre><code><code>Become a premium AI Tidbits subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Frameworks for Building and Orchestrating Agents</h2><p>Building AI agents requires robust frameworks that can handle complex workflows, memory management, and tool integration. These foundational frameworks serve as the backbone for creating agents that can understand, plan, and execute tasks autonomously.</p><ul><li><p><a href="https://github.com/crewAIInc/crewAI">CrewAI</a> - a framework for orchestrating role-playing, autonomous AI agents</p></li><li><p><a href="https://github.com/phidatahq/phidata">Phidata</a> - build AI assistants with memory, knowledge, and tools</p></li><li><p><a href="https://github.com/camel-ai/camel">Camel</a> - build customized multi-agent systems to generate data, complete tasks, or simulate real-world interactions</p></li><li><p><a href="https://github.com/Significant-Gravitas/AutoGPT">AutoGPT</a> - create, deploy, and manage continuous AI agents that automate complex workflows</p></li><li><p><a href="https://github.com/microsoft/autogen">AutoGen</a> - develop LLM applications using multiple agents that can converse with each other</p></li><li><p><a href="https://github.com/TransformerOptimus/SuperAGI">SuperAGI</a> - build, manage, and run autonomous AI agents quickly and reliably</p></li><li><p><a href="https://github.com/superagent-ai/superagent">Superagent</a> - an open framework for building AI assistants</p></li><li><p><a href="https://github.com/langchain-ai/langchain">LangChain</a> &amp; <a href="https://github.com/run-llama/llama_index">LlamaIndex</a> - the usual suspects, facilitating AI Agents through composability</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TASi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TASi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 424w, https://substackcdn.com/image/fetch/$s_!TASi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 848w, https://substackcdn.com/image/fetch/$s_!TASi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!TASi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TASi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png" width="1456" height="999" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:999,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;CrewAI Mind Map&quot;,&quot;title&quot;:&quot;CrewAI Mind Map&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="CrewAI Mind Map" title="CrewAI Mind Map" srcset="https://substackcdn.com/image/fetch/$s_!TASi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 424w, https://substackcdn.com/image/fetch/$s_!TASi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 848w, https://substackcdn.com/image/fetch/$s_!TASi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!TASi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">CrewAI supports running customized agents with specific roles, goals, and tools</figcaption></figure></div><h2>Computer and Browser Use</h2><p>For AI agents to be truly useful, they need to interact with computers and browsers just like humans do. These tools enable agents to navigate websites, control applications, and execute commands programmatically, bridging the gap between AI reasoning and real-world actions.</p><ul><li><p><a href="https://github.com/OpenInterpreter/open-interpreter">Open Interpreter</a> - turn natural language commands into code that runs on your local machine</p></li><li><p><a href="https://github.com/OthersideAI/self-operating-computer">Self-Operating Computer</a> - enables multimodal models to operate a computer</p></li><li><p><a href="https://github.com/simular-ai/Agent-S">Agent-S</a> - an open agentic framework that uses computers like a human</p></li><li><p><a href="https://github.com/lavague-ai/LaVague">LaVague</a> - create web agents that take actions on websites using LLMs as their reasoning engines</p></li><li><p><a href="https://github.com/microsoft/playwright">Playwright</a> - a framework for web testing and automation</p></li><li><p><a href="https://pptr.dev/">Puppeteer</a> - a JavaScript library that provides a high-level API to control Chrome or Firefox</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rMyZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rMyZ!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif" width="600" height="338" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:338,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [video-to-gif output image]&quot;,&quot;title&quot;:&quot;temp.mov [video-to-gif output image]&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [video-to-gif output image]" title="temp.mov [video-to-gif output image]" srcset="https://substackcdn.com/image/fetch/$s_!rMyZ!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 848w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Self-Operating Computer generates a poem and saves it in a Google Doc</figcaption></figure></div><div><hr></div><h2>Voice</h2><p>Voice interfaces represent the most natural way for humans to interact with AI agents. These tools enable the creation of agents that can understand spoken language, maintain context in conversations, and respond with natural-sounding speech, making AI interaction more accessible and intuitive.</p><p><strong>Speech2speech</strong></p><ul><li><p><a href="https://github.com/fixie-ai/ultravox">Ultravox</a> - a speech2speech model for real-time voice interaction, superior to Moshi for now</p></li><li><p><a href="https://github.com/kyutai-labs/moshi">Moshi</a> - a speech2speech model for real-time voice interaction</p></li><li><p><a href="https://github.com/pipecat-ai/pipecat">Pipecat</a> - a framework for voice and multimodal conversational AI, supporting speech2text, text2speech, video, etc.</p></li></ul><p><strong>Speech2text</strong></p><ul><li><p><a href="https://github.com/openai/whisper">Whisper</a> - OpenAI's speech2text model</p></li><li><p><a href="https://github.com/jianfch/stable-ts">Stable-ts</a> - a lightweight Whisper wrapper with timestamps and more</p></li></ul><ul><li><p><a href="https://huggingface.co/pyannote/speaker-diarization-3.1">Speaker diarization 3.1</a> - pyannote&#8217;s flagship model for speaker detection</p></li></ul><p><strong>Text2speech</strong></p><p>The only decent open model I came across was ChatTTS, which is satisfactory for production. I, therefore, default to ElevenLabs or Cartesia.</p><p><strong>Misc</strong></p><ul><li><p><a href="https://github.com/vocodedev/vocode-core">Vocode</a> - a toolkit for building voice-based LLM agents</p></li><li><p><a href="https://github.com/saharmor/voice-lab">Voice Lab</a> - a comprehensive testing and evaluation framework for voice agents across language models, prompts, and agent personas</p></li></ul><h2>Document Understanding</h2><p>Modern AI agents need to process and understand documents in various formats, from PDFs to images with text. These tools provide the crucial ability to extract, comprehend, and act on information from unstructured documents, enabling agents to handle real-world business processes.</p><ul><li><p><a href="https://huggingface.co/collections/Qwen/qwen2-vl-66cee7455501d7126940800d">Qwen2-VL</a> - vision language model from Alibaba outperforming GPT-4o and Claude 3.5 Sonnet</p></li><li><p><a href="https://huggingface.co/mPLUG/DocOwl2">DocOwl2</a> - an efficient multimodal LLM for OCR-free document understanding</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vTOR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vTOR!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 424w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 848w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 1272w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vTOR!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif" width="800" height="463" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:463,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image temp.mov&quot;,&quot;title&quot;:&quot;Image temp.mov&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image temp.mov" title="Image temp.mov" srcset="https://substackcdn.com/image/fetch/$s_!vTOR!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 424w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 848w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 1272w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Qwen2 excels in document and chart understanding with a commercially permissive license</figcaption></figure></div><h2>Memory</h2><p>Without memory, AI agents are limited to single-turn interactions. These memory tools enable agents to maintain context over long conversations, remember user preferences, and learn from past interactions, making them truly personal assistants rather than just query responders.</p><ul><li><p><a href="https://github.com/mem0ai/mem0">Mem0</a> - provides an efficient, self-improving memory layer for LLMs, enabling personalized AI experiences</p></li><li><p><a href="https://github.com/letta-ai/lettahttps://github.com/letta-ai/letta">Letta (fka MemGPT)</a> - create LLM agents with long-term memory and custom tools</p></li><li><p><a href="https://github.com/langchain-ai/langchain">LangChain</a> - offers memory components to manage conversation history and context</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BqBj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BqBj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 424w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 848w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 1272w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BqBj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png" width="1456" height="996" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:996,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Screenshot of the Letta ADE (Agent Development Environment)&quot;,&quot;title&quot;:&quot;Screenshot of the Letta ADE (Agent Development Environment)&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Screenshot of the Letta ADE (Agent Development Environment)" title="Screenshot of the Letta ADE (Agent Development Environment)" srcset="https://substackcdn.com/image/fetch/$s_!BqBj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 424w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 848w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 1272w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Stateful agents with Letta</figcaption></figure></div><div><hr></div><pre><code><code>Become a premium AI Tidbits subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Testing and Evaluation</h2><p>As AI agents become more complex, robust testing becomes critical. These tools help developers evaluate agent performance, identify failure modes, and ensure reliability across different scenarios and environments.</p><ul><li><p><a href="https://github.com/saharmor/voice-lab">Voice Lab</a> - a comprehensive testing and evaluation framework for voice agents</p></li><li><p><a href="https://github.com/AgentOps-AI/agentops">AgentOps</a> - tools for monitoring and benchmarking agent performance</p></li><li><p><a href="https://github.com/THUDM/AgentBench">AgentBench</a> - a benchmark to evaluate LLMs as agents across various environments (Web, Minecraft, Visual Design, etc.)</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VEsF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VEsF!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif" width="800" height="400" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Demo usage&quot;,&quot;title&quot;:&quot;Demo usage&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Demo usage" title="Demo usage" srcset="https://substackcdn.com/image/fetch/$s_!VEsF!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Test and refine your voice agents with Voice Lab</figcaption></figure></div><h2>Monitoring and Observability</h2><p>Understanding how AI agents perform in production is crucial for maintaining reliability and optimizing costs. These tools provide insights into agent behavior, resource usage, and performance metrics essential for running agents at scale.</p><ul><li><p><a href="https://github.com/traceloop/openllmetry">openllmetry</a> - an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications</p></li><li><p><a href="https://github.com/AgentOps-AI/agentops">AgentOps</a> - agent monitoring, LLM cost tracking, benchmarking, and more</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hMwg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hMwg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 424w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 848w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 1272w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hMwg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png" width="1456" height="840" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:840,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Session Replays&quot;,&quot;title&quot;:&quot;Session Replays&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Session Replays" title="Session Replays" srcset="https://substackcdn.com/image/fetch/$s_!hMwg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 424w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 848w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 1272w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Debug agents with AgentOps</figcaption></figure></div><h2>Simulation</h2><p>Before deploying agents to real-world scenarios, testing them in controlled environments is crucial. These simulation tools allow developers to validate agent behavior, test edge cases, and refine decision-making capabilities in safe, reproducible environments.</p><ul><li><p><a href="https://github.com/OpenBMB/AgentVerse">AgentVerse</a> - facilitates the deployment of multiple LLM-based agents in various applications, including simulations</p></li><li><p><a href="https://github.com/sierra-research/tau-bench">Tau-Bench</a> - a benchmark and testing code for agent-user interactions in real-world domains like retail and airline</p></li><li><p><a href="https://github.com/Farama-Foundation/chatarena">ChatArena</a> - multi-agent language game environments for research on autonomous LLM agents</p></li><li><p><a href="https://github.com/a16z-infra/ai-town">AI Town</a> - A virtual town where AI characters live, chat, and socialize</p></li><li><p><a href="https://github.com/joonspk-research/generative_agents">Generative Agents</a> - Stanford&#8217;s Interactive simulacra of human behavior</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!THzk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!THzk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 424w, https://substackcdn.com/image/fetch/$s_!THzk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 848w, https://substackcdn.com/image/fetch/$s_!THzk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 1272w, https://substackcdn.com/image/fetch/$s_!THzk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!THzk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png" width="1456" height="832" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Screen Shot 2023-10-16 at 10 53 49 PM&quot;,&quot;title&quot;:&quot;Screen Shot 2023-10-16 at 10 53 49 PM&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Screen Shot 2023-10-16 at 10 53 49 PM" title="Screen Shot 2023-10-16 at 10 53 49 PM" srcset="https://substackcdn.com/image/fetch/$s_!THzk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 424w, https://substackcdn.com/image/fetch/$s_!THzk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 848w, https://substackcdn.com/image/fetch/$s_!THzk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 1272w, https://substackcdn.com/image/fetch/$s_!THzk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Simulate agentic environments with AgentVerse</figcaption></figure></div><h2>Vertical Agents</h2><p>There are dozens of open vertical agents out there, so here are just a few select ones I&#8217;ve tinkered with and found the most useful:</p><ul><li><p><a href="https://github.com/All-Hands-AI/OpenHands">OpenHands</a> (Coding) - a platform for software development agents powered by AI</p></li><li><p><a href="https://github.com/Aider-AI/aider">aider</a> (Coding) - pair programming in your terminal</p></li><li><p><a href="https://github.com/gpt-engineer-org/gpt-engineer">GPT Engineer</a> (Low code) - build applications using natural language. Specify what you want to build, and the AI will ask for clarification before building it.</p></li><li><p><a href="https://github.com/abi/screenshot-to-code">screenshot-to-code</a> - convert screenshots into a functioning website using HTML/Tailwind/React/Vue</p></li><li><p><a href="https://github.com/assafelovic/gpt-researcher">GPT Researcher</a> (Research) - an autonomous agent that performs comprehensive research on any given topic</p></li><li><p><a href="https://github.com/vanna-ai/vanna">Vanna</a> (SQL) - chat with your SQL database</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RzPu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RzPu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 424w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 848w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 1272w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RzPu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg" width="687" height="500" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:500,&quot;width&quot;:687,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;aider screencast&quot;,&quot;title&quot;:&quot;aider screencast&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="aider screencast" title="aider screencast" srcset="https://substackcdn.com/image/fetch/$s_!RzPu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 424w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 848w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 1272w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Aider is a pair programming in a terminal</figcaption></figure></div><h1><strong>Looking Ahead</strong></h1><p>While this post focused on open-source packages with permissive licenses, I plan to publish another comprehensive list specifically for engineers building voice agents. This upcoming guide will include both open-source and commercial tools, covering solutions like OpenAI's Realtime API (speech2speech) and ElevenLabs (text2speech), along with detailed comparisons of their capabilities, pricing models, and ideal use cases.</p><p>Stay tuned for more deep dives in the AI Agents Series.</p><div><hr></div><p>Comprehensive list of open-source packages for AI engineers (last update: Aug &#8216;23)</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;976ec2c7-f9b6-4434-879a-bc69ba00e2ef&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a new section of AI Tidbits providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go!&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;md&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Open-source Generative AI&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-08-06T16:30:15.749Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/885bba4a-9f47-4763-82f1-b7b9196ed69d_1664x958.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-llms&quot;,&quot;section_name&quot;:&quot;Sahar's 2&#162;&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:135729768,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:19,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div>]]></content:encoded></item></channel></rss>