<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[AI Tidbits: AI Agents]]></title><description><![CDATA[Editorial takes and hands-on guides to master the new phase of AI.]]></description><link>https://www.aitidbits.ai/s/ai-agents</link><image><url>https://substackcdn.com/image/fetch/$s_!-amS!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png</url><title>AI Tidbits: AI Agents</title><link>https://www.aitidbits.ai/s/ai-agents</link></image><generator>Substack</generator><lastBuildDate>Sun, 03 May 2026 15:10:43 GMT</lastBuildDate><atom:link href="https://www.aitidbits.ai/feed" rel="self" type="application/rss+xml"/><language><![CDATA[en]]></language><webMaster><![CDATA[aitidbits@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[aitidbits@substack.com]]></itunes:email><itunes:name><![CDATA[Sahar Mor]]></itunes:name></itunes:owner><itunes:author><![CDATA[Sahar Mor]]></itunes:author><googleplay:owner><![CDATA[aitidbits@substack.com]]></googleplay:owner><googleplay:email><![CDATA[aitidbits@substack.com]]></googleplay:email><googleplay:author><![CDATA[Sahar Mor]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[LinkedIn Highlights, June 2025 - AI Agents Edition]]></title><description><![CDATA[Build smarter AI agents with six open-source tools and a bonus toolkit that optimizes Llama prompts by 45%]]></description><link>https://www.aitidbits.ai/p/linkedin-highlights-june-2025</link><guid 
isPermaLink="false">https://www.aitidbits.ai/p/linkedin-highlights-june-2025</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 13 Jul 2025 15:02:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!FtrP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Welcome to <em>LinkedIn Highlights</em>!</p><p>Each month, I'll share my <strong><s>five</s> seven top-performing LinkedIn posts</strong>, bringing you the best of AI straight from the frontlines of academia and industry. This edition includes seven posts instead of five&#8212;there were just too many good ones to leave out!</p><p>As a frequent <a href="https://www.linkedin.com/in/sahar-mor/">LinkedIn contributor</a>, I regularly share insights on groundbreaking papers, promising open-source packages, and significant AI product launches. These posts offer more depth and detail than our weekly snippets, providing a comprehensive look at the latest AI developments.</p><p>Over the past few months, I&#8217;ve been developing and experimenting with several open-source packages while creating a few AI agents. This post covers six of my most popular LinkedIn posts on the topic, each showcasing the package&#8217;s core features and my key takeaways. 
Plus, a bonus tip at the end: a toolkit that automatically converts prompts from GPT and Claude into Llama-optimized versions, boosting performance by up to 45%.</p><p>Whether you're not on LinkedIn or simply missed a post, this monthly roundup ensures you stay informed about the most impactful AI news and innovations.</p><div><hr></div><p>Recent posts:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;388f6d93-8119-4907-af3c-6bee43f24e9d&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents v2&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. 
Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-06-01T15:02:05.876Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents-updated&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:163224476,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:54,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;b8b6cc74-05b8-4c76-bd40-4e7968a82230&quot;,&quot;caption&quot;:&quot;This post is part of my 2&#162; series - my raw thoughts about recent topics in AI. Not always practical thoughts, but always thought-provoking. 
Some of my previous ones covered the new wave of conversational AI, economies of scale for foundation AI models&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Google I/O '25 - Research to reality&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-05-23T14:31:13.949Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F956c0295-b4fc-40d0-b6fb-26b04a4ec154_1718x962.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/google-io-25&quot;,&quot;section_name&quot;:&quot;Sahar's 2&#162;&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:164053881,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:60,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI 
Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>(1) LangMem</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FtrP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FtrP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!FtrP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg" width="1258" height="681" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:681,&quot;width&quot;:1258,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;text&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="text" title="text" srcset="https://substackcdn.com/image/fetch/$s_!FtrP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FtrP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71752e62-ce1c-46da-96ec-26ff0227f404_1258x681.jpeg 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>LangMem is a new open-source library that gives LLM agents long-term memory, and it&#8217;s refreshingly easy to use.</p><p>It&#8217;s built for developers working with LangGraph or custom agents, and it solves a persistent problem: how to make agents remember and adapt across sessions without bloated prompts or manual hacks.</p><p>LangMem introduces a clean memory API that works with any storage backend and includes tools for:</p><ol><li><p><strong>Storing important information during conversations</strong> - agents decide what matters and when to save it</p></li><li><p><strong>Searching memory when relevant</strong> - retrieving facts, preferences, or prior context</p></li><li><p><strong>Running background memory consolidation</strong> - 
automatically refining and updating knowledge over time</p></li></ol><p>It integrates natively with LangGraph&#8217;s memory store, but you can also plug it into your own stack using Postgres, Redis, or in-memory stores.</p><p>This design is especially useful for building agents that need to:</p><p>-&gt; Personalize interactions across sessions</p><p>-&gt; Maintain consistency in long-running workflows</p><p>-&gt; Adapt behavior based on evolving user input</p><p>Unlike Mem0, which requires explicit memory updates, LangMem handles memory automatically in the background, storing and retrieving key details as needed, and integrates with LangGraph out of the box.</p><p>GitHub repo <a href="https://github.com/langchain-ai/langmem">https://github.com/langchain-ai/langmem</a></p><div><hr></div><h2>(2) Browser Use</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!annP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!annP!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 424w, https://substackcdn.com/image/fetch/$s_!annP!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 848w, https://substackcdn.com/image/fetch/$s_!annP!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 1272w, 
https://substackcdn.com/image/fetch/$s_!annP!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!annP!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif" width="1456" height="1012" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1012,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;graphical user interface, text, application&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="graphical user interface, text, application" title="graphical user interface, text, application" srcset="https://substackcdn.com/image/fetch/$s_!annP!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 424w, https://substackcdn.com/image/fetch/$s_!annP!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 848w, https://substackcdn.com/image/fetch/$s_!annP!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 1272w, 
https://substackcdn.com/image/fetch/$s_!annP!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c19b678-4baf-4871-ac24-5b7428458e5b_2048x1424.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>LLM agents can read the web, but few can truly use it. 
This open-source package changes that.</p><p>Browser-use is an open-source library that turns any LLM into a browser-native agent, with first-class support for real UI actions and multi-step tasks.</p><p>Out of the box, it supports:</p><ol><li><p><strong>Direct interaction with Chromium via Playwright</strong> - no extra scripting layers or wrappers</p></li><li><p><strong>Seamless LLM integration</strong> - use GPT-4o, DeepSeek-V3, Claude, Gemini, or even Grok</p></li><li><p><strong>Ready-to-run UX</strong> - spin up agents with a single function, or test flows in the Web UI or CLI</p></li></ol><p>Browser-use can handle real-world tasks like:</p><p>-&gt; Checking your latest Stripe payouts and updating a financial tracking sheet</p><p>-&gt; Logging into your CMS, creating a draft blog post, and uploading media</p><p>-&gt; Scraping product reviews across sites and summarizing them in a shared doc</p><p>-&gt; Tracking changes to your competitors&#8217; pricing pages and alerting your team</p><p>For those building AI agents that go beyond chat, this gives you a direct bridge to the real web, not a sandbox.</p><p>GitHub repo <a href="https://github.com/browser-use/browser-use">https://github.com/browser-use/browser-use</a></p><div><hr></div><p>My recent post on coding with AI:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;cc758575-6171-4642-9be2-4af965bf34a0&quot;,&quot;caption&quot;:&quot;Welcome to the first post in the AI Coding Series, where I'll share the strategies and insights I've developed for effective AI-assisted coding. 
In upcoming posts, I'll delve deeper into leveraging tools like Cursor and Windsurf, share best practices for developing secure AI applications, and more.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Sahar&#8217;s Coding with AI guide&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-04-27T15:02:21.055Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d8476df-11fd-4f93-be3b-8ba7b5049fe1_1536x1024.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/sahar-ai-coding&quot;,&quot;section_name&quot;:&quot;AI Coding&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:162210580,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:52,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI 
Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><h2>(3) OpenAI Agents SDK</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6A1M!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6A1M!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!6A1M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg" width="1256" height="702" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:702,&quot;width&quot;:1256,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;text&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="text" title="text" srcset="https://substackcdn.com/image/fetch/$s_!6A1M!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 424w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 848w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!6A1M!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F17880bb0-e24f-43e9-bdc5-4f5650a92baa_1256x702.jpeg 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>OpenAI has one of the most useful frameworks for multi-agent workflows, and it&#8217;s open-source.</p><p>Building production-ready agent systems has been notoriously complex, requiring deep knowledge of orchestration patterns, handoff mechanisms, and debugging distributed AI behavior. 
The new OpenAI Agents SDK hides this complexity behind a clean Python interface that handles the heavy lifting.</p><p>Why I find this framework so useful:</p><ol><li><p><strong>Provider-agnostic design</strong> - works with OpenAI's APIs plus 100+ other LLMs, so you're not locked into a single provider</p></li><li><p><strong>Built-in handoffs</strong> - agents can seamlessly transfer control to specialized agents based on context, like routing Spanish queries to Spanish-speaking agents</p></li><li><p><strong>Integrated tracing</strong> - every agent run is traced automatically, and traces can be sent to popular tools such as AgentOps, Braintrust, and Arize AI Phoenix, which makes debugging multi-agent conversations far more tractable</p></li><li><p><strong>Guardrails by default</strong> - configurable safety checks for input and output validation prevent runaway behavior</p></li></ol><p>Setting up a triage system that routes conversations to language-specific agents takes just a few lines of code, with the SDK handling message persistence, context switching, and execution flow automatically.</p><p>I'm particularly impressed by the tracing capabilities - integrations with popular observability tools like Logfire, AgentOps, and Braintrust give you visibility into exactly what your agents are doing and why.</p><p>For developers who have been intimidated by the complexity of multi-agent architectures, this SDK removes a major barrier to building sophisticated agent workflows that actually work in production.</p><p>GitHub repo <a href="https://github.com/openai/openai-agents-python">https://github.com/openai/openai-agents-python</a></p><div><hr></div><h2>(4) Agno</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!LKGR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LKGR!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 424w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 848w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 1272w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LKGR!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif" width="800" height="579" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:579,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alternative text description for this 
image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alternative text description for this image" title="No alternative text description for this image" srcset="https://substackcdn.com/image/fetch/$s_!LKGR!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 424w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 848w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 1272w, https://substackcdn.com/image/fetch/$s_!LKGR!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6cb1a9ee-ea08-49ef-99d0-3f723a9f771c_800x579.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>Most agent frameworks I've used struggle with performance at scale, but I recently tested one that achieves microsecond-level instantiation.</p><p>The math doesn&#8217;t lie: if each agent takes seconds to spin up and consumes megabytes of memory, running the thousands needed for complex workflows becomes infeasible.</p><p>A new library called Agno addresses this through architectural decisions that prioritize performance without sacrificing functionality. The framework supports 23+ model providers and implements a progressive five-level agent architecture, from basic tool-enabled agents to coordinated multi-agent workflows.</p><p>Key technical capabilities include:</p><ol><li><p><strong>Native multimodal processing</strong> - handles text, image, audio, and video inputs without additional preprocessing layers</p></li><li><p><strong>First-class reasoning implementation</strong> - agents can explicitly "think through" problems using built-in reasoning tools or custom chain-of-thought approaches</p></li><li><p><strong>Agentic search with hybrid retrieval</strong> - combines vector search with keyword matching and re-ranking for improved RAG performance</p></li></ol><p>The performance difference is substantial. 
In head-to-head comparisons with LangGraph, Agno completes instantiation benchmarks before competing frameworks reach halfway through their measurement cycles.</p><p>Agno also includes pre-built FastAPI routes, structured output handling, session storage, and monitoring capabilities.</p><p>GitHub repo <a href="https://github.com/agno-agi/agno">https://github.com/agno-agi/agno</a></p><div><hr></div><h2>(5) Agents Towards Production</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Z0AB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Z0AB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg" width="1456" height="811" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:811,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;graphical user interface&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="graphical user interface" title="graphical user interface" srcset="https://substackcdn.com/image/fetch/$s_!Z0AB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Z0AB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F815994cb-6f03-4842-bd35-703585ade7ca_2048x1141.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft 
pc-display-flex pc-gap-8 pc-reset"></div></div></div></a></figure></div><p>A new, comprehensive, open-source playbook tackles the biggest challenge in developing AI agents: transitioning from experimentation to production-ready systems.</p><p>Unlike scattered documentation or theoretical frameworks, this resource provides executable tutorials that guide you from zero to a working implementation in minutes.</p><p>The playbook covers the entire agent lifecycle:</p><ol><li><p><strong>Orchestration fundamentals</strong> - build multi-tool workflows with memory persistence and agent-to-agent messaging using frameworks like Xpander and LangChain</p></li><li><p><strong>Production deployment</strong> - containerize agents with Docker, scale on 
GPU infrastructure via Runpod, or run on-premise with Ollama for privacy-sensitive applications</p></li><li><p><strong>Security and observability</strong> - implement real-time guardrails against prompt injection, add comprehensive tracing with LangSmith and Qualifire, and automate behavioral testing</p></li><li><p><strong>Advanced capabilities</strong> - enable dual-memory architectures with Redis for semantic search, integrate real-time web data through Tavily, and deploy agents as APIs with FastAPI</p></li></ol><p>What makes this resource invaluable is its tutorial-first approach. Each concept comes with runnable notebooks and production-ready code.<br><br>Whether you're building customer service agents, research assistants, or autonomous workflows, the playbook provides tested patterns for tool integration, multi-agent coordination, and model customization.<br><br>GitHub repo <a href="https://github.com/NirDiamant/agents-towards-production?utm_source=saharmor_linkedin">https://github.com/NirDiamant/agents-towards-production</a></p><div><hr></div><h2>(6) Docling</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mEga!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mEga!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 424w, https://substackcdn.com/image/fetch/$s_!mEga!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 848w, 
https://substackcdn.com/image/fetch/$s_!mEga!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!mEga!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mEga!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg" width="884" height="524" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:524,&quot;width&quot;:884,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alternative text description for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alternative text description for this image" title="No alternative text description for this image" srcset="https://substackcdn.com/image/fetch/$s_!mEga!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 424w, https://substackcdn.com/image/fetch/$s_!mEga!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 848w, 
https://substackcdn.com/image/fetch/$s_!mEga!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!mEga!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffbed19d2-80da-4214-a504-822c3fa9df8f_884x524.jpeg 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div>
   ]]></content:encoded></item><item><title><![CDATA[The Open-Source Toolkit for Building AI Agents v2]]></title><description><![CDATA[An opinionated, developer-first guide to building AI agents with real-world impact]]></description><link>https://www.aitidbits.ai/p/open-source-agents-updated</link><guid isPermaLink="false">https://www.aitidbits.ai/p/open-source-agents-updated</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 01 Jun 2025 15:02:05 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!91rM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em><strong>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.</strong></em></p><p>This one is an updated version of one of my recent popular posts, which outlined the most popular and useful open-source libraries for AI agent builders:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;f75e1eb7-db29-4502-92e1-96e0b2280e88&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. 
Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-28T12:31:04.504Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:152246141,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:599,&quot;comment_count&quot;:18,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p><br>The landscape of AI agent tools is evolving rapidly. After publishing my previous post, I received suggestions for additional packages I hadn't encountered. Over the past few months, I've tested these tools and documented new, valuable agent-related libraries. This post shares my updated insights.</p><p>If you often wonder, "What tools are people actually using to build voice agents or understand documents?"&#8212;this post is for you. 
With new packages emerging almost daily, it can be challenging to determine what's state-of-the-art and truly usable. This list is deliberately selective, focusing on the libraries I've personally found most effective, or those recommended by colleagues I trust.</p><p>In this post, I'll provide a curated and updated overview of the open-source ecosystem for developers building AI agents. While there&#8217;s no shortage of AI agent market maps, most are geared toward non-builders rather than developers who need actionable tools and frameworks to launch functional AI agents today.</p><p>Every package listed in this post allows commercial use and has a permissive open-source license.</p><p>Categories covered in this piece:<br>&#8594; Building and Orchestrating Agents (10)<br>&#8594; Computer Use (5)<br>&#8594; Browser Automation (5)<br>&#8594; Voice (12)<br>&#8594; Document Processing (7)<br>&#8594; Memory (3)<br>&#8594; Testing, Evaluation, and Observability (6)<br>&#8594; Vertical Agents (7)<br><br>Plus: </p><ul><li><p>Real-world agent stacks: Voice agent that answers phone calls + Browser agent that crawls LinkedIn URLs</p></li><li><p>Curated guides and tutorials to get started building agents</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!91rM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!91rM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 424w, 
https://substackcdn.com/image/fetch/$s_!91rM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 848w, https://substackcdn.com/image/fetch/$s_!91rM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 1272w, https://substackcdn.com/image/fetch/$s_!91rM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!91rM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png" width="1456" height="815" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:815,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2093453,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.aitidbits.ai/i/163224476?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!91rM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 424w, https://substackcdn.com/image/fetch/$s_!91rM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 848w, https://substackcdn.com/image/fetch/$s_!91rM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 1272w, https://substackcdn.com/image/fetch/$s_!91rM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><h2>Building and Orchestrating Agents</h2><p>To build agents that go beyond simple prompting, you need infrastructure for planning, memory, and tool use, and a way to hold it all together.</p><p>As more developers started shipping real-world agents, new frameworks popped up and older ones evolved to meet the actual challenges of agentic AI. This section covers the tools I&#8217;ve found most effective for building agents that can think, remember, and act with minimal hand-holding.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/oYI5j/13/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e7923925-4c70-4538-8642-0a7d81a049a9_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Agent Frameworks&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/oYI5j/13/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var 
r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>For beginners or rapid prototyping, consider <strong>Langflow</strong> for its intuitive visual interface, or <strong>OpenAI&#8217;s Agents SDK</strong> or <strong>LangChain</strong> for their simplicity and flexibility.</p></li><li><p>For enterprise applications, <strong>Portia</strong> and <strong>CrewAI</strong> offer robust features suitable for production environments requiring control and scalability.</p></li><li><p>For multimodal or memory-intensive agents, <strong>Agno</strong> provides lightweight support for agents needing persistent memory and multimodal inputs.</p></li><li><p>For complex simulations or data generation, <strong>Camel</strong> excels at creating customizable multi-agent systems for simulating real-world interactions.</p></li><li><p>For autonomous task execution, <strong>AutoGPT</strong> is designed for agents that need to operate without continuous human input.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gpJj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gpJj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 424w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 848w, 
https://substackcdn.com/image/fetch/$s_!gpJj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 1272w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gpJj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Welcome to Langflow | Langflow Documentation&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Welcome to Langflow | Langflow Documentation" title="Welcome to Langflow | Langflow Documentation" srcset="https://substackcdn.com/image/fetch/$s_!gpJj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 424w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 848w, 
https://substackcdn.com/image/fetch/$s_!gpJj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 1272w, https://substackcdn.com/image/fetch/$s_!gpJj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc904ca6c-ab30-454e-b358-c45f8cf2a1e3_5760x3240.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Langflow simplifies the creation of agents and workflows that integrate with any API, model, or 
database</figcaption></figure></div><div><hr></div><h2>Computer Use</h2><p>AI agents become far more useful when they can operate computers like humans: clicking, typing, browsing, and running programs. The libraries below make that possible, letting agents bridge the gap between language output and real-world action.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/jKBVM/5/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a9db64df-0672-4700-b6ac-c74d0bd123d0_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Computer Use&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/jKBVM/5/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>For local code execution via natural language, go with <strong>Open Interpreter</strong> &#8211; it&#8217;s fast to set up and great for command-driven agents.</p></li><li><p>For agents that need to see and control a computer screen like a human, <strong>Self-Operating Computer</strong> is your best bet.</p></li><li><p>If your agent needs to run in a secure, fast, sandboxed environment, use <strong>CUA</strong>.</p></li><li><p>For dynamic multi-step tasks on irregular interfaces, <strong>Agent-S</strong> offers the most flexibility with its 
planning and learning capabilities.</p></li><li><p>If your agent relies on interpreting UIs from screenshots (e.g., grounding actions in visual layouts), <strong>OmniParser</strong> adds critical visual parsing capabilities.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1c1O!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1c1O!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 424w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 848w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 1272w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1c1O!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif" width="800" height="502" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:502,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!1c1O!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 424w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 848w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 1272w, https://substackcdn.com/image/fetch/$s_!1c1O!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8331c21f-e1cd-41fb-8db0-8409afc47ac5_800x502.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Using CUA to edit a photo in Photoshop. 
All actions in this video are automated from a given natural language prompt.</figcaption></figure></div><div><hr></div><h2>Browser Automation</h2><p>As AI agents move from passive reasoning to active execution, the browser becomes their gateway to the internet. Whether scraping data, submitting forms, or navigating complex workflows, browser automation tools let agents interact with web pages just like a human would&#8212;with clicks, scrolls, and typed input. 
These libraries differ in abstraction level, performance, and agent integration, so choosing the right one depends on your goals.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/1CVsC/11/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/86b7f52e-a2f5-4fba-809a-e88be3f1534f_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Browser Automation&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/1CVsC/11/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><br><strong>How to choose?</strong></p><ul><li><p>For a low-code, declarative approach where the LLM plans the steps, try <strong>Stagehand</strong>.</p></li><li><p>If you&#8217;re building agents that need to deeply understand and extract content from websites, <strong>Firecrawl</strong> offers the cleanest pipeline.</p></li><li><p>For LLM-friendly control over browser actions with integration hooks, I&#8217;d recommend the popular <strong>browser-use</strong>.</p></li><li><p>Choose <strong>Playwright</strong> if you need lower-level, cross-browser control over browser actions.</p></li><li><p>Use <strong>Puppeteer</strong> if you need fast, scriptable Chrome automation in a Node.js environment.</p></li></ul><div class="captioned-image-container"><figure><a 
class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dX7B!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dX7B!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 424w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 848w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 1272w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dX7B!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif" width="1456" height="685" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:685,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;442961777-a0ffd23d-9a11-4368-8893-b092703abc14.gif [optimize output 
image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="442961777-a0ffd23d-9a11-4368-8893-b092703abc14.gif [optimize output image]" title="442961777-a0ffd23d-9a11-4368-8893-b092703abc14.gif [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!dX7B!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 424w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 848w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 1272w, https://substackcdn.com/image/fetch/$s_!dX7B!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8ef20a49-850a-47e7-b498-7ff6e7feffb7_1809x851.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Adding grocery items to a cart and checking out using 
browser-use</figcaption></figure></div><div><hr></div><h2>Voice</h2><p>Voice is still the most intuitive interface for humans, and increasingly, for agents too. These tools let agents handle speech in and out: understanding spoken language, keeping track of conversations, and responding naturally.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/dZHX8/7/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/553b4993-29fe-4281-9999-34339fd53c64_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Voice (Speech-to-Speech, Speech-to-Text, Text-to-Speech)&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/dZHX8/7/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var 
t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><p>I wrote a whole post covering open and commercial packages and APIs for voice agents, including my guide for choosing the right ones:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;09c72fb4-1a0a-407f-bc3a-9c9753d3b215&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Voice Agents Toolkit for Builders&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. 
Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-12-22T13:00:57.856Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/voice-agents-toolkit&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:153274984,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:40,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;2d0fc376-c155-40d6-848e-1452018da268&quot;,&quot;duration&quot;:null}"></div><p>Dia <a href="https://yummy-fir-7a4.notion.site/dia">compared</a> to ElevenLabs and Sesame &#128070;</p><div><hr></div><h2>Document Processing</h2><p>Modern AI agents must process and comprehend documents in various formats, from PDFs to images containing text. 
The following open-source tools empower agents to extract, interpret, and act upon information from unstructured documents, facilitating real-world business processes.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/LW5mf/4/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ef10c5b5-fa34-4c5c-a52f-009b3a0cd757_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Document Processing&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/LW5mf/4/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>For long-form PDFs such as contracts and research papers, use <strong>Qwen2.5-VL</strong> or <strong>mPLUG-DocOwl2</strong> for efficient multi-page understanding without relying on OCR. 
You can now also fine-tune a DocOwl2 model on your own data with <a href="https://github.com/modelscope/ms-swift">ms-swift</a>.</p></li><li><p>For text-plus-image documents such as medical reports and annotated diagrams, try <strong>Molmo</strong> for high-resolution multimodal inputs, visual QA, and GUI parsing.</p></li><li><p>For layout analysis and table extraction, use <strong>Docling</strong> for JSON/Markdown conversion, or <strong>LayoutLMv3</strong> for form understanding and layout-aware modeling.</p></li><li><p>For lightweight multimodal processing with speech, <strong>Phi-4-multimodal</strong> handles text, vision, and speech in a single compact model, which makes it a good fit for on-device agents.</p></li></ul><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;ef8c076b-5f34-4f87-8d53-e6fab14c95da&quot;,&quot;duration&quot;:null}"></div><div><hr></div><h2>Memory</h2><p>To feel truly intelligent, AI agents need memory. Without it, they&#8217;re stuck in single-turn loops, forgetting what just happened, what the user wants, or what they already did. 
The libraries below help agents remember, adapt, and personalize, enabling everything from contextual conversations to long-term planning.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/Lodm4/4/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e14ea69-cfbe-4ca0-8bba-01fa52b9f901_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Memory&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/Lodm4/4/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>For long-term, personalized memory that improves with use, <strong>Mem0</strong> is a solid choice, especially if you want minimal overhead.</p></li><li><p>Use <strong>Letta</strong> when building agents that need persistent memory across sessions and integration with tools or APIs.</p></li><li><p>For agents that actively manage their own memory and share knowledge with one another, <strong>LangMem</strong> supports dynamic memory operations and shared knowledge bases.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" 
href="https://substackcdn.com/image/fetch/$s_!XwTg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XwTg!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 424w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 848w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 1272w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XwTg!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif" width="800" height="448" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:448,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output 
image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!XwTg!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 424w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 848w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 1272w, https://substackcdn.com/image/fetch/$s_!XwTg!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F736173d0-250a-453b-bdec-0c295b18c2c1_800x448.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Creating stateful agents with 
Letta</figcaption></figure></div><div><hr></div><h2>Testing, Evaluation, and Observability</h2><p>As agents grow more complex, they need to be tested, measured, and monitored like any serious software system. These tools help you catch edge cases, debug behavior, and track performance, both during development and in production.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/Zyh1j/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6b259fe8-e172-4627-995b-07fd5da7e71b_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Open-source Testing, Evaluation, and Observability&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/Zyh1j/2/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p>To monitor and benchmark agent performance in 
production environments, <strong>AgentOps</strong> provides robust tracking and analysis tools.</p></li><li><p>When comparing various agent configurations or conducting A/B tests, <strong>Agenta</strong> facilitates structured evaluations.</p></li><li><p>To integrate observability into LLM applications, <strong>OpenLLMetry</strong> leverages OpenTelemetry for seamless monitoring.</p></li><li><p>If detecting and addressing performance, bias, or security issues is a priority, <strong>Giskard</strong> offers automated scanning capabilities.</p></li><li><p>For comprehensive LLM observability and debugging, <strong>Langfuse</strong> provides an open-source platform tailored for LLM applications.</p></li><li><p>For voice agent evaluation across different models and prompts, <strong>VoiceLab</strong> offers a comprehensive testing framework.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VEsF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VEsF!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1272w, 
https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif" width="800" height="400" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Demo usage&quot;,&quot;title&quot;:&quot;Demo usage&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Demo usage" title="Demo usage" srcset="https://substackcdn.com/image/fetch/$s_!VEsF!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1272w, 
https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Test and refine your voice agents with Voice Lab</figcaption></figure></div><div><hr></div><h2>Vertical Agents</h2><p>The open-source world is full of vertical agents: purpose-built tools for coding, research, data analysis, and more. I&#8217;ve tested a bunch. 
These are the ones I&#8217;d actually reach for when building something real.</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/8yunC/1/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/031409f1-908b-4573-b0d8-9a01bbeb1946_1260x660.png&quot;,&quot;thumbnail_url_full&quot;:&quot;&quot;,&quot;height&quot;:400,&quot;title&quot;:&quot;Vertical Agents&quot;,&quot;description&quot;:&quot;https://aitidbits.ai/p/open-source-agents-updated&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/8yunC/1/" width="730" height="400" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><strong><br>How to choose?</strong></p><ul><li><p><strong>Goose</strong> supports custom workflow integrations for building extensible AI coding assistants.</p></li><li><p>For comprehensive coding agents with GUI capabilities, <strong>OpenHands</strong> offers a full-stack solution inspired by Devin.</p></li><li><p>If you prefer Claude Code-like, terminal-based pair programming, <strong>aider</strong> provides Git integration and multi-file editing.</p></li><li><p>To convert UI designs from images to code, <strong>screenshot-to-code</strong> automates the prototyping process.</p></li><li><p>For autonomous research tasks, <strong>GPT Researcher</strong> can scrape, summarize, and export findings efficiently.</p></li><li><p>For conducting in-depth, privacy-focused research using local LLMs, <strong>Local Deep Research</strong> offers iterative 
analysis and comprehensive, cited reports.</p></li><li><p>If your focus is on generating SQL queries from text, <strong>Vanna</strong> offers customizable and database-integrated solutions.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NZzC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NZzC!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NZzC!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif" width="800" height="450" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:450,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;430601405-8fcaaa4c-31e5-4814-89b4-94f1433d139d.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="430601405-8fcaaa4c-31e5-4814-89b4-94f1433d139d.mp4 [optimize output image]" title="430601405-8fcaaa4c-31e5-4814-89b4-94f1433d139d.mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!NZzC!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!NZzC!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc2ec8571-c1a7-4945-9fb4-aad5cda51ebc_800x450.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" 
viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">GPT Researcher can conduct research using local and web sources</figcaption></figure></div></li></ul><div><hr></div><p>Lastly, here are real-world stacks and beginner-friendly tutorials to help you launch your first AI agent.</p><h2><strong>Real-world agent stacks</strong></h2><p>All the tools above are powerful on their own, but how do they actually fit together in practice? What does a real architecture look like when you're stitching these components into something usable, testable, and shippable? I&#8217;ve compiled a few concrete examples from recent open-source projects and builders in the space. If you're trying to move from &#8220;exploring tools&#8221; to &#8220;building real systems&#8221;, these will give you a head start.</p><h3>Voice agent that answers phone calls</h3>
      <p>
          <a href="https://www.aitidbits.ai/p/open-source-agents-updated">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Rewiring the Internet: Commerce in the Age of AI Agents]]></title><description><![CDATA[How commerce, payments, and marketing should evolve for an agent-mediated internet]]></description><link>https://www.aitidbits.ai/p/ai-agents-commerce</link><guid isPermaLink="false">https://www.aitidbits.ai/p/ai-agents-commerce</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 19 Jan 2025 15:01:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI. Some of my previous posts listed the <a href="https://www.aitidbits.ai/p/open-source-agents">open-source toolkit for AI Agents builders</a>, the <a href="https://www.aitidbits.ai/p/economies-of-scale-gen-ai">economies of scale for foundational AI models</a>, and <a href="https://www.aitidbits.ai/p/ai-agents-web-infrastructure">the infrastructure required to power the agentic AI era</a>.</em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;8d7854eb-9972-426b-a349-8a422b74643c&quot;,&quot;duration&quot;:1297.2408,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>December 2028. Maria's AI agent is negotiating simultaneously with twelve different vendors for her daughter's upcoming birthday party. 
Within minutes, it has secured the perfect cake from a local bakery (after verifying their nut-free certification), booked an entertainer with stellar safety ratings, and coordinated custom goodie bags filled with each child's favorite treats (after checking allergies and dietary restrictions with the other parents' agents)&#8212;all while staying 15% under budget. What would have taken Maria hours of calls, emails, and anxiety about vendor reliability now happens seamlessly through a web of agent-to-agent interactions powered by the new infrastructure we explored in <a href="https://www.aitidbits.ai/p/ai-agents-web-infrastructure">our previous post</a>.</p><p>The revolution in web infrastructure we discussed in previous posts isn't just theoretical&#8212;it's enabling fundamental changes in how commerce, marketing, and customer service function. As <a href="https://www.aitidbits.ai/p/ai-agents-web-infrastructure#:~:text=intelligently.%20Enter%20the-,Agent%20Passport,-.">agent passports</a> and trust protocols become standardized, we're witnessing the emergence of entirely new commercial paradigms.</p><p>With the recent release of Tasks by OpenAI, which equips ChatGPT&#8212;its consumer-facing AI&#8212;with the ability to perform tasks behind the scenes on behalf of users, it's now easier than ever to envision a future where ChatGPT seamlessly navigates the web and handles complex operations for us.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HfZE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!HfZE!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 424w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 848w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 1272w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HfZE!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif" width="603" height="401.74875" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:533,&quot;width&quot;:800,&quot;resizeWidth&quot;:603,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;OPEN_AI_TASKS.mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="OPEN_AI_TASKS.mp4 [optimize output image]" title="OPEN_AI_TASKS.mp4 [optimize output image]" 
srcset="https://substackcdn.com/image/fetch/$s_!HfZE!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 424w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 848w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 1272w, https://substackcdn.com/image/fetch/$s_!HfZE!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6a77a4c1-97c1-430a-8118-03c229d35c6d_800x533.gif 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" 
class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">ChatGPT can now set reminders and perform recurring actions</figcaption></figure></div><p>Today, we'll explore how an agent-first internet will reshape domains like payments, marketing, support, and localization.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jjvk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jjvk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 424w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 848w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 1272w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!jjvk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png" width="1456" height="818" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:818,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:650545,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jjvk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 424w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 848w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 1272w, https://substackcdn.com/image/fetch/$s_!jjvk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F69ffb5cf-09fd-4002-94b1-1ef3c069ab0e_2384x1340.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft 
pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>Agentic payments</strong></h2><p>Remember when online shopping first emerged, and entering your credit card details on a website felt risky? Card networks like Visa and Mastercard and banks like Chase and Barclays had to rapidly adapt to the digital realm, introducing new protocols like CVV codes and secure payment gateways to protect consumers and merchants alike. This evolution was pivotal in building trust and facilitating the e-commerce boom of the early 2000s, giving birth to digital payment giants like Stripe and PayPal.</p><p>Similarly, the payment infrastructure that powers today's internet was built around a simple dichotomy: card-present versus card-not-present transactions and the assumption that all payments are human-initiated. 
The internet already faced a similar shift with the introduction of subscription payments in the early days of e-commerce, in which customers input their payment credentials once, allowing a merchant to charge them repeatedly in the future. Recurring payments, now a dominant commerce mechanism projected to surpass <a href="https://www.pymnts.com/subscription-commerce/2023/how-transparency-and-high-earners-power-a-96-billion-subscription-industry/">$2.4T in spending</a> by 2028, underscore how critical adaptive payment systems are to supporting global e-commerce. Yet, even more than a decade later, there are still countries like India that <a href="https://www.pymnts.com/news/international/2021/did-rbi-rule-kill-recurring-payments-india/">impose strict controls</a> on recurring payments, highlighting the complexities of adapting systems to new paradigms.</p><p>But what happens when the legitimate transacting party isn&#8217;t human at all?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kZRV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kZRV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 424w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 848w, 
https://substackcdn.com/image/fetch/$s_!kZRV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 1272w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kZRV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif" width="689" height="340.19375" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:395,&quot;width&quot;:800,&quot;resizeWidth&quot;:689,&quot;bytes&quot;:492929,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/gif&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kZRV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 424w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 848w, 
https://substackcdn.com/image/fetch/$s_!kZRV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 1272w, https://substackcdn.com/image/fetch/$s_!kZRV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42913aa8-104c-407b-ac1d-04a2db814bf0_800x395.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Digital payment systems are built around human users, employing measures like CVV codes and billing address verification to prevent 
fraud. These methods assume a human is initiating the transaction, making them ill-suited for autonomous agent transactions.</p><p>One major challenge is fraud detection and resolution&#8212;an area I intimately understand from my time as a PM at Stripe, where I worked closely with card issuers to develop sophisticated fraud prevention systems. The current paradigm relies heavily on human behavioral patterns: typing speed, mouse movements, time spent reviewing checkout pages, and other signals that indicate legitimate human activity. But in an agent-driven world, these signals become obsolete.</p><p>During my time at Stripe, we saw how crucial these behavioral signals were for differentiating legitimate transactions from fraudulent ones. The shift to agent-driven commerce demands an entirely new approach. Rather than looking for signs of human behavior, we'll need systems that verify agent legitimacy, authorization scope, and decision-making logic.</p><p>This shift raises critical questions for fraud prevention across banks (Chase, Barclays), card networks (Visa, Mastercard), and payment processors (Stripe, PayPal). For instance, how do you establish a chain of trust when an agent purchases on behalf of a user? How do you verify that an agent hasn't been compromised or hijacked? 
These challenges require fundamentally rethinking our approach to transaction security and fraud prevention in an agent-driven ecosystem.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YH5H!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YH5H!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 424w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 848w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 1272w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YH5H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png" width="650" height="585" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1260,&quot;width&quot;:1400,&quot;resizeWidth&quot;:650,&quot;bytes&quot;:558331,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YH5H!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 424w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 848w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 1272w, https://substackcdn.com/image/fetch/$s_!YH5H!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65382226-3d0c-41c7-95d4-59d12e5da924_1400x1260.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Fraud resolution in the era of AI agents</figcaption></figure></div><p>Future payment systems could introduce 'agent wallets' with granular spending controls, such as $100 limits for trusted merchants like Amazon and stricter caps for lesser-known websites. These wallets would integrate real-time fraud detection, submitting cryptographic evidence for disputes and maintaining transparent, auditable records of agent actions tied to human authorization.</p><p>Visa could introduce an agent-specific flag to the existing payment protocols, indicating that a payment was initiated autonomously, along with a trail of the reasoning and actions that led to it. 
Meanwhile, Stripe might expand its SDKs to enforce programmable payment rules, ensuring alignment with user-delegated instructions (Stripe has already made its foray into Agentic payments with its recent <a href="https://stripe.dev/blog/adding-payments-to-your-agentic-workflows">SDK release</a>).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8OjO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8OjO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 424w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 848w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 1272w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8OjO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png" width="671" height="432.0833333333333" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:850,&quot;width&quot;:1320,&quot;resizeWidth&quot;:671,&quot;bytes&quot;:153975,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8OjO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 424w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 848w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 1272w, https://substackcdn.com/image/fetch/$s_!8OjO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F51162bd5-52fd-4d16-858a-e4ff07409f1e_1320x850.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Stripe&#8217;s new <a href="https://docs.stripe.com/agents">Agents SDK</a> supports Vercel&#8217;s AI SDK, LangChain, and CrewAI</figcaption></figure></div><p>Beyond preventing fraud, agentic payments face fundamental economic and infrastructural challenges. The existing payment infrastructure wasn't architected for the high-frequency, low-latency transactions that characterize agent interactions. Consider the standard pricing model of payment processors like Stripe: a 2.9% fee plus 30&#162; per transaction. While manageable for traditional e-commerce, this fee structure becomes prohibitively expensive for the many micro-transactions that agents might need to execute: on a $1 purchase, the fee comes to roughly 33&#162;, about a third of the transaction value.</p><p>This pricing isn't arbitrary&#8212;it reflects the complex web of stakeholders in the traditional payment chain. Card networks like Visa and issuers like Chase have built their business models around these transaction fees. 
Interestingly, Stripe's recent acquisition of <a href="https://www.bridge.xyz/">Bridge</a>, a stablecoin payment infrastructure provider, hints at a potential solution. By leveraging blockchain-based payment rails, companies could facilitate agent-to-agent transactions without incurring the expensive overhead of traditional payment networks. This move suggests a growing recognition that the future of payments may require completely new infrastructure, optimized for the unique demands of autonomous agents.</p><p>Consider how a new payment protocol might work in practice. An "Agent Payment Protocol" (APP) could include:</p><ul><li><p><strong>Delegation Chain Verification</strong> - a cryptographic proof chain showing the agent's authorization to make specific types of purchases</p></li><li><p><strong>Transaction Context Object</strong> - machine-readable metadata including:</p><ul><li><p>Reasoning trail that led to the purchase decision</p></li><li><p>Reference to specific user preferences/rules that were satisfied</p></li><li><p>Confidence score for the decision</p></li></ul></li><li><p><strong>Smart Spending Controls</strong> - programmable constraints like:</p><ul><li><p>Category-specific limits (e.g., $200 for groceries, $50 for entertainment)</p></li><li><p>Merchant-specific trust scores</p></li><li><p>Required human confirmation above certain thresholds</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1pZZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!1pZZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 424w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 848w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 1272w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png" width="682" height="654.3838028169014" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1090,&quot;width&quot;:1136,&quot;resizeWidth&quot;:682,&quot;bytes&quot;:331497,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!1pZZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 424w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 848w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 1272w, https://substackcdn.com/image/fetch/$s_!1pZZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47852237-a5dc-410b-85ac-1108386b127d_1136x1090.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" 
stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li></ul><p>Major payment providers could implement this through extensions to existing standards. For instance, Visa's existing 3D Secure protocol could add an agent verification layer, while Stripe's API could introduce new parameters for agent-specific transaction metadata.</p><p>While payment infrastructure provides the foundation for agent-driven commerce, the very nature of how we complete transactions must also evolve. The familiar checkout process&#8212;a hallmark of e-commerce for decades&#8212;is about to undergo its own transformation.</p><h2><strong>Redefined checkout experience</strong></h2><p>In an agent-first environment, the concept of a traditional checkout&#8212;where a human user confirms their purchase by clicking a &#8220;Buy&#8221; button&#8212;fades into the background. Instead, agents operate with predefined goals and parameters, continuously evaluating whether a proposed transaction aligns with those objectives. Rather than halting everything at a payment prompt, agents could integrate a &#8220;stop and reflect&#8221; step into their workflows. For example, if a user&#8217;s agent is tasked with booking a flight seat that&#8217;s both a window seat and close to an exit, the agent pauses before completing the reservation. 
It double-checks that the seat assignment matches the user&#8217;s criteria and only then proceeds, ensuring flawless execution of the user&#8217;s intent and mitigating the probability of the agent going off the rails due to hallucinations.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!p6Gt!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!p6Gt!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 424w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 848w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 1272w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:null,&quot;width&quot;:null,&quot;resizeWidth&quot;:60,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Stripe Blog: Product&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Stripe Blog: Product" title="Stripe Blog: Product" srcset="https://substackcdn.com/image/fetch/$s_!p6Gt!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 424w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 848w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 1272w, https://substackcdn.com/image/fetch/$s_!p6Gt!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F83b2fdcb-bdc2-4fdd-add1-47d086546a46_1920x1080.gif 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Stripe perfected its checkout experience with its Optimized Checkout Suite, which dynamically shows customers the most relevant payment methods using machine learning, leading to a 10.5% increase in revenue</figcaption></figure></div><p>This reflective process transforms the final authorization into a subtle verification loop 
rather than a jarring user interruption. The agent reviews the selected attributes&#8212;price, seat location, baggage allowance, and cancellation policy&#8212;and compares them against the user&#8217;s stored preferences and constraints. It confirms not only that the requested outcome has been met but also that it falls within acceptable spending limits and trust parameters. This transforms purchasing from a manual &#8220;Are you sure?&#8221; prompt into a nuanced, data-driven decision matrix.</p><p>As these agent-mediated transactions proliferate, payment providers and merchants might offer additional layers of context-aware validation. For instance, when an agent chooses a specific insurance add-on, the payment system could prompt the agent to confirm whether its logic correctly interpreted the user&#8217;s needs. This transparent chain of reasoning, visible to the agent and logged for future reference, ensures that each transaction stands up to scrutiny. Ultimately, the checkout step evolves from a user-facing choke point to an agent-managed quality control measure, minimizing errors and elevating the overall integrity of automated commerce.</p><p>As agents reshape how we complete purchases, they're also forcing us to rethink how businesses attract and engage customers in the first place. The era of human-centric marketing campaigns is giving way to something far more systematic and efficient.</p><h2><strong>Agent-driven marketing and promotions</strong></h2><p>Marketing campaigns and promotions will evolve radically in an AI agent-mediated economy.</p><p>Traditional email marketing and coupon distribution systems, designed around human attention and impulse, will give way to programmatic offer networks where consumers&#8217; AI agents maintain persistent queries about their principals' needs and preferences. 
These agents subscribe to vendor APIs that broadcast real-time offers matching specific criteria, enabling hyper-personalized deal discovery that transcends the limitations of batch email campaigns.</p><p>Vendors might maintain agent-first promotional channels that communicate in structured data formats, allowing instant price comparison and benefit analysis. This ecosystem could enable &#8220;intent casting,&#8221; where agents broadcast shopping goals to trusted vendor networks, receiving precisely targeted offers that align with the principal's timing, budget, and preferences&#8212;all without cluttering a human inbox or requiring manual coupon management.</p><p>Consider a practical example: A user instructs their agent to monitor high-end fashion retailers for specific items within their style preferences and budget constraints. Rather than the user repeatedly checking websites or subscribing to countless email lists, their agent continuously monitors multiple vendors:</p><ul><li><p>Real-time inventory tracking across size, color, and style variations</p></li><li><p>Dynamic price monitoring, including flash sales and member-exclusive discounts</p></li><li><p>Evaluation of shipping times and costs to the user's location</p></li></ul><p>When ideal conditions align&#8212;perhaps a preferred sweater hits the target price point during an end-of-season sale&#8212;the agent can either notify the user or execute the purchase automatically based on pre-authorized parameters. 
This transforms shopping from an attention-demanding activity into an efficient background process governed by clear rules and preferences.</p><p>Major retailers like Nordstrom or ASOS could expose agent-specific APIs that provide structured access to:</p><ul><li><p>Real-time inventory and pricing data</p></li><li><p>Detailed product specifications and measurements</p></li><li><p>Membership program benefits and restrictions</p></li><li><p>Regional availability and shipping constraints</p></li></ul><p>This evolution democratizes personal shopping, allowing everyone to benefit from persistent, intelligent monitoring of their fashion preferences&#8212;not just those who can afford human personal shoppers. It also enables retailers to better match inventory with actual customer intent, reducing overhead from unsold merchandise and improving supply chain efficiency.</p><p>Deal-finding tools like Honey may need to pivot toward agent-optimized products that integrate directly with vendor APIs, allowing agents to query real-time discounts and rewards. Similarly, Mailchimp and HubSpot could develop agent-oriented campaign frameworks that distribute offers as structured data streams rather than traditional email blasts, ensuring seamless integration with agent-driven workflows.</p><h2><strong>Agent-native customer support</strong></h2><p>Customer support today centers on human interactions or user-facing chatbots. In an agent-first paradigm, this shifts to agent-to-agent communication. Personalized AI agents will directly engage with business systems to resolve issues, retrieve shipping or refund policies, or autonomously initiate returns. This evolution will streamline processes, reduce human intervention, and enhance efficiency in support workflows.</p><p>Support is then no longer solely about human users contacting businesses through chat widgets or call centers. Instead, autonomous agents interact directly with enterprise systems, pulling diagnostic information, requesting refunds, or escalating complex issues to a scarcer, more expensive resource such as a human or a stronger model (e.g., o1 over GPT-4o). This shift encourages platforms like Intercom to develop agent-oriented communication layers&#8212;specialized APIs that allow autonomous agents to navigate support options, retrieve knowledge base articles, and submit detailed troubleshooting requests without human intervention.</p><p>These agent-facing APIs would streamline issue resolution, allowing routine queries, such as package tracking, account verification, or policy clarifications, to be handled agent-to-agent, drastically reducing response times. 
As soon as a problem arises, the user&#8217;s agent can pinpoint the issue and connect with the business&#8217;s support agent (be it a specialized LLM or a human representative), negotiating resolutions or applying discounts as needed. The result is a fluid, automated dialogue that avoids the frustration and latency of human-mediated support.</p><p>Over time, companies could implement reputation scoring systems that measure how efficiently their support agents (both human and AI) interact with consumer agents. Metrics like resolution speed, policy clarity, and refund accuracy become machine-readable signals, informing user agents which vendors offer superior support experiences. As more vendors embrace these standards, the entire support ecosystem evolves: prompt, well-structured responses become the norm, and agent-native customer support becomes a hallmark of high-quality digital services.</p><p>Imagine a complex warranty claim scenario: Your agent detects that your new laptop's battery is degrading unusually fast. It immediately:</p><ol><li><p>Collects diagnostic data and usage patterns</p></li><li><p>Cross-references warranty terms with actual performance</p></li><li><p>Initiates a support interaction with the manufacturer's agent</p></li><li><p>Negotiates a resolution based on precedent cases</p></li><li><p>Arranges shipping for replacement parts or full device replacement</p></li><li><p>Schedules a technician visit if needed</p></li></ol><p>This entire process happens without human intervention unless exceptional circumstances arise. The interaction generates a complete audit trail, including all diagnostic data, communication logs, and decision points&#8212;valuable data for improving both product quality and support processes.</p><p>One example of agentic customer support is <a href="https://sierra.ai/">Sierra</a>, a startup targeting the large customer support market by embedding AI agents into business workflows. 
Their conversational agents handle complex queries with contextual precision, managing tasks such as processing returns or updating subscriptions. While their primary focus remains on serving human customers, the foundation they&#8217;ve built is clearly aligned with an AI agent-driven future. With access to company policies (e.g., refund and shipping rules) and robust conversational AI infrastructure (spanning LLMs and voice interfaces), Sierra is well-positioned to seamlessly transition to support agent-to-agent interactions as demand evolves.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cVCp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cVCp!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 424w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 848w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 1272w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!cVCp!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif" width="618" height="583.2375" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:755,&quot;width&quot;:800,&quot;resizeWidth&quot;:618,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp (25).mp4 [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp (25).mp4 [optimize output image]" title="temp (25).mp4 [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!cVCp!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 424w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 848w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 1272w, https://substackcdn.com/image/fetch/$s_!cVCp!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbc196221-5e5c-47f3-80f3-b8c4312b632d_800x755.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button 
tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sierra&#8217;s agents resolve support tickets autonomously, with Sierra customers only paying for <a href="https://sierra.ai/blog/outcome-based-pricing-for-ai-agents">resolved tickets</a></figcaption></figure></div><h2><strong>The end of language-optimized interfaces</strong></h2><p>As AI agents seamlessly translate and interpret information on the fly, the need for painstakingly maintained multilingual websites diminishes. Instead of forcing businesses to host separate English, French, or Mandarin versions of their interfaces, agents handle language conversion dynamically. 
This capability allows brands to maintain a single, streamlined codebase while ensuring that users, regardless of location, receive content and instructions in their preferred language&#8212;instantly and accurately.</p><p>Website builders like Webflow and Wix could evolve into platforms that generate &#8220;universal templates&#8221; optimized for agent interpretation rather than human linguistic preferences. Instead of focusing on localized landing pages, these platforms would produce standardized, machine-readable structures enriched with metadata and semantic cues. Agents, armed with cutting-edge language models, would then adapt the presentation layer for each user, including local dialects, cultural nuances, and even personalization cues drawn from the user&#8217;s profile.</p><p>The transformation goes beyond simple translation. Agents will handle complex cultural adaptations across multiple dimensions simultaneously. They'll dynamically adjust pricing strategies for different markets while modifying product descriptions to reflect local preferences and purchasing patterns. These agents will intelligently adapt imagery and design elements to ensure cultural appropriateness, automatically managing regional compliance requirements such as privacy policies or consumer protection disclosures. They'll even personalize communication styles based on cultural norms, shifting between formal and casual tones and adapting messaging cadence to match local expectations. This comprehensive cultural intelligence transforms what was once a labor-intensive localization process into a fluid, automated system that maintains cultural authenticity across all customer touchpoints.</p><p>In this new reality, the value proposition of website builders shifts from localization to robustness, structure, and data integrity. 
Rather than wrestling with manual translations or commissioning multiple language variants, businesses can rely on well-defined data schemas and agent-ready manifests. As a result, the concept of &#8220;language-optimized&#8221; sites becomes obsolete, replaced by fluid, dynamic interfaces that transcend linguistic barriers.</p><h2><strong>Small vs. large business impact</strong></h2><p>This transition creates both opportunities and challenges across the business spectrum. Large enterprises can invest in building sophisticated agent interfaces and maintaining complex agent-ready APIs. However, small businesses might initially struggle with the technical requirements and infrastructure costs.</p><p>To bridge this gap, we will likely see the emergence of "agent-enablement platforms"&#8212;services that help small businesses become agent-ready without significant technical investment. Think of them as the Shopify of the agent era, providing standardized tools that level the playing field. These platforms would offer pre-built solutions for creating agent-readable product catalogs and managing automated pricing and inventory systems. They would include standardized support protocols that small businesses can easily implement, along with simplified integration paths to agent payment systems. By democratizing access to agent-ready commerce capabilities, these platforms will play a crucial role in preventing a digital divide between large and small businesses in the agent economy.</p><h2><strong>The Great Rewiring</strong></h2><p>The transition to an agent-first internet represents more than just a technological shift&#8212;it's a fundamental reimagining of how commerce functions in the digital age. 
We're moving from a web optimized for human attention and interaction to one built for efficient, automated decision-making. This transformation touches every aspect of online business:</p><ul><li><p>Payment systems evolve from human-verification models to agent-oriented protocols with built-in delegation and accountability</p></li><li><p>Marketing shifts from attention-grabbing campaigns to structured, machine-readable offer networks</p></li><li><p>Customer support transforms from human-to-human interaction to efficient agent-to-agent problem resolution</p></li><li><p>Language barriers dissolve as agent-mediated communication enables seamless global commerce</p></li></ul><p>Companies that quickly adapt to this new paradigm&#8212;implementing agent passports, embracing agent-to-agent protocols, and restructuring their services for machine readability&#8212;will shape the next era of online interaction. Just as the mobile revolution created trillion-dollar opportunities, the agent revolution opens new horizons for innovation and value creation. The businesses that thrive won't just be those with the best products or prices, but those that best enable and embrace agent-driven commerce.</p><div><hr></div><p>This is the third essay in a <a href="https://www.aitidbits.ai/s/ai-agents">five-part series</a> exploring the future of AI agents and their impact on the internet. Having established the foundational shifts and their practical implications across multiple domains, our next post will examine early case studies and implementation challenges as businesses begin this transformation. 
Subscribe and follow as we continue to navigate the technical, social, and economic implications of this new paradigm.</p>]]></content:encoded></item><item><title><![CDATA[The Voice Agents Toolkit for Builders]]></title><description><![CDATA[Curated frameworks, tools, and libraries to launch reliable and efficient voice agents]]></description><link>https://www.aitidbits.ai/p/voice-agents-toolkit</link><guid isPermaLink="false">https://www.aitidbits.ai/p/voice-agents-toolkit</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 22 Dec 2024 13:00:57 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em><strong>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.</strong></em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" 
data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;2242c69c-d496-4931-a37a-cd1a8169d758&quot;,&quot;duration&quot;:1503.6082,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>Over the past few weeks, I've explored how AI agents will fundamentally reshape the internet. From introducing the concept of "agent-responsive design" - where websites optimize for AI interaction - to examining the infrastructure needed for agent-to-agent communication protocols, my previous posts laid the groundwork for understanding the agent-centric future of the web.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;160324a6-5e60-45e5-9ec6-fccfaa98b0d7&quot;,&quot;caption&quot;:&quot;Welcome to AI Tidbits Deep Dives: short posts offering a perspective on AI-related topics. Some of my previous ones covered the economies of scale for foundational AI models, the recent consolidation in the AI space, and autonomous agents.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Agent-Responsive Design: Rethinking the web for an agentic future&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. 
Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-17T16:01:09.913Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d892766-b0f9-45ea-b64e-1955fc3c4dc6_2458x1406.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/agent-responsive-design&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:151674570,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:76,&quot;comment_count&quot;:13,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;b0999537-b746-4b4d-bc6e-4c6b73416b88&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI. 
Some of my previous posts listed the open-source toolkit for AI Agents builders, the economies of scale for foundational AI models&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Great Web Rebuild: Infrastructure for the AI Agent era&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-12-12T14:30:15.101Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/ai-agents-web-infrastructure&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:152954768,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:50,&quot;comment_count&quot;:7,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI 
Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;d983874a-b658-4232-bb8d-93ab29e27779&quot;,&quot;caption&quot;:&quot;Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. 
Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-28T12:31:04.504Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:152246141,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:104,&quot;comment_count&quot;:8,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p><br>Now, it's time to tackle perhaps the most natural and transformative interface for these agents: voice.</p><p>While text-based agents have dominated the early wave of AI applications, voice represents the next frontier in human-AI interaction. It's not just another interface - it's the most intuitive and accessible way for humans to interact with AI systems. 
This intersection of voice technology and AI agents creates unprecedented opportunities for developers, so I'm excited to share this comprehensive guide to the voice AI ecosystem.</p><p>After decades of frustrating experiences with narrowly scoped voice assistants that follow a rigid rule-based script and cannot be interrupted, we're witnessing a fundamental shift in what's possible. Three key developments drive this shift:</p><ol><li><p><strong>Breakthrough in speech-native models</strong> - the releases of OpenAI's Realtime API last October and Google's Gemini 2.0 <a href="https://blog.google/technology/google-deepmind/google-gemini-ai-update-december-2024/">Realtime Multimodal API</a> last week mark a transition from traditional "cascading architectures" (where speech is converted to text, processed, and converted back to speech) to speech-native models that process audio directly with unprecedented quality. With OpenAI's recent <a href="https://openai.com/index/o1-and-new-tools-for-developers/">60% Realtime API price reduction</a> and the hiring of WebRTC's founder, we're seeing a clear industry push toward making real-time voice interactions accessible and affordable.</p></li><li><p><strong>Dramatic reduction in complexity</strong> - what previously required hundreds of data scientists can now be achieved by small teams of AI engineers. We're seeing companies reach substantial ARR with lean teams by building specialized voice agents for specific verticals - from restaurant order-taking to lead qualification for sales teams.</p></li><li><p><strong>Infrastructure maturity</strong> - the emergence of robust developer platforms and middleware solutions has dramatically simplified voice agent development. These tools handle complex challenges like latency optimization, error handling, and conversation management, allowing developers to focus on building unique user experiences.</p></li></ol><p>This convergence creates a unique opportunity for builders. 
For the first time in human history, we have god-like AI systems that converse like humans. The era of capable voice AI has arrived, opening up vast opportunities for innovators and developers alike.</p><p>Unlike web or mobile app development, where patterns are well established, voice AI is still in its formative stage. The winners in this space will be those who can combine technical capability with a deep understanding of specific industry needs.</p><p>In this post, I'll provide a well-curated overview of the open-source and commercial tools available for developers building voice agents. While VCs segment the market based on investment opportunities, I'll map the ecosystem based on what matters to developers: APIs, SDKs, and tools you can actually use today. What is the go-to model for speech-to-text? The best API for speech synthesis? Which tools do other builders rely on to develop voice agents? With the holiday season upon us, there's no better time to build your voice agent, turn it into a company, or automate a personal workflow.</p><p>Categories covered in this piece, along with my recommended stack and tips for choosing the right architecture at the end:</p><p><strong>Foundations</strong><br>&#8594; Speech-to-text<br>&#8594; Text-to-speech</p><p><strong>Voice AI Middleware</strong><br>&#8594; Speech-to-speech<br>&#8594; Conversational Agents Frameworks<br>&#8594; Evaluation</p><p><strong>End-to-end Automation</strong><br>&#8594; Full Stack</p><p><em><strong><br>Missing a package or tool you think should have been part of this list? 
Comment on this post and I&#8217;ll take a look.</strong></em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wWQK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wWQK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 424w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 848w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 1272w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wWQK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png" width="1456" height="821" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:821,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1700082,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wWQK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 424w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 848w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 1272w, https://substackcdn.com/image/fetch/$s_!wWQK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff09c911f-99e1-4b0a-84bf-1b94a5c3d457_2154x1214.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div><hr></div><pre><code><code>Become a premium AI Tidbits subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Foundations</h2><h3><strong>Speech-to-text (STT)</strong></h3><p>Automated transcription models have improved significantly over the past couple of years, with lower latency, lower cost, and a steadily decreasing word error rate (WER). When selecting a speech-to-text model, consider these key factors:</p><ul><li><p><strong>Latency requirements</strong> - for real-time conversations, you need models that can process speech with under 300 ms of latency. Cloud APIs like Assembly AI and Deepgram excel here, while Whisper variants are better suited for async transcription.</p></li><li><p><strong>Deployment constraints</strong> - open-source options like Whisper.cpp offer flexibility for edge deployment and privacy-sensitive applications but require more engineering effort to optimize. Cloud APIs provide easier scaling but at a higher operational cost.</p></li><li><p><strong>Language and accent support</strong> - while most models handle standard English well, performance varies significantly for other languages and accents. 
Whisper has broad language support, while specialized APIs like Assembly AI may offer better accuracy for specific use cases.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-hG2!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-hG2!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 424w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 848w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 1272w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-hG2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png" width="1456" height="797" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:797,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:257289,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-hG2!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 424w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 848w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 1272w, https://substackcdn.com/image/fetch/$s_!-hG2!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F42454ed3-80b7-457c-a534-5e6b65b6ee23_1842x1008.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Speed vs. price tradeoff for leading speech-to-text models. 
<a href="https://artificialanalysis.ai/speech-to-text#:~:text=Speed%20Factor%20vs.%20Price">Source</a></figcaption></figure></div><p><strong>Open-source (mostly Whisper-based)</strong></p><ol><li><p><a href="https://github.com/ggerganov/whisper.cpp">Whisper.cpp</a> </p></li><li><p><a href="https://github.com/m-bain/whisperX">WhisperX</a></p></li><li><p><a href="https://github.com/KoljaB/RealtimeSTT">RealtimeSTT</a></p></li></ol><p><strong>Commercial</strong></p><ol><li><p><a href="https://deepgram.com/product/speech-to-text">Deepgram</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p></li><li><p><a href="https://www.assemblyai.com/products/speech-to-text?utm_source=google&amp;utm_medium=cpc&amp;utm_campaign=Brand&amp;utm_term=assembly%20ai&amp;gad_source=1&amp;gclid=CjwKCAiAgoq7BhBxEiwAVcW0LHxQOUlx5xFMVPkhO5PweZDnEOPzYZ9huYGCwPsc-tC2HwxkruseEBoCwnAQAvD_BwE">Assembly AI</a></p></li><li><p><a href="https://platform.openai.com/docs/guides/speech-to-text">OpenAI&#8217;s hosted Whisper</a></p></li><li><p>Cloud Providers: <a href="https://azure.microsoft.com/en-us/products/ai-services/ai-speech">Azure</a>, <a href="https://cloud.google.com/speech-to-text?hl=en">Google</a>, <a href="https://aws.amazon.com/transcribe/">AWS</a>. 
There don&#8217;t seem to be significant differences between the cloud providers, so I usually opt for the one I already use for other services like storage and compute.</p></li></ol><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;6b948f35-326a-4ed2-94ec-aab2dcae103b&quot;,&quot;duration&quot;:null}"></div><p>&#128070; RealtimeSTT features low-latency transcription with wake word activation</p><h3><strong>Text-to-speech (TTS)</strong></h3><p>Text-to-speech, dominated by ElevenLabs&#8217; costly API as recently as June 2023, has become a competitive market with numerous low-latency, cost-effective alternatives. Modern TTS systems go far beyond simple voice synthesis, offering capabilities like:</p><ul><li><p><strong>Voice cloning</strong> - creating custom voices from just a few minutes of audio samples, enabling personalized agent voices</p></li><li><p><strong>Emotional synthesis</strong> - adjusting tone, pace, and emphasis to convey different emotions and speaking styles</p></li><li><p><strong>Multi-speaker synthesis</strong> - seamlessly switching between different voices in a single conversation</p></li></ul><p>The most significant recent advancement is the emergence of "speech-native" models that can generate highly natural speech without the traditional text-intermediate step. 
This enables more fluid conversations with lower latency and better preservation of emotional nuances.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7ApM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7ApM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 424w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 848w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 1272w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7ApM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png" width="670" height="446.3598901098901" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:670,&quot;bytes&quot;:230691,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7ApM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 424w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 848w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 1272w, https://substackcdn.com/image/fetch/$s_!7ApM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbe0dbd1-4080-40a9-b4bc-c7e75d6c2249_1834x1222.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Text-to-speech models leaderboard. 
<a href="https://artificialanalysis.ai/text-to-speech/arena?tab=Leaderboard">Source</a></figcaption></figure></div><p><strong>Open-source</strong></p><ol><li><p><a href="https://github.com/coqui-ai/TTS">Coqui TTS</a></p></li><li><p><a href="https://github.com/2noise/ChatTTS">ChatTTS</a></p></li><li><p><a href="https://github.com/myshell-ai/OpenVoice">OpenVoice</a></p></li></ol><p><strong>Commercial</strong></p><ol><li><p><a href="https://www.cartesia.ai/sonic">Cartesia</a></p></li><li><p><a href="https://platform.openai.com/docs/guides/text-to-speech">OpenAI TTS</a></p></li><li><p><a href="https://deepgram.com/product/text-to-speech">Deepgram Aura</a></p></li><li><p><a href="https://elevenlabs.io/text-to-speech">ElevenLabs</a> (just released <a href="https://elevenlabs.io/blog/meet-flash">Flash</a>&#8212;a 75ms latency model in 32 languages)</p></li></ol><p>Other popular APIs: <a href="https://play.ht/">Play</a>, <a href="https://aws.amazon.com/polly/">Amazon Polly</a>, <a href="https://azure.microsoft.com/en-us/products/ai-services/ai-speech">Azure AI Speech</a>, <a href="https://cloud.google.com/text-to-speech?hl=en">Google</a></p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;bff556d4-1627-478d-bc9a-e6ba95c81f5b&quot;,&quot;duration&quot;:null}"></div><p>&#128070; Hello Patient&#8217;s voice agent is powered by Cartesia</p><div><hr></div><pre><code><code>Become a premium AI Tidbits subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Voice AI Middleware</h2><h3><strong>Speech-to-speech</strong></h3><p>OpenAI sparked everyone&#8217;s imagination when it demoed GPT-4o in May 2024, showcasing real-time conversational AI that can be interrupted. Until a few months ago, this kind of technology wasn&#8217;t available to developers. Now, we have commercial APIs at a reasonable price (OpenAI dropped the price for its Realtime API by 60% last week) and open-source alternatives.</p><p>Speech-to-speech models are specialized AI systems that process and respond to voice input directly in the audio domain without converting to and from text. They have several advantages over cascading workflows that chain sequential speech-to-text, text-to-text, and text-to-speech models:</p><ol><li><p><strong>Ultra-low latency</strong> - by processing audio natively, these models achieve response times of ~300 milliseconds, matching natural human conversation speeds. 
Traditional cascading approaches often exceed 1000ms, creating noticeable delays.</p></li><li><p><strong>Better contextual understanding</strong> - speech-to-speech models preserve important non-textual elements like tone, emotion, and conversation dynamics that are often lost in text conversion.</p></li><li><p><strong>Natural interruptions</strong> - unlike cascading systems that require rigid turn-taking, speech-native models can listen and process input even while speaking, enabling natural interruptions and more fluid conversations.</p></li><li><p><strong>Improved conversation quality</strong> - by maintaining the audio context throughout the interaction, these models better handle overlapping speech, background noise, and the natural rhythm of human dialogue.</p></li></ol><p><strong>Open-source</strong></p><ul><li><p><a href="https://github.com/fixie-ai/ultravox">Ultravox</a></p></li><li><p><a href="https://github.com/kyutai-labs/moshi">Moshi</a></p></li></ul><p><strong>Commercial</strong></p><ul><li><p><a href="https://platform.openai.com/docs/guides/realtime?text-generation-quickstart-example=audio">OpenAI&#8217;s Realtime API</a></p></li><li><p><a href="https://ai.google.dev/api/multimodal-live">Gemini 2.0 Multimodal Live API</a></p></li></ul><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;addd3c0f-15a6-48af-b7bd-8827698acad6&quot;,&quot;duration&quot;:null}"></div><p>&#128070; Conversing with Google&#8217;s new Gemini 2.0 (source: <a href="https://github.com/saharmor/gemini-multimodal-playground">Gemini Playground</a>)</p><h3><strong>Conversational Agent Frameworks</strong></h3><p>Conversational agent frameworks provide the essential scaffolding needed to build production-ready voice AI applications. These frameworks abstract away much of the complexity in managing real-time voice infrastructure, handling edge cases, and orchestrating the various components of a voice application. 
Pipecat offers an open-source approach for developers who want complete control over their stack and need flexibility to build custom multimodal experiences. Vocode provides a more opinionated toolkit that simplifies the process of creating voice-based LLM agents, handling much of the complexity around conversation management and error handling. Vapi takes this further by offering a full-featured platform that abstracts away infrastructure challenges while providing enterprise-grade reliability and scalability. The choice between these frameworks often comes down to the specific requirements of your project - whether you need the flexibility of open-source, the simplicity of a focused toolkit, or the robustness of a complete platform.</p><p><strong>Open-source</strong></p><ul><li><p><a href="https://github.com/pipecat-ai/pipecat">Pipecat</a></p></li><li><p><a href="https://github.com/vocodedev/vocode-core">Vocode</a></p></li></ul><p><strong>Commercial</strong></p><ul><li><p><a href="https://vapi.ai/">Vapi</a></p></li></ul><h3><strong>Evaluation</strong></h3><p>One of the most significant challenges in voice AI development is accurately measuring agent performance. Unlike text-based interactions, where metrics like response accuracy and latency are relatively straightforward, voice agents require a more nuanced evaluation approach that considers elements like conversation flow, emotional appropriateness, and real-world task completion.</p><p>Technical performance metrics form the foundation of evaluation. Response latency should target under 300ms for natural conversation and be measured across percentiles (for example p50, p95, and p99) rather than as a single average. Speech recognition accuracy is tracked through Word Error Rate (WER), with special attention to domain-specific terminology. The second dimension focuses on conversational quality metrics that measure the agent's ability to handle natural dialogue. 
This includes turn management metrics like interruption handling and end-of-speech detection, along with task completion metrics such as success rate and the number of turns needed to complete a task. User experience metrics round out the evaluation framework, tracking call abandonment rates, average call duration, user sentiment scores, and the frequency of repeat requests.</p><p>When done right, proper evaluation becomes a game-changing unlock - it enables developers to identify subtle issues that could frustrate users (like awkward pauses or mismatched emotional responses), optimize for natural conversation patterns, and ensure consistent performance across different accents and background noise conditions.</p><p><strong>Open-source</strong></p><ul><li><p><a href="https://github.com/saharmor/voice-lab">Voice Lab</a></p></li></ul><p><strong>Commercial</strong></p><ul><li><p><a href="https://www.coval.dev/">Coval</a></p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!r4LJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!r4LJ!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 848w, 
https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif" width="693" height="346.5" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:800,&quot;resizeWidth&quot;:693,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Demo usage&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Demo usage" title="Demo usage" srcset="https://substackcdn.com/image/fetch/$s_!r4LJ!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 424w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 848w, 
https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!r4LJ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b454856-31bc-4210-9a24-6d0a91c3fd55_800x400.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Evaluating voice agents using Voice Lab</figcaption></figure></div><h2>End-to-end Automation</h2><h3><strong>Full Stack</strong></h3><p>The 
Stripe for voice agents. From prompt to a call.</p><p>While building with individual components offers maximum flexibility, full-stack solutions make sense when you need to quickly deploy production-ready voice agents without managing infrastructure complexity. These platforms are particularly valuable for teams that need to focus on their core business logic and customer experience rather than wrestling with the intricacies of voice infrastructure, latency optimization, and scaling challenges.</p><p>These solutions typically provide end-to-end capabilities, including pre-built integrations with popular business systems (CRMs, scheduling tools, payment processors), sophisticated error handling, automated failover, and comprehensive analytics. They handle complex engineering challenges like maintaining low latency during peak loads, graceful degradation during API failures, and automated quality monitoring. While you trade some flexibility compared to building your own stack, these platforms can dramatically accelerate time-to-market and reduce engineering overhead.</p><ul><li><p><a href="https://www.bland.ai/">Bland</a></p></li><li><p><a href="https://deepgram.com/product/voice-agent-api">Deepgram Voice Agent API</a></p></li><li><p><a href="https://elevenlabs.io/conversational-ai">ElevenLabs Conversational AI</a></p></li></ul><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;c5a3c135-558f-4323-ab28-17c2da33b654&quot;,&quot;duration&quot;:null}"></div><p>&#128070; A Deepgram-powered voice agent handles a food order</p><div><hr></div><h2>Choosing the right architecture for your voice agent</h2><p>When selecting a voice AI architecture, developers face a fundamental choice between using full-stack platforms and assembling custom solutions from individual components. 
This decision ultimately comes down to three key factors: latency requirements, cost constraints, and the need for fine-grained control over the conversation flow.</p>
      <p>
          <a href="https://www.aitidbits.ai/p/voice-agents-toolkit">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Great Web Rebuild: Infrastructure for the AI Agent era]]></title><description><![CDATA[How core internet components will transform for an agent-driven web and the new opportunities for startup founders]]></description><link>https://www.aitidbits.ai/p/ai-agents-web-infrastructure</link><guid isPermaLink="false">https://www.aitidbits.ai/p/ai-agents-web-infrastructure</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Thu, 12 Dec 2024 14:30:15 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI. Some of my previous posts listed the <a href="https://www.aitidbits.ai/p/open-source-agents">open-source toolkit for AI Agents builders</a>, the <a href="https://www.aitidbits.ai/p/economies-of-scale-gen-ai">economies of scale for foundational AI models</a>, and <a href="https://www.aitidbits.ai/p/the-rise-of-autonomous-agents">the future of autonomous agents</a>.</em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;0af73734-f156-4b74-9423-6cee42ae4079&quot;,&quot;duration&quot;:1023.3992,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>December 2028. Sarah's AI agent encounters an unusual situation while booking her family's holiday trip to Japan. The multi-leg journey requires coordinating with three different airlines, two hotels, and a local tour operator. 
As the agent begins negotiations, it presents its "agent passport"&#8212;a cryptographic attestation of its delegation rights and transaction history. The vendors' systems instantly verify the agent's authorization scope, spending limits, and exposable metadata like age and passport number. Within seconds, the agent has established secure payment channels and begun orchestrating the complex booking sequence. When one airline's system flags the rapid sequence of international bookings as suspicious, the agent smoothly provides additional verification, demonstrating its legitimate delegation chain back to Sarah. What would have triggered fraud alerts and CAPTCHA challenges in 2024 now flows seamlessly in an infrastructure built for autonomous AI agents.</p><p><em>&#8212;&gt; The future, four years from now.</em></p><p>In my <a href="https://www.aitidbits.ai/p/agent-responsive-design">previous essay</a>, we explored how websites and applications must evolve to accommodate AI agents. Now we turn to the deeper infrastructural shifts that make such agent interactions possible. The systems we've relied on for decades (CAPTCHAs, credit card verification, review platforms, and authentication protocols) were all built with human actors in mind. As AI agents transition from experimental curiosities to fully operational assistants, these mechanisms are beginning to crack under the pressure of automation.</p><p>The transition to an agent-first internet won't just streamline existing processes&#8212;it will unlock entirely new possibilities that were impractical in a human-centric web. Tasks that humans find too tedious or time-consuming become effortless through automation. Instead of clicking 'Accept All' on cookie banners, agents can granularly optimize privacy preferences across thousands of sites. 
Rather than abandoning a cart due to complex shipping calculations, agents can simultaneously compare multiple courier services and customs implications. Even seemingly simple tasks like comparing prices across multiple vendors, which humans typically limit to 2-3 sites, can be executed across hundreds of retailers in seconds. Perhaps most importantly, agents can maintain persistent relationships with services, continuously monitoring for price drops, policy changes, or relevant updates that humans would miss. This shift from manual, limited interactions to automated, comprehensive engagement represents not just a change in speed, but a fundamental expansion of what's possible online.</p><p>Amid these sweeping changes, a new gold rush is emerging. Just as the shift to mobile created opportunities for companies like Uber and Instagram to reinvent existing services, the transition to agent-first infrastructure opens unprecedented possibilities for founders. From building next-generation authentication systems and trust protocols to creating agent-mediated data marketplaces, entrepreneurs have a chance to establish the foundational layers of this new paradigm. 
In many ways, we're returning to the internet's early days, where core infrastructure is being reimagined from the ground up&#8212;this time for an autonomous, agent-driven future.</p><p>In this second post of the <a href="https://www.aitidbits.ai/s/ai-agents">AI Agents series</a>, we&#8217;ll focus on the foundational infrastructure changes that underlie the agent-first internet: new authentication mechanisms, trust systems, novel security challenges, and agent-to-agent protocols, setting the stage for the more commerce-oriented transformations we&#8217;ll explore in the following post.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bMLb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bMLb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 424w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 848w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 1272w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!bMLb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1068161,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bMLb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 424w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 848w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 1272w, https://substackcdn.com/image/fetch/$s_!bMLb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fed0224b3-8e5e-4a9a-8588-3082ea805905_2514x1414.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><h2><strong>Proving you're <s>a human</s> an agent</strong></h2><p>Remember when "proving you're not a robot" meant deciphering distorted text or selecting crosswalk images? Those mechanisms become obsolete in a world where legitimate automated actors are the norm rather than the exception. Today&#8217;s CAPTCHAs, designed to block bots, have become increasingly complex due to advances in multimodal AI. Paradoxically, these mechanisms now hinder real humans while sophisticated bots often bypass them. 
As AI outpaces human problem-solving in these domains, CAPTCHAs risk becoming obsolete, reducing website conversions and frustrating legitimate users.</p><p>The challenge shifts from proving humanity to verifying the agent has been legitimately delegated and authorized by a human user.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iuJ0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iuJ0!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 424w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 848w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 1272w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif" width="659" height="570.035" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:692,&quot;width&quot;:800,&quot;resizeWidth&quot;:659,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;challenging captcha.mp4 [video-to-gif output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="challenging captcha.mp4 [video-to-gif output image]" title="challenging captcha.mp4 [video-to-gif output image]" srcset="https://substackcdn.com/image/fetch/$s_!iuJ0!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 424w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 848w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 1272w, https://substackcdn.com/image/fetch/$s_!iuJ0!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffe08711a-4f10-42d0-98cb-b25015516fab_800x692.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">I recently failed a CAPTCHA three times before finally passing on the fourth attempt. Now picture an 80-year-old attempting to decipher increasingly convoluted challenges</figcaption></figure></div><p>Today&#8217;s rate-limiting mechanisms assume human-paced interactions, relying heavily on IP-based throttling to manage access. But in a world of AI agents, what constitutes "fair use" of digital services? In an agent-driven internet, automated browsing will become not just accepted but essential. Cloudflare, Akamai, and similar services will need to pivot from simplistic IP-based throttling to sophisticated agent-aware frameworks.</p><p>As businesses grapple with these challenges, a new solution is emerging&#8212;one that shifts the paradigm from blocking automated traffic to authenticating and managing it intelligently. 
Enter the <strong>Agent Passport</strong>.</p><p>Imagine a digital credential that encapsulates an agent's identity and permissions&#8212;cryptographically secured and universally recognized. Unlike simple API keys or OAuth tokens, these passports maintain a verifiable chain of trust from the agent back to its human principal. They carry rich metadata about permission scope, spending limits, and authorized behaviors, allowing services to make nuanced decisions about agent access and capabilities.</p><p>By integrating Agent Passports, businesses such as airlines can distinguish between legitimate, authorized agents and malicious actors. New metrics, such as agent reliability scores and behavioral analysis, could ensure fair access while mitigating abuse, balancing security with the need to allow agent-driven traffic.</p><p>Authentication mechanisms, such as signing up and signing in, must also evolve for an agent-first internet. Websites will need to determine not just an agent's identity but also its authorized scope&#8212;what data the agent is authorized to access (&#8216;read&#8217;) and what actions it is permitted to execute (&#8216;write&#8217;).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!f50V!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!f50V!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 424w, 
https://substackcdn.com/image/fetch/$s_!f50V!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 848w, https://substackcdn.com/image/fetch/$s_!f50V!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 1272w, https://substackcdn.com/image/fetch/$s_!f50V!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!f50V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png" width="604" height="712.72" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1416,&quot;width&quot;:1200,&quot;resizeWidth&quot;:604,&quot;bytes&quot;:357639,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!f50V!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 424w, 
https://substackcdn.com/image/fetch/$s_!f50V!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 848w, https://substackcdn.com/image/fetch/$s_!f50V!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 1272w, https://substackcdn.com/image/fetch/$s_!f50V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe22198ca-a25d-48c8-bc10-e59b404c6179_1200x1416.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>Google Login revolutionized online authentication by centralizing access with a single credential, reducing friction and enhancing security. Similarly, agent passports could create a universal standard for agent authentication, simplifying multi-platform access while maintaining robust authorization controls.</p><p>Companies like Auth0 and Okta could adapt by offering agent-specific identity frameworks, enabling seamless integration of these passports into their authentication platforms. Meanwhile, consumer companies like Google and Apple could extend their authentication and wallet services to seamlessly support agent-mediated interactions, bridging the gap between human and agent use cases.</p><h2>A new protocol for Agent-to-Agent communication</h2><p>In the early days of the web, protocols like HTTP emerged to standardize how browsers and servers communicated. In much the same way, the rise of agent-mediated interactions demands a new foundational layer: an <strong>Agent-to-Agent Communication Protocol (AACP)</strong>. This protocol would formalize how consumer agents and business agents discover each other&#8217;s capabilities, authenticate identities, negotiate trust parameters, and exchange actionable data&#8212;all while ensuring both parties operate within well-defined boundaries.</p><p>Just as Sarah's travel agent from the intro paragraph seamlessly coordinated with multiple airlines and hotels, AACP enables complex multi-party interactions that would be tedious or impossible for humans to manage manually.</p><p>Much like HTTPS introduced encryption and certificates to authenticate servers and protect user data, AACP would implement cryptographic attestation for agents. 
Trusted third-party authorities, similar to today&#8217;s certificate authorities, would issue digital &#8220;agent certificates&#8221; confirming an agent&#8217;s legitimacy, delegation chain, and operational scope. This ensures that when a consumer&#8217;s travel-planning agent communicates with an airline&#8217;s booking agent, both sides can instantly verify authenticity and adherence to agreed-upon standards.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zXZv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zXZv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 424w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 848w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 1272w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!zXZv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png" width="1204" height="1264" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1264,&quot;width&quot;:1204,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:204543,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zXZv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 424w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 848w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 1272w, https://substackcdn.com/image/fetch/$s_!zXZv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F342674d7-8332-466d-abf0-875ab7c76cd7_1204x1264.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">A potential implementation of the AACP protocol. A full example of booking an airline ticket can be found <a href="https://claude.site/artifacts/4087f5cf-c4dd-41a2-87df-5faae7fca960">here</a>.</figcaption></figure></div><p>Without such a protocol, a rogue agent might impersonate a trusted retailer to trick consumer agents into unauthorized transactions, or a malicious consumer agent could spoof credentials to overwhelm a merchant&#8217;s infrastructure. 
By mandating cryptographic proof, robust authentication handshakes, and behavior logs, AACP mitigates these threats before meaningful data or funds change hands.</p><p>The handshake phase in AACP would include mutual disclosure of the agents&#8217; technical stacks&#8212;such as which LLM or language configuration they use&#8212;and their supported capabilities. Once established, the protocol would also govern &#8220;write-like operations&#8221; (e.g., initiating a payment or updating account details) by enforcing strict sign-offs with auditable cryptographic signatures. Every action would leave a verifiable trail of authorization that can be reviewed and validated after the fact.</p><p>Finally, AACP would incorporate locale and language negotiation at the protocol level. Although agents can translate and interpret content dynamically, specifying a preferred language or locale upfront helps streamline interactions. This new protocol weaves together trust, authentication, and contextual awareness, forging a resilient substrate on which the agent-first internet can reliably function.</p><h2>Trust and reputation reimagined</h2><p>When we navigate the internet, our judgment of a website's credibility hinges on a blend of visual and social cues. We look for secure HTTPS connections, professional design, and familiar branding to assure us that a site is trustworthy. No one wants to input their credit card information on a site that looks like it was built in the early 2000s. User reviews and star ratings on platforms like Trustpilot and G2 further influence our decisions, offering insights drawn from shared human experiences.</p><p>Perhaps no aspect of online commerce requires more fundamental reimagining than trust and reputation systems. In an agent-mediated economy, traditional cues for reliability fall short. AI agents can't interpret visual aesthetics or branding elements&#8211;they operate on data, protocols, and cryptographic proofs. Trust mechanisms must pivot from human perception to machine-readable verifications. For instance, an agent might verify a seller's identity through cryptographic attestations and assess service quality via automated compliance records, ensuring decisions are based on objective, tamper-proof data. Traditional review platforms like Trustpilot and G2, built around subjective human experiences and star ratings, will also become increasingly obsolete.</p><p>The emerging alternative is a new trust infrastructure built on quantifiable, machine-readable metrics. 
Instead of relying on potentially AI-generated reviews, a problem that has already undermined traditional review systems, agents could assess services using benchmarks like delivery time reliability, system uptime, or refund processing speed&#8212;measurable metrics that ensure objective evaluations rather than subjective human reviews. This could involve decentralized reputation networks where trust is established through cryptographically verified interaction histories and smart contract execution records. Such systems would offer objective assessments of service quality, enabling agents to make informed decisions without relying on potentially biased or manipulated human reviews.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3gJJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3gJJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 424w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 848w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 1272w, 
https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png" width="1456" height="537" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:537,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:493147,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3gJJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 424w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 848w, https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 1272w, 
https://substackcdn.com/image/fetch/$s_!3gJJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84a28f39-5257-4e44-90d6-d9033f53acee_1968x726.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p>Moreover, the feedback loop between consumers and businesses will evolve dramatically. Instead of sending generic emails requesting reviews&#8212;a method often resulting in low response rates&#8212;commerce websites can engage directly with your AI agent to collect timely feedback about specific topics like shipping or product quality. 
They might offer incentives like future store credit to encourage participation. The human user could provide a brief impression, such as "The cordless vacuum cleaner works well, but the battery life is short." The agent then takes this input, contextualizes it with additional product data, and generates a comprehensive review that highlights key features and areas for improvement. This process not only saves time for the user but also provides businesses with richer, more actionable insights.</p><p>Trustpilot and G2 could pivot by introducing agent-oriented verification systems, such as machine-readable trust scores derived from operational metrics like service accuracy, delivery consistency, and customer support responsiveness, enabling agents to evaluate businesses programmatically.</p><h2>The new data-sharing economy</h2><p>Information sharing in the age of AI agents demands a fundamental reinvention of the current consent and data access model. Rather than blunt instruments like cookie banners and privacy policies, websites will implement structured data requirement protocols&#8212;machine-readable manifests that explicitly declare what information is needed and why.</p><p>This granular control would operate at multiple levels of specificity. For example, an agent could share your shirt size (L) with a retailer while withholding your exact measurements. It might grant 24-hour access to your travel dates but permanent access to your seating preferences. When a service requests location data, your agent could share your city for shipping purposes but withhold your exact address until purchase confirmation. These permissions wouldn't be just binary yes/no choices&#8212;they could include sophisticated rules like "share my phone number only during business hours" or "allow access to purchase history solely for personalization, not marketing." 
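</p><p>One way to picture these rules is as a small policy table that the agent checks before answering each data request. The Python sketch below is illustrative only: the <code>Rule</code> fields, the <code>POLICY</code> entries, the purpose names, and the <code>decide</code> helper are all assumptions made for this example, not part of any existing standard or library.</p>
<pre><code>

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass(frozen=True)
class Rule:
    """One hypothetical sharing rule for a single data field."""
    purposes: frozenset                  # purposes the human principal allows
    ttl: Optional[timedelta] = None      # None means the grant never expires
    hours: Optional[tuple] = None        # (start, end) hours of day, end exclusive


# Hypothetical policy mirroring the examples in the text; all names are made up.
POLICY = {
    "shirt_size": Rule(purposes=frozenset({"fitting"})),
    "travel_dates": Rule(purposes=frozenset({"booking"}), ttl=timedelta(hours=24)),
    "seat_preference": Rule(purposes=frozenset({"booking"})),
    "phone_number": Rule(purposes=frozenset({"delivery"}), hours=(9, 17)),
    "purchase_history": Rule(purposes=frozenset({"personalization"})),
}


def decide(field, purpose, now):
    """Return (granted, expires_at) for one data request from a service."""
    rule = POLICY.get(field)
    if rule is None or purpose not in rule.purposes:
        return False, None               # unlisted field or disallowed purpose
    if rule.hours is not None and now.hour not in range(*rule.hours):
        return False, None               # outside the allowed time window
    expires_at = now + rule.ttl if rule.ttl is not None else None
    return True, expires_at


noon = datetime(2025, 1, 6, 12, 0)
print(decide("purchase_history", "marketing", noon))  # denied: wrong purpose
print(decide("travel_dates", "booking", noon))        # granted, expires after 24 hours
```

</code></pre>
<p>In this sketch, anything the policy does not list (exact measurements or a street address, for example) is withheld by default, which matches the idea that the agent shares only what its principal has explicitly allowed.</p><p>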
Such granular controls, impossible to manage manually at scale, become feasible when delegated to AI agents operating under precise constraints.</p><p>AI agents would also act as sophisticated information gatekeepers, maintaining encrypted personal data vaults and negotiating data access in real time.</p><p>These mechanisms will fundamentally shift the balance of power in data-sharing dynamics. GDPR-like frameworks may evolve to include provisions for dynamic, agent-mediated consent, allowing for more granular data-sharing agreements tailored to specific tasks. Websites might implement real-time negotiation protocols, where agents can evaluate and respond to data requests based on their principal's preferences, preserving privacy while optimizing functionality.</p><h2>New attack vectors</h2><p>The shift to agent-mediated interaction introduces novel security challenges. Agent impersonation and agent jailbreaking are two examples.</p><p>Jailbroken AI agents pose significant risks, as <a href="https://www.aitidbits.ai/i/135923979/manipulating-ai-models-for-profit">manipulated agents</a> could act outside their intended scope, leading to unintended purchases or other errors. Techniques like instruction-tuning poisoning or adversarial suffixes could alter an agent&#8217;s behavior during critical tasks. For example, adversarial instructions embedded in a website&#8217;s HTML (a form of prompt injection) might influence an agent&#8217;s purchasing logic, bypassing its human-defined constraints. Robust safeguards and continuous monitoring will be essential to keep these vulnerabilities from being exploited.</p><p>Agent impersonation adds a complex layer to cybersecurity challenges. Malicious actors could spoof an agent's credentials to access sensitive data or execute fraudulent transactions. 
Addressing this threat demands robust multi-layered verification protocols, such as cryptographic identity verification paired with continuous behavioral monitoring, to ensure authenticity and safeguard sensitive interactions.</p><h2>Building the new web - opportunities for founders</h2><p>The web&#8217;s agent-first future has no established playbook&#8212;and that&#8217;s exactly where founders thrive. Entirely new product categories are waiting to be defined: agent-to-agent compliance dashboards, cryptographic attestation services that replace outdated CAPTCHAs, and dynamic data-sharing frameworks that make &#8220;privacy by design&#8221; a reality. Platforms that offer standardized &#8220;agent passports,&#8221; identity brokerages that verify delegation rights, agent-native payment gateways, and trust ecosystems driven by machine-readable performance metrics&#8212;each of these represents a greenfield opportunity to set the standards of tomorrow&#8217;s internet. Startups anticipating these shifts can position themselves as foundational players in an agent-driven economy, opening new channels of value creation and establishing a competitive edge before the rest of the market catches up.</p><p>Some concrete areas include:</p><ul><li><p>Trustpilot for agents - creating machine-readable trust metrics and reputation systems that help agents evaluate services and vendors</p></li><li><p>Okta for AI agents - building the identity and authentication layer that manages agent credentials, permissions, and delegation chains</p></li><li><p>OneTrust for agents - creating the new standard for privacy preference management, turning today's basic cookie banners into sophisticated data-sharing frameworks where agents can negotiate and manage granular permissions across thousands of services</p></li><li><p>Cloudflare for agent traffic - developing intelligent rate-limiting and traffic management systems designed for agent-scale operations</p></li><li><p>LastPass for agent 
permissions - building secure vaults that manage agent credentials and access rights across services</p></li><li><p>AWS CloudFront for agent data - creating CDN-like infrastructure optimized for agent-readable formats and rapid agent-to-agent communication</p></li><li><p>McAfee security for agents - developing security platforms that protect against agent impersonation and novel attack vectors</p></li></ul><p><em><strong><br>Go build.</strong></em></p><div><hr></div><p>This is the second essay in a <a href="https://www.aitidbits.ai/s/ai-agents">five-part series</a> exploring the future of AI agents and their impact on the internet. In Part 3, we'll move from infrastructure to commerce, exploring critical questions that emerge in an agent-first economy: How will payment systems evolve to support autonomous purchasing decisions? What happens to marketing and customer engagement when AI agents become the primary decision-makers? And perhaps most intriguingly, how will businesses adapt their revenue models when agents can negotiate prices and terms at superhuman speed and scale? 
Subscribe and follow as we continue to examine the technical, social, and economic implications of this transformation.</p>]]></content:encoded></item><item><title><![CDATA[The Open-Source Toolkit for Building AI Agents]]></title><description><![CDATA[Curated frameworks, tools, and libraries every developer needs to build functional and efficient AI agents]]></description><link>https://www.aitidbits.ai/p/open-source-agents</link><guid isPermaLink="false">https://www.aitidbits.ai/p/open-source-agents</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Thu, 28 Nov 2024 12:31:04 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!p8_R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>June &#8216;25 update: Released an updated map with new frameworks and repositories since this post was published in Nov &#8216;24.</em></p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;78e32312-2147-4e0e-ab16-1aa629bb1311&quot;,&quot;caption&quot;:&quot;An opinionated, developer-first guide to building AI agents with real-world impact&quot;,&quot;cta&quot;:&quot;Read full story&quot;,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;md&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;The Open-Source Toolkit for Building AI Agents v2&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. 
Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2025-06-01T15:02:05.876Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F190a5f5b-203f-447d-8ad6-a07d9ab3c874_2600x1456.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-agents-updated&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:163224476,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:28,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><div><hr></div><p><em>Welcome to a new post in the AI Agents Series - helping AI developers and researchers deploy and make sense of the next step in AI.</em></p><p>My last post explored how the internet will transform for an agent-first future - from websites optimizing for AI interaction through "agent-responsive design" to the emergence of Agent Engine Optimization (AEO) as the next SEO. 
We saw how tech giants like Google, Apple, OpenAI, and Anthropic are racing to define this next evolution of digital interaction, with Gartner projecting that by 2028, 33% of enterprise software applications will include agentic AI.</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;025ad65a-0fcf-4b32-bcaf-0497505cdce2&quot;,&quot;caption&quot;:&quot;Welcome to AI Tidbits Deep Dives: short posts offering a perspective on AI-related topics. Some of my previous ones covered the economies of scale for foundational AI models, the recent consolidation in the AI space, and autonomous agents.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;sm&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Agent-Responsive Design: Rethinking the web for an agentic future&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. 
Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2024-11-17T16:01:09.913Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d892766-b0f9-45ea-b64e-1955fc3c4dc6_2458x1406.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/agent-responsive-design&quot;,&quot;section_name&quot;:&quot;AI Agents&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:151674570,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:76,&quot;comment_count&quot;:13,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:false,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p>In this post, I'll outline a curated, though non-exhaustive, overview of the open-source ecosystem for developers creating these AI agents. While numerous market maps exist for AI agents, they often cater more to venture capitalists than builders. Developers need actionable tools and frameworks to launch functional AI agents today.</p><p>Which tools do other builders rely on to develop voice agents? What&#8217;s the leading open model for document understanding? With new packages emerging almost daily, I&#8217;ll focus solely on the libraries I&#8217;ve personally found most effective. 
This list is, therefore, intentionally selective rather than exhaustive.</p><p>Every package included here supports commercial use and has a permissive open-source license.</p><p>With the holiday season coming, there's no better time to dive into these tools and start building.</p><p>Categories covered in this piece:<br>&#8594; Frameworks for Building and Orchestrating Agents<br>&#8594; Computer and Browser Use<br>&#8594; Voice<br>&#8594; Document Understanding<br>&#8594; Memory<br>&#8594; Testing and Evaluation<br>&#8594; Monitoring and Observability<br>&#8594; Simulation<br>&#8594; Vertical Agents</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!p8_R!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!p8_R!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 424w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 848w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 1272w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!p8_R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png" width="1456" height="810" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:810,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1849994,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!p8_R!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 424w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 848w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 1272w, https://substackcdn.com/image/fetch/$s_!p8_R!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d163d4c-73d3-4cab-b150-956c0cd6f49e_2174x1210.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><div><hr></div><pre><code><code>Become a premium AI Tidbits subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Frameworks for Building and Orchestrating Agents</h2><p>Building AI agents requires robust frameworks that can handle complex workflows, memory management, and tool integration. These foundational frameworks serve as the backbone for creating agents that can understand, plan, and execute tasks autonomously.</p><ul><li><p><a href="https://github.com/crewAIInc/crewAI">CrewAI</a> - a framework for orchestrating role-playing, autonomous AI agents</p></li><li><p><a href="https://github.com/phidatahq/phidata">Phidata</a> - build AI assistants with memory, knowledge, and tools</p></li><li><p><a href="https://github.com/camel-ai/camel">Camel</a> - build customized multi-agent systems to generate data, complete tasks, or simulate real-world interactions</p></li><li><p><a href="https://github.com/Significant-Gravitas/AutoGPT">AutoGPT</a> - create, deploy, and manage continuous AI agents that automate complex workflows</p></li><li><p><a href="https://github.com/microsoft/autogen">AutoGen</a> - develop LLM applications using multiple agents that can converse with each other</p></li><li><p><a href="https://github.com/TransformerOptimus/SuperAGI">SuperAGI</a> - build, manage, and run autonomous AI agents quickly and reliably</p></li><li><p><a href="https://github.com/superagent-ai/superagent">Superagent</a> - an open framework for building AI assistants</p></li><li><p><a href="https://github.com/langchain-ai/langchain">LangChain</a> &amp; <a 
href="https://github.com/run-llama/llama_index">LlamaIndex</a> - the usual suspects, facilitating AI Agents through composability</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TASi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TASi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 424w, https://substackcdn.com/image/fetch/$s_!TASi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 848w, https://substackcdn.com/image/fetch/$s_!TASi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!TASi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TASi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png" width="1456" height="999" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:999,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;CrewAI Mind Map&quot;,&quot;title&quot;:&quot;CrewAI Mind Map&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="CrewAI Mind Map" title="CrewAI Mind Map" srcset="https://substackcdn.com/image/fetch/$s_!TASi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 424w, https://substackcdn.com/image/fetch/$s_!TASi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 848w, https://substackcdn.com/image/fetch/$s_!TASi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 1272w, https://substackcdn.com/image/fetch/$s_!TASi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F583c9e3b-7d8c-4fb5-b1a2-2230afc1292a_2128x1460.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">CrewAI supports running customized agents with specific roles, goals, and tools</figcaption></figure></div><h2>Computer and Browser Use</h2><p>For AI agents to be truly useful, they need to interact with computers and browsers just like humans do. 
These tools enable agents to navigate websites, control applications, and execute commands programmatically, bridging the gap between AI reasoning and real-world actions.</p><ul><li><p><a href="https://github.com/OpenInterpreter/open-interpreter">Open Interpreter</a> - turn natural language commands into code that runs on your local machine</p></li><li><p><a href="https://github.com/OthersideAI/self-operating-computer">Self-Operating Computer</a> - enables multimodal models to operate a computer</p></li><li><p><a href="https://github.com/simular-ai/Agent-S">Agent-S</a> - an open agentic framework that uses computers like a human</p></li><li><p><a href="https://github.com/lavague-ai/LaVague">LaVague</a> - create web agents that take actions on websites using LLMs as their reasoning engines</p></li><li><p><a href="https://github.com/microsoft/playwright">Playwright</a> - a framework for web testing and automation</p></li><li><p><a href="https://pptr.dev/">Puppeteer</a> - a JavaScript library that provides a high-level API to control Chrome or Firefox</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rMyZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rMyZ!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 848w, 
https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif" width="600" height="338" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:338,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [video-to-gif output image]&quot;,&quot;title&quot;:&quot;temp.mov [video-to-gif output image]&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [video-to-gif output image]" title="temp.mov [video-to-gif output image]" srcset="https://substackcdn.com/image/fetch/$s_!rMyZ!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 424w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 848w, 
https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 1272w, https://substackcdn.com/image/fetch/$s_!rMyZ!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F355e0bd5-8b6d-4118-94ba-8ec84c7d8b4a_600x338.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Self-Operating Computer generates a poem and saves it in a Google Doc</figcaption></figure></div><div><hr></div><h2>Voice</h2><p>Voice 
interfaces represent the most natural way for humans to interact with AI agents. These tools enable the creation of agents that can understand spoken language, maintain context in conversations, and respond with natural-sounding speech, making AI interaction more accessible and intuitive.</p><p><strong>Speech2speech</strong></p><ul><li><p><a href="https://github.com/fixie-ai/ultravox">Ultravox</a> - a speech2speech model for real-time voice interaction, superior to Moshi for now</p></li><li><p><a href="https://github.com/kyutai-labs/moshi">Moshi</a> - a speech2speech model for real-time voice interaction</p></li><li><p><a href="https://github.com/pipecat-ai/pipecat">Pipecat</a> - a framework for voice and multimodal conversational AI, supporting speech2text, text2speech, video, etc.</p></li></ul><p><strong>Speech2text</strong></p><ul><li><p><a href="https://github.com/openai/whisper">Whisper</a> - OpenAI's speech2text model</p></li><li><p><a href="https://github.com/jianfch/stable-ts">Stable-ts</a> - a lightweight Whisper wrapper with timestamps and more</p></li><li><p><a href="https://huggingface.co/pyannote/speaker-diarization-3.1">Speaker diarization 3.1</a> - pyannote&#8217;s flagship model for speaker diarization (detecting who spoke when)</p></li></ul><p><strong>Text2speech</strong></p><p>The only decent open model I came across was ChatTTS, which is not satisfactory for production. I therefore default to ElevenLabs or Cartesia.</p><p><strong>Misc</strong></p><ul><li><p><a href="https://github.com/vocodedev/vocode-core">Vocode</a> - a toolkit for building voice-based LLM agents</p></li><li><p><a href="https://github.com/saharmor/voice-lab">Voice Lab</a> - a comprehensive testing and evaluation framework for voice agents across language models, prompts, and agent personas</p></li></ul><h2>Document Understanding</h2><p>Modern AI agents need to process and understand documents in various formats, from PDFs to images with text. 
These tools provide the crucial ability to extract, comprehend, and act on information from unstructured documents, enabling agents to handle real-world business processes.</p><ul><li><p><a href="https://huggingface.co/collections/Qwen/qwen2-vl-66cee7455501d7126940800d">Qwen2-VL</a> - vision language model from Alibaba outperforming GPT-4o and Claude 3.5 Sonnet</p></li><li><p><a href="https://huggingface.co/mPLUG/DocOwl2">DocOwl2</a> - an efficient multimodal LLM for OCR-free document understanding</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vTOR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vTOR!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 424w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 848w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 1272w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 1456w" sizes="100vw"><img 
src="https://substackcdn.com/image/fetch/$s_!vTOR!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif" width="800" height="463" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:463,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image temp.mov&quot;,&quot;title&quot;:&quot;Image temp.mov&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image temp.mov" title="Image temp.mov" srcset="https://substackcdn.com/image/fetch/$s_!vTOR!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 424w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 848w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 1272w, https://substackcdn.com/image/fetch/$s_!vTOR!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff53ee1a6-0625-4ec5-a7c7-34607542fffe_800x463.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Qwen2 excels in document and chart understanding with a commercially permissive license</figcaption></figure></div><h2>Memory</h2><p>Without memory, AI agents are limited to single-turn interactions. 
These memory tools enable agents to maintain context over long conversations, remember user preferences, and learn from past interactions, making them truly personal assistants rather than just query responders.</p><ul><li><p><a href="https://github.com/mem0ai/mem0">Mem0</a> - provides an efficient, self-improving memory layer for LLMs, enabling personalized AI experiences</p></li><li><p><a href="https://github.com/letta-ai/letta">Letta (fka MemGPT)</a> - creates LLM agents with long-term memory and custom tools</p></li><li><p><a href="https://github.com/langchain-ai/langchain">LangChain</a> - offers memory components to manage conversation history and context</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BqBj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BqBj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 424w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 848w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 1272w, 
https://substackcdn.com/image/fetch/$s_!BqBj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BqBj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png" width="1456" height="996" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:996,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Screenshot of the Letta ADE (Agent Development Environment)&quot;,&quot;title&quot;:&quot;Screenshot of the Letta ADE (Agent Development Environment)&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Screenshot of the Letta ADE (Agent Development Environment)" title="Screenshot of the Letta ADE (Agent Development Environment)" srcset="https://substackcdn.com/image/fetch/$s_!BqBj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 424w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 848w, 
https://substackcdn.com/image/fetch/$s_!BqBj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 1272w, https://substackcdn.com/image/fetch/$s_!BqBj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb411e011-6f1c-4195-8237-db4cb1df4e26_2494x1706.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Stateful agents with Letta</figcaption></figure></div><div><hr></div><pre><code><code>Become a premium AI Tidbits 
subscriber and get over $1k in free credits to build AI agents with Vapi, Claude, and other leading AI tools (Hugging Face, Deepgram, etc.), along with exclusive access to the LLM Builders series and in-depth explorations of crucial topics, such as the future of the internet in an era driven by AI agents.

Many readers expense the paid membership from their learning and development education stipend.</code></code></pre><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.aitidbits.ai/subscribe&quot;,&quot;text&quot;:&quot;Upgrade to Premium&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://www.aitidbits.ai/subscribe"><span>Upgrade to Premium</span></a></p><div><hr></div><h2>Testing and Evaluation</h2><p>As AI agents become more complex, robust testing becomes critical. These tools help developers evaluate agent performance, identify failure modes, and ensure reliability across different scenarios and environments.</p><ul><li><p><a href="https://github.com/saharmor/voice-lab">Voice Lab</a> - a comprehensive testing and evaluation framework for voice agents</p></li><li><p><a href="https://github.com/AgentOps-AI/agentops">AgentOps</a> - tools for monitoring and benchmarking agent performance</p></li><li><p><a href="https://github.com/THUDM/AgentBench">AgentBench</a> - a benchmark to evaluate LLMs as agents across various environments (Web, Minecraft, Visual Design, etc.)</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VEsF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VEsF!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 424w, 
https://substackcdn.com/image/fetch/$s_!VEsF!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif" width="800" height="400" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:400,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Demo usage&quot;,&quot;title&quot;:&quot;Demo usage&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Demo usage" title="Demo usage" srcset="https://substackcdn.com/image/fetch/$s_!VEsF!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 424w, 
https://substackcdn.com/image/fetch/$s_!VEsF!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 848w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1272w, https://substackcdn.com/image/fetch/$s_!VEsF!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2156ad29-8bdc-4efa-abb2-209ec74ed81f_800x400.gif 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Test and refine your voice agents with Voice Lab</figcaption></figure></div><h2>Monitoring and Observability</h2><p>Understanding how AI agents perform in production is crucial for maintaining reliability and optimizing costs. These tools provide insights into agent behavior, resource usage, and performance metrics essential for running agents at scale.</p><ul><li><p><a href="https://github.com/traceloop/openllmetry">OpenLLMetry</a> - an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications</p></li><li><p><a href="https://github.com/AgentOps-AI/agentops">AgentOps</a> - agent monitoring, LLM cost tracking, benchmarking, and more</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hMwg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hMwg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 424w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 848w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 1272w, 
https://substackcdn.com/image/fetch/$s_!hMwg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hMwg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png" width="1456" height="840" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:840,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Session Replays&quot;,&quot;title&quot;:&quot;Session Replays&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Session Replays" title="Session Replays" srcset="https://substackcdn.com/image/fetch/$s_!hMwg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 424w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 848w, https://substackcdn.com/image/fetch/$s_!hMwg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 1272w, 
https://substackcdn.com/image/fetch/$s_!hMwg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71950670-67fd-4db6-860c-0b6d6b60b1e2_3428x1978.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Debug agents with AgentOps</figcaption></figure></div><h2>Simulation</h2><p>Before deploying agents to real-world scenarios, testing them in controlled environments is crucial. 
These simulation tools allow developers to validate agent behavior, test edge cases, and refine decision-making capabilities in safe, reproducible environments.</p><ul><li><p><a href="https://github.com/OpenBMB/AgentVerse">AgentVerse</a> - facilitates the deployment of multiple LLM-based agents in various applications, including simulations</p></li><li><p><a href="https://github.com/sierra-research/tau-bench">Tau-Bench</a> - a benchmark with test code for agent-user interactions in real-world domains such as retail and airlines</p></li><li><p><a href="https://github.com/Farama-Foundation/chatarena">ChatArena</a> - multi-agent language game environments for research on autonomous LLM agents</p></li><li><p><a href="https://github.com/a16z-infra/ai-town">AI Town</a> - a virtual town where AI characters live, chat, and socialize</p></li><li><p><a href="https://github.com/joonspk-research/generative_agents">Generative Agents</a> - Stanford&#8217;s interactive simulacra of human behavior</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!THzk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!THzk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 424w, https://substackcdn.com/image/fetch/$s_!THzk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 848w, 
https://substackcdn.com/image/fetch/$s_!THzk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 1272w, https://substackcdn.com/image/fetch/$s_!THzk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!THzk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png" width="1456" height="832" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:832,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Screen Shot 2023-10-16 at 10 53 49 PM&quot;,&quot;title&quot;:&quot;Screen Shot 2023-10-16 at 10 53 49 PM&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Screen Shot 2023-10-16 at 10 53 49 PM" title="Screen Shot 2023-10-16 at 10 53 49 PM" srcset="https://substackcdn.com/image/fetch/$s_!THzk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 424w, https://substackcdn.com/image/fetch/$s_!THzk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 848w, 
https://substackcdn.com/image/fetch/$s_!THzk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 1272w, https://substackcdn.com/image/fetch/$s_!THzk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadb5660-4a06-4ac2-8b40-313b931dc105_2754x1574.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Simulate agentic environments with AgentVerse</figcaption></figure></div><h2>Vertical Agents</h2><p>There are dozens 
of open vertical agents out there, so here are just a few select ones I&#8217;ve tinkered with and found the most useful:</p><ul><li><p><a href="https://github.com/All-Hands-AI/OpenHands">OpenHands</a> (Coding) - a platform for software development agents powered by AI</p></li><li><p><a href="https://github.com/Aider-AI/aider">aider</a> (Coding) - pair programming in your terminal</p></li><li><p><a href="https://github.com/gpt-engineer-org/gpt-engineer">GPT Engineer</a> (Low code) - build applications using natural language. Specify what you want to build, and the AI will ask for clarification before building it.</p></li><li><p><a href="https://github.com/abi/screenshot-to-code">screenshot-to-code</a> - convert screenshots into a functioning website using HTML/Tailwind/React/Vue</p></li><li><p><a href="https://github.com/assafelovic/gpt-researcher">GPT Researcher</a> (Research) - an autonomous agent that performs comprehensive research on any given topic</p></li><li><p><a href="https://github.com/vanna-ai/vanna">Vanna</a> (SQL) - chat with your SQL database</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RzPu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RzPu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 424w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 848w, 
https://substackcdn.com/image/fetch/$s_!RzPu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 1272w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RzPu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg" width="687" height="500" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:500,&quot;width&quot;:687,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;aider screencast&quot;,&quot;title&quot;:&quot;aider screencast&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="aider screencast" title="aider screencast" srcset="https://substackcdn.com/image/fetch/$s_!RzPu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 424w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 848w, 
https://substackcdn.com/image/fetch/$s_!RzPu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 1272w, https://substackcdn.com/image/fetch/$s_!RzPu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5dbeb423-7286-43ee-9d4c-ec29e31ef8a1_687x499.svg 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Aider brings pair programming to your terminal</figcaption></figure></div><h1><strong>Looking Ahead</strong></h1><p>While 
this post focused on open-source packages with permissive licenses, I plan to publish another comprehensive list specifically for engineers building voice agents. This upcoming guide will include both open-source and commercial tools, covering solutions like OpenAI's Realtime API (speech2speech) and ElevenLabs (text2speech), along with detailed comparisons of their capabilities, pricing models, and ideal use cases.</p><p>Stay tuned for more deep dives in the AI Agents Series.</p><div><hr></div><p>Comprehensive list of open-source packages for AI engineers (last update: Aug &#8216;23)</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;976ec2c7-f9b6-4434-879a-bc69ba00e2ef&quot;,&quot;caption&quot;:&quot;Welcome to Deep Dives - a new section of AI Tidbits providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go!&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;size&quot;:&quot;md&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;Open-source Generative AI&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:3770805,&quot;name&quot;:&quot;Sahar Mor&quot;,&quot;bio&quot;:&quot;An operator and a founder in the AI space for over a decade, recently at Stripe. 
Helping AI researchers and builders make sense of AI @ AI Tidbits.&quot;,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2Fa06b2072-0444-44f7-8106-7892097e4128_1690x1762.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:100}],&quot;post_date&quot;:&quot;2023-08-06T16:30:15.749Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/885bba4a-9f47-4763-82f1-b7b9196ed69d_1664x958.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://www.aitidbits.ai/p/open-source-llms&quot;,&quot;section_name&quot;:&quot;Sahar's 2&#162;&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:135729768,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:19,&quot;comment_count&quot;:0,&quot;publication_id&quot;:null,&quot;publication_name&quot;:&quot;AI Tidbits&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F71d6ea06-1f4c-478d-b0f2-6227eede6b25_1280x1280.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div>]]></content:encoded></item><item><title><![CDATA[Agent-Responsive Design: Rethinking the web for an agentic future]]></title><description><![CDATA[How AI agents will give birth to a new internet]]></description><link>https://www.aitidbits.ai/p/agent-responsive-design</link><guid isPermaLink="false">https://www.aitidbits.ai/p/agent-responsive-design</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 17 Nov 2024 16:01:09 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/5d892766-b0f9-45ea-b64e-1955fc3c4dc6_2458x1406.png" length="0" 
type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to AI Tidbits Deep Dives: short posts offering a perspective on AI-related topics. Some of my previous ones covered the <a href="https://www.aitidbits.ai/p/economies-of-scale-gen-ai">economies of scale for foundational AI models</a>, <a href="https://www.aitidbits.ai/p/the-great-ai-consolidation">the recent consolidation in the AI space</a>, and <a href="https://www.aitidbits.ai/p/the-rise-of-autonomous-agents">autonomous agents</a>.</em></p><div><hr></div><p>A NotebookLM-powered podcast episode discussing this post:</p><div class="native-audio-embed" data-component-name="AudioPlaceholder" data-attrs="{&quot;label&quot;:null,&quot;mediaUploadId&quot;:&quot;720bbf62-76af-417f-978e-3329c1385c35&quot;,&quot;duration&quot;:627.35675,&quot;downloadable&quot;:false,&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>November 2028. Maya's personal AI agent quietly handles her holiday shopping, easily navigating dozens of e-commerce sites. Unlike the clunky chatbots of 2024, her agent seamlessly parses product specifications, compares prices, and makes purchase decisions based on her preferences. "The boots for your sister," it explains, "are from that sustainable brand you both discussed last month - I found them at 20% off and confirmed they'll arrive before your family gathering." What would have taken Maya hours of manual searching now happens automatically, thanks to a web rebuilt for agent-first interaction.</p><p><em>&#8212;&gt; The future, four years from now.</em></p><p>As we approach the end of 2024, a paradigm shift is emerging in how we build and interact with the internet. With rapid advances in AI reasoning capabilities, tech giants and innovative startups alike are racing to define the next evolution of digital interaction: AI agents. Google, Apple, OpenAI, and Anthropic have all declared AI agents as their primary focus for 2025. 
This transformation promises to be as significant as the web and mobile revolutions were and represents perhaps the most natural interface for LLM-powered technology&#8212;far more intuitive and capable than the chatbots that preceded it.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6Q05!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6Q05!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png 424w, https://substackcdn.com/image/fetch/$s_!6Q05!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png 848w, https://substackcdn.com/image/fetch/$s_!6Q05!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png 1272w, https://substackcdn.com/image/fetch/$s_!6Q05!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6Q05!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png" width="1456" height="463" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:463,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:114920,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6Q05!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png 424w, https://substackcdn.com/image/fetch/$s_!6Q05!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png 848w, https://substackcdn.com/image/fetch/$s_!6Q05!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png 1272w, https://substackcdn.com/image/fetch/$s_!6Q05!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3cc88be7-1f92-40cb-94b3-b1c483313a1d_1610x512.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sam Altman stated that agents would be OpenAI&#8217;s next breakthrough in a recent <a href="https://www.reddit.com/r/ChatGPT/comments/1ggixzy/comment/luqgr7l/">Reddit AMA</a></figcaption></figure></div><p>On a recent episode of the No Priors podcast, Nvidia&#8217;s CEO Jensen Huang <a href="https://www.youtube.com/watch?v=hw7EnjC68Fw&amp;t=1716s">stated</a> that &#8220;there's no question we're gonna have AI employees of all kinds&#8221; that would &#8220;augment every single job in the company&#8221;.</p><p>Moreover, Gartner projects that by 2028, <a href="https://www.gartner.com/en/articles/intelligent-agent-in-ai#:~:text=By%202028%2C%2033%25%20of%20enterprise,complete%20tasks%20and%20achieve%20goals.">33% of enterprise software applications</a> will include agentic AI, up from less than 1% in 2024, enabling 15% of day-to-day work decisions to be made autonomously. 
This rapid adoption mirrors the mobile revolution of the early 2010s, but with potentially more far-reaching implications for how we interact with digital services.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;9a44a87c-876f-4499-a0b0-c4172b5fbb5d&quot;,&quot;duration&quot;:null}"></div><p><em>Anthropic&#8217;s Claude submits a vendor request autonomously using the recently announced <a href="https://www.anthropic.com/news/3-5-models-and-computer-use">Computer Use capability</a></em></p><p>This essay, the first in a five-part series, explores how web and mobile will evolve to accommodate an agent-first future and how businesses can prepare for this seismic shift.</p><h2><strong>AI Agents</strong></h2><p>While there's ongoing debate about what exactly an AI agent is, what sets agents apart from traditional software is, at its core, their ability to plan and adapt autonomously. Unlike rule-based systems that follow predetermined paths, agents can formulate strategies, execute them, and&#8212;most importantly&#8212;adjust their approach based on outcomes and changing circumstances. Think of them as digital assistants that don't just follow a script, but actually reason about the best way to achieve your goals. If a planned action fails or yields unexpected results, an agent can reassess and chart a new course, much like a human would. This flexibility and autonomous decision-making capability mark a departure from traditional software, which can only respond in pre-programmed ways.</p><p>Central to agents' capabilities is their sophisticated use of tools. Much like a handyman who knows when to use a screwdriver versus a hammer, agents must determine which tools to use, when to use them, and how to use them effectively. 
For instance, when helping you plan a trip, an agent might first use a calendar tool to check your availability, then a flight search API to find options, and finally a weather service to ensure you pack appropriately. The key isn't just having access to these tools&#8212;it's the agent's ability to reason about their use and orchestrate them intelligently to accomplish complex tasks.</p><h2><strong>From mobile-first to agent-first</strong></h2><p>Remember when 'www' stood for something closer to 'Wild Wild West' than 'World Wide Web'? The early 2000s internet was an untamed digital frontier, where users navigated through a maze of pop-ups, fought off malware, and relied on bookmarked URLs just to find their way around. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!f_BL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!f_BL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png 424w, https://substackcdn.com/image/fetch/$s_!f_BL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png 848w, https://substackcdn.com/image/fetch/$s_!f_BL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png 1272w, 
https://substackcdn.com/image/fetch/$s_!f_BL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!f_BL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png" width="693" height="417.6986301369863" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:440,&quot;width&quot;:730,&quot;resizeWidth&quot;:693,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!f_BL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png 424w, https://substackcdn.com/image/fetch/$s_!f_BL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png 848w, https://substackcdn.com/image/fetch/$s_!f_BL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png 1272w, 
https://substackcdn.com/image/fetch/$s_!f_BL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29e8f6f3-55fa-4805-8e90-301404560ddb_730x440.png 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">Good ol&#8217; days</figcaption></figure></div><p>The early 2010s, when mobile exploded, weren&#8217;t that different as businesses scrambled to make their websites mobile-responsive. 
That shift wasn't just about resizing content for smaller screens &#8211; it fundamentally changed how we approached web design, user experience, and digital strategy. It created a whole new field of website and mobile optimization: choosing the best colors and copy to increase traffic, conversion rates, and stickiness.</p><p>Today, we stand at a similar inflection point with AI agents.</p><p>Just as mobile-responsive design emerged from the need to serve smartphone users better, "agent-responsive design" is emerging as websites adapt to serve AI agents. But unlike the mobile revolution, which was about accommodating human users on different devices, the agent revolution requires us to rethink our fundamental assumptions about who &#8211; or what &#8211; is consuming our digital content.</p><p>In this agent-first era, websites will undergo a dramatic transformation. Gone are the days of flashy advertisements, elaborate typography, and resource-heavy images&#8212;elements that consume bandwidth but provide little value to AI agents. Instead, we're moving toward streamlined, efficient interfaces that prioritize function over form. These new websites will feature minimalist designs optimized for machine parsing, structured data layers that enable rapid information extraction, standardized interaction patterns that reduce processing overhead, and resource-efficient components that minimize token usage and computation costs.</p><p>This evolution extends beyond traditional websites. Mobile applications are already being reimagined with agent-interaction layers, as evidenced by recent methods such as Apple's <a href="https://arxiv.org/abs/2410.18967">Ferret-UI 2</a> and <a href="https://arxiv.org/abs/2410.09407">CAMPHOR</a>, which enable seamless agent navigation of mobile interfaces while maintaining human usability. 
Google and Microsoft also invest in this space, as demonstrated in their recent papers <a href="https://google-research.github.io/android_world/">AndroidWorld</a> and <a href="https://microsoft.github.io/WindowsAgentArena/">WindowsAgentArena</a>, respectively. Both are fully functional environments for developers to build and test agents.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!A6Jk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!A6Jk!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!A6Jk!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!A6Jk!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!A6Jk!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!A6Jk!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif" width="675" height="379.6875" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:450,&quot;width&quot;:800,&quot;resizeWidth&quot;:675,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [video-to-gif output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [video-to-gif output image]" title="temp.mov [video-to-gif output image]" srcset="https://substackcdn.com/image/fetch/$s_!A6Jk!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!A6Jk!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!A6Jk!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!A6Jk!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F490aca1f-9a4c-4c98-9049-bfc3d5baae52_800x450.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" 
stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Microsoft&#8217;s WindowsAgentArena is an open-source framework to test and develop AI agents that can reason, plan, and act on a PC using language models</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OQGO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OQGO!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif 424w, 
https://substackcdn.com/image/fetch/$s_!OQGO!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif 848w, https://substackcdn.com/image/fetch/$s_!OQGO!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif 1272w, https://substackcdn.com/image/fetch/$s_!OQGO!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OQGO!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif" width="800" height="439" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:439,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!OQGO!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif 424w, 
https://substackcdn.com/image/fetch/$s_!OQGO!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif 848w, https://substackcdn.com/image/fetch/$s_!OQGO!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif 1272w, https://substackcdn.com/image/fetch/$s_!OQGO!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff46cf239-15e6-4b19-9a9d-00de4b51e434_800x439.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" 
y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Google&#8217;s AndroidWorld is a testing ground for AI agents on Android phones</figcaption></figure></div><p>The incentives are becoming clear: optimize for agents, and you'll unlock new channels of engagement and commerce. Ignore them, and you risk becoming invisible in the emerging agent-first internet.</p><h2><strong>Blueprint of Agent-Responsive Design</strong></h2><p>At its core, agent-responsive design represents a radical departure from traditional web design principles. Instead of optimizing for human visual perception and engagement, websites must provide clear, structured interfaces that agents can efficiently navigate and interact with.</p><p>This transformation will likely unfold in two phases:</p><h3><strong>Phase 1: Hybrid optimization</strong></h3><p>Initially, websites will maintain dual interfaces: one optimized for human users and a "shadow" version optimized for agents. This agent-optimized version will feature:</p><ul><li><p>Enhanced semantic markup with clear structure and purpose</p></li><li><p>Unobfuscated HTML that welcomes rather than blocks automated interaction</p></li><li><p>Well-defined aria-label attributes and metadata to help agents choose and interact with the right UI components</p></li><li><p>Direct access to knowledge bases and documentation: exposing information beyond what&#8217;s visible on the &#8220;website interface&#8221; and giving querying agents access to the site&#8217;s RAG, so they can easily retrieve information such as the refund policy or answer their questions from the site&#8217;s help docs. 
Also, after being authenticated, providing easy access to user-related information such as last purchases or stored payment methods.</p></li><li><p>Streamlined authentication and authorization protocols</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pkJh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pkJh!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif 424w, https://substackcdn.com/image/fetch/$s_!pkJh!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif 848w, https://substackcdn.com/image/fetch/$s_!pkJh!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif 1272w, https://substackcdn.com/image/fetch/$s_!pkJh!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pkJh!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif" width="667" height="629.48125" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:755,&quot;width&quot;:800,&quot;resizeWidth&quot;:667,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!pkJh!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif 424w, https://substackcdn.com/image/fetch/$s_!pkJh!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif 848w, https://substackcdn.com/image/fetch/$s_!pkJh!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif 1272w, https://substackcdn.com/image/fetch/$s_!pkJh!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba069a1-71b4-49a9-abed-6a97d35044ce_800x755.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A human-optimized vs. agent-optimized website powered by Claude Artifacts. The agent website doesn&#8217;t have the colorful banner at the top and exposes better aria-label descriptions</figcaption></figure></div><h3><strong>Phase 2: API-first architecture</strong></h3><p>The second phase will move beyond traditional UI components, focusing on exposing clean, well-documented APIs that agents can directly interact with. Consumer websites like Amazon, TurboTax, and Chase will:</p><ul><li><p>Provide clear documentation of available tools and capabilities. 
The agent will leverage its reasoning engine and the task the human delegated to plan the tools and sequence that it needs to use.</p></li><li><p>Offer structured workflows with explicit input/output specifications</p></li><li><p>Enable direct access to business logic and user data</p></li><li><p>Support sophisticated authentication mechanisms for agent-based interactions</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CpJk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CpJk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png 424w, https://substackcdn.com/image/fetch/$s_!CpJk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png 848w, https://substackcdn.com/image/fetch/$s_!CpJk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png 1272w, https://substackcdn.com/image/fetch/$s_!CpJk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CpJk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png" width="711" 
height="325.71222527472526" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:667,&quot;width&quot;:1456,&quot;resizeWidth&quot;:711,&quot;bytes&quot;:334338,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CpJk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png 424w, https://substackcdn.com/image/fetch/$s_!CpJk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png 848w, https://substackcdn.com/image/fetch/$s_!CpJk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png 1272w, https://substackcdn.com/image/fetch/$s_!CpJk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F314edbc0-bb19-48dd-be54-e1062711a653_2424x1110.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>The death of traditional A/B testing</strong></h2><p>In an agent-first world, the traditional approach to A/B testing becomes obsolete. Instead of testing different button colors or copy variations for human users, companies like Amazon will need to optimize for agent interaction efficiency and task completion rates.</p><p>These A/B tests will target the same outcomes as today, such as purchases and sign-ups, but will employ LLMs to generate and test thousands of agent personas without lengthy user testing cycles.</p><p>This new testing paradigm will require new success metrics, such as:</p><ul><li><p>Model compatibility across different AI providers (GPT, Claude, etc.) - each language model has its own nuances. 
Optimizing for each model can help businesses improve conversion, bounce rate, and similar metrics by a few percentage points.</p></li><li><p>Task completion rate for the human-delegated task at hand, like purchasing a product or subscribing to a newsletter</p></li><li><p>Token efficiency and latency optimization, enabling lightning-fast interactions while minimizing computational overhead and associated costs</p></li><li><p>Authentication and security protocol effectiveness, ensuring robust protection while maintaining frictionless agent operations</p></li></ul><p>The competitive landscape in this new era will be shaped significantly by model providers' unique advantages. Companies like OpenAI and Google, with their vast user interaction data, will possess an inherent edge in creating agents that deeply understand user preferences and behaviors. However, this also creates an opportunity for innovation in the form of universal memory and context layers, like what mem0 is pitching with their recently released <a href="https://github.com/mem0ai/mem0-chrome-extension">Chrome extension</a>&#8212;systems that can bridge different models, devices, and platforms to create a cohesive user experience.</p><p>Drawing from Sierra's <a href="https://arxiv.org/abs/2406.12045">&#964;-bench research</a>, we can anticipate the emergence of standardized benchmarks for measuring agent-readiness across verticals and task types, similar to how we currently measure mobile responsiveness or page load times.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kSY5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" 
srcset="https://substackcdn.com/image/fetch/$s_!kSY5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png 424w, https://substackcdn.com/image/fetch/$s_!kSY5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png 848w, https://substackcdn.com/image/fetch/$s_!kSY5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png 1272w, https://substackcdn.com/image/fetch/$s_!kSY5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kSY5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png" width="522" height="476" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:476,&quot;width&quot;:522,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:64853,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" 
srcset="https://substackcdn.com/image/fetch/$s_!kSY5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png 424w, https://substackcdn.com/image/fetch/$s_!kSY5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png 848w, https://substackcdn.com/image/fetch/$s_!kSY5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png 1272w, https://substackcdn.com/image/fetch/$s_!kSY5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F50f372d7-8136-4083-b0e8-6ce4ed3ae3d3_522x476.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" 
stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Sierra&#8217;s &#964;-bench emulates conversations between users, simulated by an LLM, and a language agent over multiple trials</figcaption></figure></div><h2><strong>New discovery protocol - Agent Engine Optimization (AEO)</strong></h2><p>Just as websites evolved from manually curated directories to sophisticated search engine optimization, the agent era demands a new discovery mechanism. The question isn't just about findability&#8212;it's about actionability: how do agents identify and interact with the most relevant and capable digital services?</p><p>In 2005, Google introduced the Sitemap protocol to improve search engine crawling efficiency, enable discovery of hidden content, and provide webmasters with a standardized method for communicating site structure and content updates to search engines. What is the Sitemap equivalent for AI agents?</p><p>Just as SEO emerged to help websites become discoverable in search engines with Google&#8217;s inaugural PageRank algorithm, Agent Engine Optimization (AEO) will become crucial for visibility in an agent-first web. Back in Aug 2023, I called it <a href="https://www.aitidbits.ai/p/future-of-internet-search">Language Model Ranking Optimization</a>.</p><p>This new protocol will go beyond traditional sitemaps, providing agents with structured information about websites:</p><ul><li><p>Available services and capabilities like signing up, placing an order, booking a flight seat</p></li><li><p>Authentication requirements - what actions require authentication</p></li><li><p>Data schemas and API endpoints - what data does each action/endpoint need? What is mandatory vs. 
optional?</p></li><li><p>Privacy and security protocols - how information is being stored&nbsp;</p></li><li><p>Service level agreements like refund and shipping guidelines and data retention policy</p></li></ul><p>Exposing such information will become a standard feature in website builders like Shopify and Wix, much like mobile responsiveness is today. These platforms will automatically generate and maintain agent-interaction layers, democratizing access to the agent-first economy for businesses of all sizes.</p><p>Companies will need to optimize not just for search engines but for an emerging ecosystem of agent directories and registries that help autonomous agents discover and interact with digital services.</p><p>More on the PageRank for AI Agents will be explored in the next post of this series.</p><h2><strong>The web's great reversal - from defense to embrace</strong></h2><p>This transformation marks a seismic shift in how the internet treats automated interactions. We're moving from an era of aggressive bot blocking and CAPTCHA barriers to one where automated agents are not just tolerated but actively embraced and optimized for. This shift necessitates new quality assurance practices, with websites being tested and validated for agent-readiness just as rigorously as they're tested for human usability today. 
The implications are profound: businesses that master agent-responsive design will gain a competitive edge in an increasingly automated digital economy.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SpDx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SpDx!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif 424w, https://substackcdn.com/image/fetch/$s_!SpDx!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif 848w, https://substackcdn.com/image/fetch/$s_!SpDx!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif 1272w, https://substackcdn.com/image/fetch/$s_!SpDx!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SpDx!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif" width="681" height="367.2484076433121" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:508,&quot;width&quot;:942,&quot;resizeWidth&quot;:681,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SpDx!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif 424w, https://substackcdn.com/image/fetch/$s_!SpDx!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif 848w, https://substackcdn.com/image/fetch/$s_!SpDx!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif 1272w, https://substackcdn.com/image/fetch/$s_!SpDx!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7bb0ad4-aaee-4f46-907c-bcf8eadd3976_942x508.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 
2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">HyperWrite&#8217;s demo of its personal assistant, Aug &#8216;23</figcaption></figure></div><h2><strong>2025: Year of the Agent</strong></h2><p>As we enter 2025, the pieces are falling into place for a fundamental shift in how we interact with the internet. 
Google's Jarvis, Anthropic's Computer Use capabilities, Apple Intelligence, and OpenAI's recently announced <a href="https://techcrunch.com/2024/11/13/openais-take-on-ai-agents-could-come-in-january/">Operator</a> all point to a future where autonomous agents become the primary interface between humans and digital services.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kO3O!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kO3O!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!kO3O!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!kO3O!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!kO3O!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kO3O!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif" width="800" height="450" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:450,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;temp.mov [optimize output image]&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="temp.mov [optimize output image]" title="temp.mov [optimize output image]" srcset="https://substackcdn.com/image/fetch/$s_!kO3O!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif 424w, https://substackcdn.com/image/fetch/$s_!kO3O!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif 848w, https://substackcdn.com/image/fetch/$s_!kO3O!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif 1272w, https://substackcdn.com/image/fetch/$s_!kO3O!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd33e1e9-59ae-4536-b57e-eb653a9b714e_800x450.gif 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" 
xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">OpenAI equipped its ChatGPT Desktop app with the ability to <a href="https://x.com/OpenAIDevs/status/1857129790312272179">control apps on Mac</a>, Nov &#8216;24</figcaption></figure></div><p>This transformation will require businesses to rethink their digital presence from the ground up. Just as the mobile revolution forced companies to adopt responsive design principles, the agent revolution will require the adoption of agent-responsive design patterns and practices.</p><p>In the next essay in this series, we'll explore the components of such a new ecosystem, examining how authentication, payments, and reputation systems must evolve to support this new paradigm.</p><div><hr></div><p><em>This is the first essay in a five-part series exploring the future of AI agents and their impact on the internet. 
Follow along as we examine this transformation's technical, social, and economic implications.</em></p>]]></content:encoded></item><item><title><![CDATA[The rise of autonomous agents]]></title><description><![CDATA[Did recent trends and OpenAI's GPTs announcement unlock autonomous agents?]]></description><link>https://www.aitidbits.ai/p/the-rise-of-autonomous-agents</link><guid isPermaLink="false">https://www.aitidbits.ai/p/the-rise-of-autonomous-agents</guid><dc:creator><![CDATA[Sahar Mor]]></dc:creator><pubDate>Sun, 19 Nov 2023 16:30:28 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F56d15b18-239c-4403-839e-544d2e9dac77_600x378.gif" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to Deep Dives <strong>- </strong>an AI Tidbits section providing editorial takes and insights to make sense of the latest in AI. Let&#8217;s go!</em></p><div><hr></div><p>Last April, Stanford published a paper that sparked everyone&#8217;s imagination. In <a href="https://arxiv.org/abs/2304.03442">this paper</a>, the researchers leveraged ChatGPT to power human-like agents. A mini-simulation of humanity.</p><p>Around the same time, two viral open-source autonomous agent frameworks were introduced to the world - AutoGPT and BabyAGI. Their goal? Build fully autonomous AI agents. Those libraries generated a lot of excitement, but have yet to spawn useful agent-powered applications.</p><p>An autonomous agent, described simply, is an AI program capable of planning and executing tasks based on a given objective. Imagine asking an AI to &#8220;book a flight&#8221;, or &#8220;create a website for people interested in renting their apartments when they&#8217;re away&#8221; - and the agent goes to work. 
It does so by repeatedly asking itself &#8220;What should be the next steps to achieve the task at hand&#8221;, utilizing an LLM to answer this question.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EQxf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EQxf!,w_424,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif 424w, https://substackcdn.com/image/fetch/$s_!EQxf!,w_848,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif 848w, https://substackcdn.com/image/fetch/$s_!EQxf!,w_1272,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif 1272w, https://substackcdn.com/image/fetch/$s_!EQxf!,w_1456,c_limit,f_webp,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EQxf!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif" width="600" height="266" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1233bc47-fc94-4429-a749-614e720a8651_600x266.gif&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:266,&quot;width&quot;:600,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!EQxf!,w_424,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif 424w, https://substackcdn.com/image/fetch/$s_!EQxf!,w_848,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif 848w, https://substackcdn.com/image/fetch/$s_!EQxf!,w_1272,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif 1272w, https://substackcdn.com/image/fetch/$s_!EQxf!,w_1456,c_limit,f_auto,q_auto:good,fl_lossy/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1233bc47-fc94-4429-a749-614e720a8651_600x266.gif 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 
9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">AutoGPT breaks down goals into tasks it then executes. <a href="https://medium.com/geekculture/autogpt-unleashed-the-autonomous-ai-agent-revolution-chat-gpt-ai-artificial-intelligence-productivity-19ad5b71c436">Source</a></figcaption></figure></div><p>Such objectives are great demo material, but demos are often cherry-picked, and the recent consensus is that AI agents are a great concept, but that&#8217;s about it. To me, it felt like agents were becoming the new Crypto - a lot of hype, with no one able to point to a concrete widespread use case.</p><p>That might have changed with <a href="https://www.aitidbits.ai/p/openai-devday">OpenAI&#8217;s recent release of GPTs</a>. GPTs are small ChatGPT wrappers that have the prompt, context, and tools baked into them, ready to roll.</p><p>Though powerful, they have yet to earn the &#8220;autonomous agents&#8221; badge&#8211;they still require a high degree of human feedback to achieve their goal. 
They are, however, capable of connecting to other services and APIs such as your calendar and email.</p><p>Combined with a few other tailwinds, GPTs might represent the milestone the space has been waiting for, leading to a Cambrian explosion of capable autonomous agents.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Iauz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Iauz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Iauz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Iauz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Iauz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Iauz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg" width="1199" height="670" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:670,&quot;width&quot;:1199,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Iauz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Iauz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Iauz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Iauz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff2d6038e-9f20-4dd5-81f8-9987cea292fd_1199x670.jpeg 1456w" sizes="100vw" loading="lazy"></picture></div></a><figcaption class="image-caption">OpenAI announcing GPTs at DevDay&nbsp;</figcaption></figure></div><p><em><strong>Bonus at the end of the post</strong>: a list of open-source repositories every agent builder should know and the startups leading the way.</em></p><h2><strong>Unlocking autonomous agents</strong></h2><p>AutoGPT, the largest autonomous agents framework to date, has over 150k stars on GitHub and 50k builders in its Discord server. With such a large community of developers, one might wonder why useful agents have yet to emerge. The answer lies in the inherent limitations of agents, which are slowly being addressed.</p><h3>Limitation #1 - Small context windows</h3>
      <p>
          <a href="https://www.aitidbits.ai/p/the-rise-of-autonomous-agents">
              Read more
          </a>
      </p>
   ]]></content:encoded></item></channel></rss>