Izzo: You're listening to Exploring Next, episode 298. Today, we're talking about Cloudflare's new Code Mode MCP server, which promises to optimize token usage for AI agents. But why does this matter right now? Boone: Well, Izzo, it's all about reducing the cost of interacting with complex APIs. Traditionally, each API endpoint exposed to an agent represented a separate tool definition, which incurs a significant context window cost every time a tool specification consumes tokens in the model's limited input budget. Izzo: That's a big deal, especially when you're working with large language models. So, how does Code Mode solve this problem? Boone: Code Mode exposes only two tools, search() and execute(), backed by a type-aware SDK that allows the model to generate and execute JavaScript inside a secure V8 isolate. This compiles an agent's plan into a small code snippet orchestrating multiple operations against the OpenAPI spec, avoiding the need to load all endpoint definitions into context. Izzo: That's really clever. And what about the impact? How much does it reduce token usage? Boone: According to Cloudflare, Code Mode reduces the token footprint of interacting with over 2,500 API endpoints from more than 1.17 million tokens to roughly 1,000 tokens, a reduction of around 99.9%. Izzo: Wow, that's significant. And what about security? How does Cloudflare handle user-generated code? Boone: The server runs user-generated code in a Dynamic Worker isolate with no file system, no environment variables exposed, and outbound requests controlled via explicit handlers. This design mitigates risks associated with executing untrusted code while preserving agent autonomy. Izzo: Okay, so what does this mean for developers? How can they start using Code Mode? Boone: Cloudflare has open-sourced a Code Mode SDK within its broader Agents SDK, which enables similar patterns in third-party MCP implementations. Developers can start by checking out the Cloudflare blog post and the GitHub repository for more information. Izzo: Alright, so to build next, what should our listeners go research or try installing? Boone: I'd recommend checking out the Cloudflare Code Mode SDK and the OpenAPI specification. You can also try installing the Dynamic Worker isolate and experimenting with user-generated code. And if you're feeling adventurous, you can try adding Code Mode to your weekend project list. Izzo: Ha! Okay, Boone, I think you've added enough to your weekend project list for now. But seriously, this is a really exciting development, and we'll be keeping an eye on how it evolves. Thanks for tuning in to episode 298 of Exploring Next.