Welcome back. Fable 5 is back. You already know it's the best coding model. The real question: what should you be using it for? Anthropic's top exec Mike Krieger and the Every team just shared 13 prompts you can use right away. Make sure you bookmark these before moving on to the rest of these emails.

Also: Cognition ships Devin for Security, how to turn Fable 5 into an orchestrator, and a former Oracle engineering leader's guide to agent loops.

TODAY IN PROGRAMMING

Click here to watch Cognition’s Devin for Security in action.

Cognition ships AI agents that find and fix security vulnerabilities: The SF-based startup just dropped Devin Security Swarm. It sends parallel agents across entire codebases to catch business logic flaws and chained exploits. Each finding is reproduced in a sandbox to prove it is actually exploitable. Devin then writes the patch and opens a PR for review. Cognition says it finds more verified vulnerabilities at 30% lower cost than rivals. See how it works.

Meta reportedly plans a cloud business to rent AI compute: The social media giant is drafting plans for a new arm called Meta Compute. According to Bloomberg, it will sell raw compute capacity and host AI models like Muse Spark. This move positions Meta against AWS, Google Cloud, and Azure. Engineering teams will gain another heavyweight supplier for training and inference workloads.

Z.ai ships its official coding environment for GLM-5.2: The Chinese AI lab just released ZCode. It is an agentic desktop app that turns the model's 1M-token context into long-running coding sessions. Developers can hand off planning, debugging, and code review. You can check progress from mobile or chat apps while tasks keep running. It works with existing API keys on Mac, Windows, and Linux. Download it here.

INSIGHT

The cheapest new Claude model quietly costs more per task than the last one

Source: The Code, Superhuman

Sold as the economical one. Anthropic just shipped Claude 5 Sonnet as the default for free and Pro users. The pitch was pretty straightforward: performance that rivals Claude Opus 4.8 but at a much lower price point. When you put it to work on real tasks, it ends up costing nearly double what Sonnet 4.6 does. It even costs more than Opus 4.8. So, it turns out the budget option is actually the most expensive of the three to operate.

Flat on paper. Anthropic kept the base token rate flat and added a discount on top, calling the switch basically cost-neutral. At first glance, it looks like a free upgrade. That is the narrative most people are running with. But cost neutral is the giveaway, because a real discount wouldn't need to cancel itself out.

The token counter. There are two things quietly driving up the actual bill. First, a new tokenizer is turning the same text into more tokens. Second, the model is working harder. It's running more reasoning loops for every task. Independent testing from Artificial Analysis confirms the result. Sonnet 5 ends up costing about 15% more per task than Opus. Counterintuitively, the "cheaper" model is now billing higher than the Opus flagship model it was meant to undercut.

The dial is the price. That said, those extra tokens are actually buying you better results. Ramp Labs ran a benchmark and saw more test runs and much tighter self-correction than previous versions. If you set the effort level to low or medium, Sonnet stays cheaper. But if you crank it up to the max, Opus actually wins on price. The setting most teams leave on default decides your bill, so tune effort against cost per finished task and stop ranking models by token rate.

IN THE KNOW

What’s trending on socials and headlines

Meme of the day.

  • AI Tech Lead: This 5-step Claude Code setup turns Fable 5 into an orchestrator that hands grunt work to cheaper models (4.9K bookmarks).

  • Agents on Autopilot: A former Oracle engineering leader uses agent loops and goals to fix bugs, triage issues, and ship apps while you sleep.

  • Rate Limit Hack: The creator of T3.gg shared 4 tips for heavy Fable use without hitting limits (486K views).

  • Margin Squeeze: A viral post argues Chinese open-source models are wiping out margins at the model layer, and the real money is moving somewhere else (500K views).

  • Hidden Check: Claude Code was silently flagging China-linked users until a developer exposed it (5M views). Anthropic confirmed the experiment and shipped a removal.

AI CODING HACK

How to stop re-prompting Claude Code after a failed command

Normally, if you run a command in Claude Code and it fails, you'd have to follow up with a second prompt asking for a fix. That's no longer necessary. Anthropic just rolled out an update where any command prefixed with '!' will automatically get a response.

Just make sure you're on the latest version by running 'claude update' and then start using the '!' prefix for your commands.

! npm test

Claude instantly analyzes the output and suggests a fix in one go. It works just as well for broken builds, type-check errors, or when you simply need a diff explained.

P.S. Get 50+ AI coding hacks for Claude Code, Cursor, and Codex here.

TOP & TRENDING RESOURCES

Click here to watch the tutorial.

Top Tutorial

How to build a continuous eval pipeline for multi-agent systems with Gemini: In this tutorial, you'll learn how to move from subjective testing to data-driven assessments. It covers replacing manual testing with automated, model-based grading, managing unpredictable AI outputs, and integrating automated evaluation directly into a CI/CD pipeline using Cloud Run.

Top Tool

BrowserBash: This CLI tool converts plain-English sentences into reliable browser tests using local models. It eliminates selectors and flaky locators while supporting Chrome, LambdaTest, and various CDP endpoints.

Top Repo

OpenWiki (by LangChain): A CLI built specifically for agents that writes and maintains documentation for your codebase.

Trending Paper

Are we ready for an agent-native memory system? Current tests treat AI agent memory like a black box, ignoring backend data management trade-offs and operational costs. This study reveals that no single architecture fits all scenarios, meaning success depends entirely on matching the memory structure to specific workload bottlenecks.

IN CASE YOU MISSED IT

Our most-clicked story from yesterday

Check out the top 30 prompt techniques that work in 2026.

Grow customers & revenue: Join companies like Google, IBM, and Datadog. Showcase your product to our 300K+ engineers and 150K+ followers on socials. Get in touch.

What did you think of today's newsletter?

Your feedback helps us create better emails for you!

Login or Subscribe to participate

You can also reply directly to this email if you have suggestions, feedback, or questions.

Until next time — The Code team

Keep Reading