Welcome back. Protecting model reasoning has become the top priority for AI labs. Anthropic claims Chinese giant Alibaba used nearly 29 million exchanges to extract Claude's capabilities. They've taken the fight all the way to the US Senate, asking Congress to step in and hold companies accountable for this kind of extraction. 

Also: How to use Claude Tag, fixing your team’s code review process and how to choose between code and LLMs.

TODAY IN PROGRAMMING

Made with Midjourney.

Anthropic reportedly accuses Alibaba of mass distillation attack: The AI lab informed US lawmakers that operators linked to a Chinese e-commerce giant used thousands of fake accounts to run nearly 29 million exchanges. According to the lab, this was a distillation attack designed to copy Claude's reasoning into a cheaper rival model. They're now calling on Congress to penalize what they describe as industrial-scale theft, mirroring similar accusations OpenAI has leveled against Chinese groups. 

OpenRouter lets your agents choose AI models: The LLM marketplace just rolled out an MCP that lets your AI agent choose the right AI model for the right task. They provide access to 400 models, which prevents your agent from relying on outdated training data. You can set it up in your CLI with just two commands. Once it's connected, your agent can access real-time benchmarks, pricing, and documentation. You can easily find models that cost less than $2 per million tokens and price, test, and swap models in under a minute.

Vercel ships AI SDK 7 with durable agents: The hosting platform just dropped version 7 of the AI SDK, specifically designed for production-ready agents. Its new WorkflowAgent keeps tasks active through restarts, deployments, and interruptions, so you never start from scratch. Agents can now pause for human approval before risky tool calls and run shell commands inside sandboxes. Plus, you can also swap in harnesses like Claude Code or Codex through one interface.

Most production agents carry dead weight: context they re-send, instructions they repeat, tools they never touch. But the invoice doesn’t say what was waste, and what worked. Hand-authored rules don't keep up with new model drops, pricing shifts, or workflow drifts.

AI21 is building the drop-in layer that reads your live traffic, finds the waste, and cuts it at runtime.

Read why we need a self-updating model router.

INSIGHT

A smarter AI code reviewer won't cut the noise. Here's what will:

Source: The Code, Superhuman

Reviews can't keep up. Teams are shipping more code than managers can review. A recent study tracked 22,000 developers and found that nearly one-third of PRs now merge without any review at all. Reviewers just can't keep up with the volume. The usual fix is an AI reviewer. But that often backfires. It floods you with nitpicks that get ignored. Some open-source maintainers have been overwhelmed by AI-filed issues. They've even started blocking outside contributions.

Jan Giacomelli, a dev who runs AI review on every backend merge at Ren Systems, argues the value comes from three specific moves most teams avoid:

First, think about size. Both you and the AI have a limited context window. Massive merge requests are basically unreadable for everyone. Keep your PRs small. They should be tiny enough that you wouldn't mind scrapping them if the AI messes up. Small changes are where the real review happens.

Second, you need context. A generic “review this” prompt just gives you noise. Instead, get the team together. Agree on what’s worth flagging. Save those rules in a markdown file for the AI. If you are not sure where to start, look back at common complaints. Find past human reviews and turn those into your new guidelines.

The third tip is all about the plumbing. Set up the review as a manual Continuous Integration (CI) job. Don't run it on every push. This saves tokens and keeps a human in the loop. The real payoff is the cost. Anthropic's standard feature costs $15 to $25 per review. But Giacomelli's custom Claude Code setup uses Sonnet and Haiku. That only costs between $0.15 and $1.50. You can read his full setup here or, for the wider agentic-review picture, former Google Cloud AI Director Addy Osmani's guide here.

Anyone can make an AI demo. The hard part is shipping it, monitoring it, and knowing when it breaks (and how to fix it).

Datadog's Developer Toolkit for the AI Era gives you the playbook for building, deploying, and observing AI applications at scale. It covers: CI pipelines, LLM observability, feature flags, testing, and modern AI delivery workflows.

IN THE KNOW

What’s trending on socials and headlines

Meme of the day.

  • Beautiful UI: A full-stack engineer used Claude to build a token-based design system that scales without manual tweaks (4.2K bookmarks).

  • Tag Claude In: Anthropic recently unveiled an AI coworker you can tag in Slack. This video covers how it works and the best practices behind it (1.6K likes).

  • No-JS Forms: An ex-Meta staff engineer explains why you shouldn’t use JavaScript for form validation and submission.

  • Post-Training Hub: Everything on Reinforcement Learning from Human Feedback (RLHF) and LLM post-training now lives in one place, with code examples at every stage.

  • Markdown Trap: Don't make an LLM do work that plain code does faster, cheaper, and safer. This piece explains exactly when to reach for AI and when to skip it.

  • Loop Engineering: A former senior Amazon engineer shares when to stop hand-steering your coding agent and what's worth automating.

  • AI Roadmap: A complete path to LLMs, agents, and MCP, built entirely from credible courses and open-source projects.

  • Agent Swarms: Worktrees defer agent collisions, they don't fix them. A software architect shares his single-branch workflow for running multiple coding agents without the chaos.

AI CODING HACK

How to stop Codex wasting time searching your codebase

Codex often spends more time searching for code than actually writing it. In large repos, it triggers too many greps and reads the wrong files. This clogs the context with noise.

To fix this, Morph launched WarpGrep. It is a search subagent that runs in its own context window. It returns only the specific file and line ranges Codex needs. You can install it with one command using your API key. 

npx -y @morphllm/morph-setup --morph-api-key YOUR_API_KEY

Choose Codex when prompted. Then get your API key. You can now ask broad questions like explain the auth flow. Codex uses WarpGrep to find the answer.

It takes five seconds instead of seventy. Morph benchmarks show Codex scores three points higher on SWE-Bench Pro with this tool. It also works in Claude Code and Cursor. 

P.S. Get 50+ AI coding hacks for Claude Code, Cursor, and Codex here.

TOP & TRENDING RESOURCES

Click here to watch the tutorial.

Top Tutorial

How to build a coding agent: You'll learn how to build a modular Python coding agent using a three-layer architecture. The tutorial shows you how to swap out LLM providers, manage stateless agent loops with stateful harnesses, and build a terminal user interface that supports real-time event streaming and custom coding tools.

Top Tool

BrowserAct: A browser automation CLI designed for AI agents. It enables agents to open pages, read their state, click, type, and extract data.

Top Repo

Ruflo (61.5K ⭐): The leading multi-agent harness for Claude. Build and deploy autonomous workflows and conversational AI with adaptive memory, RAG, and native Claude Code integration. 

Trending Cookbook

How agents are transforming work (by OpenAI): Traditional chatbots usually only handle quick tasks, but the real impact of advanced AI agents on the workplace has been hard to pin down. This research paper shows that employees are quickly moving toward using agents for long-term projects, with non-devs leading the charge in expanding what they can do. 

IN CASE YOU MISSED IT

Our most-clicked story from yesterday

A GitHub director rebuilt her workday using 40 automations, which helped her become a more effective leader. She published a guide detailing each one and why it's essential. 

Grow customers & revenue: Join companies like Google, IBM, and Datadog. Showcase your product to our 300K+ engineers and 150K+ followers on socials. Get in touch.

What did you think of today's newsletter?

Your feedback helps us create better emails for you!

Login or Subscribe to participate

You can also reply directly to this email if you have suggestions, feedback, or questions.

Until next time — The Code team

Keep Reading