Welcome back. OpenAI keeps finding ways to make coding feel fun again. Their weekend update pulled over 3 million views on X — and developers are loving the playful twist. See it here.
Also: How to do agent-native product management, pressure-test a startup idea with a Codex skill, and follow the Musk v. Altman trial in a transcript-built wiki.
Today’s Insights
Powerful new updates and hacks for devs
Cursor's bet: the agent harness is the product
How to skip copy-pasting errors into Claude Code
Trending social posts, top repos, and more

TODAY IN PROGRAMMING
Anthropic unveils an AI vulnerability scanner for enterprise teams: The AI lab just released Claude Security in public beta, using Opus 4.7 to find and patch vulnerabilities in enterprise code. Its multi-stage validation cuts down on false positives, letting teams fix issues quickly. You can send scan results to Slack and Jira via webhooks or export them as CSV and Markdown. See how it works.
xAI's newest model brings always-on reasoning at a fraction of the cost: The Musk-founded AI lab just shipped Grok 4.3, which features a reasoning-first engine and a massive 1M token context window. At $1.25 per million input tokens and $2.50 for output, it's significantly cheaper than Claude Opus or GPT-5.5. They also launched a voice cloning API that lets developers create custom voices in under two minutes or choose from over 80 voices in 28 languages.
Developer's AI-coded app success turns into maintenance nightmare: A viral Reddit post detailed how a developer spent six months shipping with Cursor, Lovable, and Bolt, resulting in a successful app but a codebase that was a total disaster. When a new hire joined and saw the repo, they were completely lost. Now, senior engineers are calling for stricter code reviews and treating sloppy, unchecked AI-generated pull requests as a performance issue.

PRESENTED BY GRANOLA
Most AI notetakers just transcribe and send a summary. Granola is different—it’s built for everything after the meeting.
It runs quietly in the background while you take notes your way—no interruptions.
Then turns your notes into clear summaries, action items, and next steps—from your POV.
Chat with your notes to draft follow-ups, prep meetings, or turn conversations into work instantly.
Perfect for back-to-back meetings.
Try Granola for a month at no cost with code: THECODE (1 month off any paid plan)

INSIGHT
Cursor's bet: the agent harness is the product

Source: The Code, Superhuman
Beyond the model. Most devs experimenting with AI coding agents start by looking for the best model. Cursor’s betting that's the wrong starting point. They just released an SDK that treats their agent harness (the tools, prompts, and edit logic surrounding the model) as the actual product. Their pitch is simple: the real value isn't the model itself, but the layer that manages how it works.
Inside the harness. OpenAI and Claude models are trained on different file-editing formats, so Cursor tailors the output to each model to save reasoning tokens and reduce errors. Since standard benchmarks overlook these nuances, they use a custom metric called “Keep Rate” to track how much AI code stays after user edits, with every update tested via A/B test.
The payoff is real. In just one sprint, Cursor slashed unexpected tool call errors by 10x. They also used clever prompting to fix a Claude quirk where the model would avoid long-context tasks, proving that better scaffolding was the answer.
Call it “Model-Harness-Fit”. Cursor isn't alone here. Nicolas Bustamante, whose startup Fintool was acquired by Microsoft, showed how Claude Code, Codex CLI, and Copilot CLI produce totally different results from the same model. Bustamante argued that since each new model breaks the previous version's harness, the work of maintaining these AI tools never really stops.

IN THE KNOW
What’s trending on socials and headlines

Meme of the day.
Always On: If your long-running coding agents die the second you shut your MacBook, this 2-step setup lets them run even when it's closed.
Agent-Native PM: One PM is running his entire product on coding agents alone. Here's the step-by-step breakdown of his exact stack.
Skill Check: A Google senior engineer shares which AI certifications hiring managers actually care about, and which ones quietly collect dust on your resume.
Idea Killer: This Codex skill pressure-tests your startup idea, hunts fatal flaws, maps competitors, and scopes a 2-week MVP. OpenAI cofounder Greg Brockman approved (1.9k likes).
One-Prompt Module: A founder built a native image processing module for his JavaScript runtime using a three-sentence Claude prompt. Here’s the prompt.
Trial Tracker: The Musk v. Altman trial is unfolding right now, and this wiki built from court transcripts is the cleanest way to follow every twist.

AI CODING HACK
How to skip copy-pasting errors into Claude Code
Copy-pasting error logs into Claude is a major pain. You lose formatting, waste time, and truncation often cuts off the parts you actually need. This fix lets you skip the manual step entirely by sending any command's output directly into Claude:
cat error.log | claude "fix this"By reading the full “stdout” exactly as it appears in your terminal, the model gets much better context than a messy copy-paste. This works for any command, making it easy to diagnose failing tests:
npm test 2>&1 | claude "diagnose the failures"To move even faster, set up a shell alias like: alias fix='claude "fix this"' to close the loop instantly.
P.S. You can find 50+ AI coding hacks here.

TOP & TRENDING RESOURCES
Top Tutorial
Build and ship with Codex: This 4-hour tutorial shows you how to build and ship full-stack web apps using OpenAI’s Codex. You'll pick up hands-on agentic coding skills like leveraging plugins, setting up automations, and running sub-agents, while mastering stacks like Next.js and Vercel to level up your workflow.
Top Tool
Montage: AI agents are notoriously slow and expensive when it comes to rendering UI. Montage solves this by turning simple intent schemas into production-ready components. It’s 10x faster, slashes token usage by 100x, and keeps everything perfectly on-brand.
Top Repo
Ruflo (39.5k ⭐): This repo is the leading orchestration platform for Claude, designed to deploy multi-agent swarms and coordinate autonomous workflows. It features an enterprise-grade architecture with self-learning swarm intelligence, RAG integration, and native support for Claude Code and Codex.
Trending Paper
Speeding up agentic workflows (by OpenAI): Traditional API requests created major latency bottlenecks for coding agents by repeatedly processing the entire conversation history for every action. Switching to persistent WebSocket connections cached this state, eliminating redundant work and speeding up end-to-end agentic workflows by 40%.
Grow customers & revenue: Join companies like Google, IBM, and Datadog. Showcase your product to our 260K+ engineers and 150K+ followers on socials. Get in touch.
What did you think of today's newsletter?
You can also reply directly to this email if you have suggestions, feedback, or questions.
Until next time — The Code team




