spoonai
TOPAnthropicClaudeAI Safety

Claude Now Writes 80% of Anthropic's Code — and Dario Wants to Install a Brake Pedal

Anthropic's early-June report says Claude now writes more than 80% of the code merged into its own production codebase. As AI-building-AI becomes real, CEO Dario Amodei publicly proposed a verifiable, global option to slow the frontier down.

·10분 소요
공유
Anthropic 'Code with Claude' event signage with an attendee's silhouette
Source: Tom's Hardware

When AI starts writing its own code, who's actually holding the wheel?

Here's the deal: in early June 2026, Anthropic published an internal report titled "When AI Builds Itself," and one number stopped the industry cold. In May, more than 80% of the code merged into Anthropic's own production codebase was written by Claude. Back when Claude Code first launched in February 2025, that figure was in the low single digits. From single digits to 80%-plus in a little over a year.

The scarier part is the slope. On the hardest coding tasks Anthropic tracks internally, Claude succeeded 76% of the time in May — up from around 26% just six months earlier. A 50-percentage-point jump in half a year isn't incremental progress; it's leaping a whole step at once. AI building better AI, and that better AI then improving itself — the so-called recursive self-improvement loop — is no longer a thought experiment.

Right after the report dropped, CEO Dario Amodei went public. His message was blunt: we're flooring the accelerator and we don't even know where the brake pedal is. He argued the world needs to build, right now, a verifiable way to slow frontier AI development if and when it becomes necessary. In other words, the guy building one of the fastest cars on the road is also saying "let's all agree on how to install brakes." That contradiction is the whole story.

The players — Anthropic, Claude, and Dario's two faces

The first protagonist is Anthropic itself — a company that built its entire identity around AI safety, while simultaneously competing head-to-head with OpenAI and Google via a frontier model. Those two identities have always been in tension. "Go safely" and "win the race" often point in opposite directions, and this report is Anthropic dragging that tension into the open itself.

The second is Claude, specifically Claude Code. It's not autocomplete. It takes an issue, designs an approach, writes the code, runs the tests, and produces code that gets reviewed and merged into real production. Anthropic's engineers are shifting from "people who write code" to "people who supervise AI." The picture the report paints: humans set direction and draw boundaries, and the model does most of the actual typing.

The third is Dario Amodei himself. He originally led research at OpenAI, then left over differences in safety philosophy to found Anthropic. He's long made aggressive forecasts — like AI surpassing most human work by around 2027. This time he bolted a safety mechanism onto that forecast: powerful AI may not be something any single government or company can be fully trusted to control alone. He wants mandatory technical testing and auditing — airplane-style — plus government authority to block or reverse high-risk deployments.

The substance — what "80%" actually means

Let's clear up a misconception first. "Claude writes 80% of the code" does not mean Anthropic's engineers stopped working. If anything, their jobs got harder. The high-level decisions — direction, architecture, what to build and what not to build — are still firmly human. What moved to the model is implementation: turning those decisions into code. The center of gravity shifted from hammering a keyboard to directing and reviewing a model.

Why is that read as a warning signal? Because "development speed" itself becomes tied to AI capability. It used to be that building better AI required more engineers and more time — human physical limits acted as a natural speed governor. But if AI writes 80% of the code, the pace of building the next AI scales with the current AI's capability. Better models build the next models faster: a positive feedback loop.

Item Detail
Report title When AI Builds Itself (early June 2026)
Self-written code share 80%+ (single digits in Feb 2025)
Hard-task success rate 76% (~26% six months prior)
Core concept Recursive self-improvement
Dario's proposal A verifiable global slowdown option
Analogy Airplane-grade mandatory testing & auditing
Researchers Marina Favaro, Jack Clark

Anthropic researchers Marina Favaro and Jack Clark wrote that a worldwide frontier slowdown "would likely be a good thing" — with one condition. US and Chinese labs, plus everyone else near the frontier, would have to stop together, under rules outsiders can verify. Stop alone and you lose the race; fake the stop and the cheater wins.

The crux is verifiability. Just as nuclear arms control depends on each side inspecting the other's facilities, an AI slowdown only works if a third party can confirm the slowdown actually happened. Dario's proposal is essentially "let's pre-design an arms-control verification regime for AI." He's not saying stop now — he's saying build the button now so we can press it later if we need to.

What's in it for whom

For Anthropic, this report is a double-edged sword. One edge is a killer marketing line: Claude writing 80% of Anthropic's own code is the most persuasive proof of how strong Claude Code is in real-world use. No enterprise sales pitch beats "we use it that heavily ourselves."

The other edge is safety branding. Anthropic has always differentiated itself as the "responsible frontier lab." Building the fastest car while shouting "let's install brakes" is also a play for leadership in the regulatory conversation. If rules are coming anyway, you'd rather be the one drafting them. Read cynically, there's a regulatory-capture worry too: safety-justified barriers could lock out late entrants and the open-source crowd.

For Dario personally, the message is about consistency. He's said for years that AI will soon become enormously powerful. If that's true, you can't exactly argue for releasing something that powerful with zero safeguards. Acknowledging both the power and the danger is him owning his own prediction. Skeptics counter that the "it's so powerful" claim is itself fundraising hype. Both takes have merit — and how you read this news basically defines how you see the whole AI industry.

Historical echoes — the successes and failures of "let's slow down"

This isn't the first call to slow AI. In spring 2023, thousands — including Elon Musk — signed an open letter for a six-month AI moratorium. The result? Nobody stopped. Some signatories started new AI companies in the meantime. It's the textbook case of why a voluntary, unenforceable pause fails in a competitive field. Dario's "verifiable simultaneous stop" is clearly designed with that failure in mind.

There's a counterexample that did work, though: aviation's safety-certification regime. A plane must pass mandatory technical testing and auditing before it flies, and an entire model can be grounded if something goes wrong. That's exactly why Dario reached for the airplane analogy — a working regulatory model already exists that prevents catastrophic failure without killing innovation. The catch: in aviation, causation is clear and the verification tech is mature, whereas in AI, even defining "a dangerous failure" is contested.

Another touchstone is the IAEA nuclear-inspection regime — proof that even adversaries can sustain an agreement when there's mutually verifiable inspection. Dario's vision is basically "an IAEA for AI." The problem: nuclear has physically trackable targets like enrichment facilities, while AI's only comparable physical footprint is something like a datacenter's GPU clusters. How do you inspect software progress? That remains the open, unsolved hard problem.

How rivals counter-play — OpenAI and open source

This message is awkward for OpenAI. If Anthropic re-raises the "safe frontier lab" flag, OpenAI risks looking like the side that only cares about speed. But OpenAI has its own safety framework and can push back with the logic that getting AI into as many hands as fast as possible is itself a form of safety — let society adapt by spreading it, not by stopping.

China's camp reacts differently. Dario's proposal rests on the premise that the US and China must stop simultaneously. From Beijing's view, "why should we match rules written by the US that's currently ahead?" is a natural suspicion. With Chinese open-weight players like Moonshot and DeepSeek catching up fast, a "simultaneous stop" looks like a loss for the chaser and a win for the leader. That geopolitical asymmetry is the biggest real-world barrier to Dario's plan.

Open source will push back hard too. "Let governments block or reverse high-risk deployments" collides head-on with the open-weight philosophy where anyone can download and run a model. Once weights hit the internet, "reversing" is physically impossible. So expect the open camp to frame this as "closed mega-labs defending their moat." Same words — "AI safety" — and the closed and open camps reach opposite conclusions.

So what actually changes — by who you are

If you're a developer, the concrete shift is that "hand the whole task to an agent" workflows are graduating from experiment to standard. If Anthropic writes 80% of its code that way, your team probably will too before long. The ability to define what to build, review the AI's output, and draw boundaries matters more than raw typing speed. It's less "jobs disappear" and more "the job changes from implementer to supervisor."

If you're a founder or decision-maker, read the regulatory risk early. Dario's exact proposal becoming law is unlikely, but "AI safety certification" landing on the policy table is now clearly a trend — and places like Korea and the EU already have pre-emptive rules. If you train frontier-grade models yourself, factor potential testing and auditing obligations into your cost scenarios.

If you're a regular user, read this as a signal that the AI you use is getting faster and stronger — and that even the company driving that pace is saying it needs a brake. No need to panic, but it's a fair moment to upgrade your mental model from "AI is just a handy tool" by a notch.

🥄 Three Things You're Probably Wondering

— So is AI actually about to go out of control? Too early to say. The report warns about conditions that could make AI harder to control, not control loss itself. Writing 80% of the code is evidence of capability, not of intent or autonomy. That said, the logic that risk grows once development speed outpaces human verification speed is worth taking seriously.

— Is Anthropic sincere, or is this marketing? Honestly, both. The safety message can be sincere while also being Anthropic's single strongest differentiator. Those don't contradict. "Our model is that strong" and "so we need rules" are two sides of the same coin. Rather than forcing the two apart, it's more accurate to accept that they overlap.

— What are the odds this proposal actually happens? Low, near-term. A US–China simultaneous stop is geopolitically very hard. But the idea of "verifiable safety certification" may survive and seep into national regulations in partial form. A staged, aviation-style mandate is a more realistic landing spot than a blanket moratorium.

Sources

Numbers are as of announcement and may change.

관련 기사

무료 뉴스레터

AI 트렌드를 앞서가세요

매일 아침, 엄선된 AI 뉴스를 받아보세요. 스팸 없음. 언제든 구독 취소.

매일 30개+ 소스 분석 · 한국어/영어 이중 언어광고 없음 · 1-클릭 해지