spoonai
TOPMicrosoftAnthropicClaude Code

Microsoft Killed Internal Claude Code — Because Engineers Loved It Too Much

Microsoft is canceling Claude Code across the Experiences & Devices org (Windows, M365, Teams) by June 30. Reason: ~5,000 engineers used it so heavily that token bills torched the annual AI budget in months. They're moving to GitHub Copilot CLI. This is the clearest 'enterprise AI cost reckoning' of 2026.

·9분 소요·Windows CentralWindows Central
공유
Microsoft cancels internal Claude Code licenses — illustration
Source: Windows Central

The whole story is in one twist: it got cut not for being unused, but for being used too well

Here's the deal: Microsoft is canceling most internal Claude Code licenses. The target is Experiences & Devices (E&D) — the org behind Windows, Microsoft 365, Outlook, Teams, and Surface — with a cutoff of June 30, 2026. Normally "we dropped an AI tool" makes you think it underperformed. This is the opposite. It was so good, used so much, that the money stopped making sense.

The numbers nail the paradox. Microsoft deployed Claude Code to roughly 5,000 engineers, and by April 2026 monthly usage climbed to 84–95%. Most AI rollouts beg people to actually use the thing; here nearly everyone was on it every month. The catch was the billing. Claude Code is token-based metered, so heavier use means a bigger bill, directly. Some engineers burned $500–$2,000 of tokens a month, and full annual AI budgets vanished in months. The fix: switch to Microsoft's own GitHub Copilot CLI. The nuance — Claude models are still reachable through Copilot CLI. So this isn't "block the model," it's "block the direct-pay pipe to Anthropic."

Why is this one of 2026's most important enterprise-AI stories? Because the glamorous "a Claude Code seat for every employee" era just produced the most credible first-hand evidence that the unit economics may not pencil out — and it came from inside Microsoft, a company as all-in on AI as anyone.

The players — Microsoft who cut it, and Anthropic on the other end

Microsoft is the AI era's biggest bettor (tens of billions into OpenAI) and also owns GitHub Copilot, its own coding assistant. That creates an odd tension: E&D engineers preferred Anthropic's Claude Code over their own company's child. Reports say Claude Code became "perhaps a little too popular" internally, and engineers picking a rival's tool over the in-house product is an uncomfortable truth for Microsoft. Cost was the stated reason; pride may have been the engine.

The internal memo reportedly came from Microsoft EVP Rajesh Jha. Jha credited Claude Code with playing an important role in helping Microsoft understand AI-assisted software development, but pointed to Copilot CLI's decisive edge: Microsoft can shape the product directly through GitHub. Translation: "It was great. But rather than pour money monthly into someone else's tool, we'll make ours that good."

Anthropic essentially created the agentic-coding market with Claude Code. It first landed inside E&D in December 2025 and, in under six months, became a tool nearly the whole org depended on. For Anthropic that's a double-edged sword: a happy "product-too-good" problem and a "big customer's metered-bill shock" at once. You build something so good the customer uses all of it — and then leaves over the bill. A brand-new kind of problem the SaaS era never had.

What happened, in numbers

The "more you use, more you pay" structure of token metering is clearest as a table.

Item Detail
Org Experiences & Devices (Windows·M365·Outlook·Teams·Surface)
Internal rollout December 2025 (Claude Code)
Scale ~5,000 engineers
Monthly usage 84–95% by April 2026
Per-engineer token spend $500–$2,000/month
Cutoff June 30, 2026
Replacement GitHub Copilot CLI (Claude models still callable)
Decision memo EVP Rajesh Jha

Don't miss the multiplication: 84–95% usage times $500–$2,000 per head. Five thousand engineers, nearly every month, hundreds to thousands of dollars each, adds up to tens of millions a year. And metering's scary part is unpredictability. A per-seat flat license (say Office at $30/month) makes budgets land cleanly; token metering spikes the instant an engineer turns an agent loose on a big codebase. One report put it perfectly: managing AI cost now looks "more like AWS billing than Office licences." That sentence is the whole story.

And it's not just Microsoft. Uber's CTO said in an April interview with The Information that the company burned its entire 2026 AI coding budget in just four months. Individual engineers spending thousands a month are reported across the industry. Microsoft's move isn't one company's budgeting accident — it's the biggest visible slice of a wall the whole industry hit at once.

Who benefits — Microsoft, Anthropic, GitHub

For Microsoft, two wins. First, immediate cost control: plug the metered leak of tens of millions and redirect that budget to a predictable in-house tool. Second, the bigger strategic prize — forced Copilot dogfooding. Stopping engineers from defecting to a rival and routing them to Copilot CLI feeds back into making Copilot itself better fast. "Our engineers don't use our tool" is a deadly signal to a product team; this flips it by fiat. Jha's line about shaping the product through GitHub is exactly this.

For Anthropic, there's an unexpected upside too. On the surface it lost a big customer, but Claude model calls survive through Copilot CLI, so much of the API revenue stays. And the fact that 5,000 Microsoft engineers preferred Claude Code over Microsoft's own Copilot is itself Anthropic's most powerful marketing reference. "Even inside Microsoft, our tool was #1" is a brutal weapon in other enterprise deals. At the same time, Anthropic now feels pressure to strengthen billing options beyond pure metering — enterprise flat-rate, budget caps. It got assigned the homework of solving "the product's so good the customer leaves" through pricing design.

For GitHub, it's a clear win. A parent-mandated migration of 5,000 engineers to Copilot CLI lifts usage, feedback, and internal loyalty in one shot. For a team long judged "weaker than Claude Code," this hands it both the resources and the mandate to level up overnight.

History — the fate of tools "too good to afford"

Metered tools that get popular then boomerang into a cost bomb are a recurring cloud-era story. This sits squarely in that lineage.

Precedent — the cloud cost shock (2018–2020). As companies rushed to AWS/GCP, the "pay for what you use" charm soured when an engineer's idle instance or data-transfer fees produced bill bombs. That's why "FinOps" became a job. Lesson: metering is heaven at adoption, hell at peak usage, and only stabilizes once cost governance arrives. Agentic AI is riding the same curve, and "AI FinOps" will soon be a real role.

Precedent — the death of unlimited plans. Telecom and streaming "unlimited" tiers that bled money on heavy users and reverted to tiered pricing follow the same pattern. Lesson: "unlimited for all" is great for the average user, but top heavy-user cost breaks the whole economics. Claude Code's 84–95% usage means basically everyone was a heavy user — a structure unlimited/flat models can't survive.

Counter-success — GitHub Copilot's flat rate. Tellingly, Copilot launched per-seat flat ($10 individual, $19–39 enterprise), making cost predictability its weapon. It was judged weaker than metered agents, but "the bill won't spike" was decisive for enterprise buyers. Lesson: sometimes it's not the strongest tool but the most predictable cost structure that wins the enterprise standard. Microsoft reverting to Copilot CLI proves the point.

How rivals counter

Anthropic answers two ways. First, strengthen enterprise budget caps, flat options, and cost-visibility dashboards to directly defuse the "bills spike" fear. Second, improve Claude Code's token efficiency so the same task finishes on fewer tokens — drop per-task cost and the metering dread shrinks. A two-front war: prove "better than Copilot" on product while making it "as safe as Copilot" on price.

GitHub/Microsoft ride the momentum to crank up Copilot CLI's agentic features and lean into "flat + predictable + multi-model (Claude included)." The core message: power and budget control. Real-world data from 5,000 internal engineers fuels the improvement.

Coding startups like Cursor and Cognition get a warning and an opening. The warning: fail to prove metered-model cost sustainability and big customers can bolt anytime. The opening: whoever nails cost predictability and governance (per-team budget caps, usage analytics) first takes the enterprise. The next round's battleground shifts from raw model performance to "product design that tames cost."

So what actually changes

For engineers and dev teams, the infinite-token party is over. Internal AI coding tools will now ship with per-team/per-person budget caps, usage dashboards, and cost alerts as defaults. "Run agents all you want" becomes "spend smart inside this month's token budget." And as multi-model environments standardize (pick Claude, GPT, or in-house in one tool), cost-optimal routing — swapping models by task difficulty — becomes a daily developer skill.

For the AI industry and startups, it's time to prove unit economics. Agentic AI raised money on performance demos; the core question now is "can you deliver this at a sustainable cost?" Token efficiency, caching, cheaper inference, and flat-rate design rise to compete alongside raw capability. Pair this with Samsung pre-shipping next-gen HBM4E this week and the picture sharpens: the two levers on the "AI cost crisis" are cheaper inference hardware (HBM) and smarter cost design (software).

For decision-makers and general readers, this marks Act 2 of AI adoption. If Act 1 was "deploy first," Act 2 is "we saw the value, now tame the cost." Crucially, AI isn't retreating because it failed — it's moving to cost discipline because it worked too well. When even a giant like Microsoft stops in front of the bill, every organization is about to face the same question: how much, how, and under whose control do we spend on AI?

References

관련 기사

무료 뉴스레터

AI 트렌드를 앞서가세요

매일 아침, 엄선된 AI 뉴스를 받아보세요. 스팸 없음. 언제든 구독 취소.

매일 30개+ 소스 분석 · 한국어/영어 이중 언어광고 없음 · 1-클릭 해지