The compute shortage is breaking Claude. Theo and Ben of Nerd Snipe argue Anthropic's infrastructure arbitrage - shunting users from optimized Nvidia stacks to Google TPUs and Amazon Trainium chips - has crippled performance. Their primary conspiracy: Anthropic eliminated the price premium for its 1-million-token window to force traffic onto this inferior hardware, hoarding scarce Nvidia GPUs for internal research.
"The input tokens required for similar tasks increased by 170x, while output tokens grew 64x."
- Theo, Nerd Snipe
The damage is measurable. Theo cites an AMD audit showing Claude Code's input tokens for the same tasks jumped 170-fold between January and March. One AMD engineering project saw its estimated AWS Bedrock costs explode from $26 to over $42,000 monthly. Anthropic’s own engineering, strained by hiding 'thinking' traces to prevent competitor distillation, broke under load. Theo attributes bans of third-party tools like OpenClaw to this cost panic, noting OpenClaw's idle heartbeat function costs him $4.31 daily.
The bottleneck forced an unthinkable alliance. Frank Downing of FYI reports Anthropic, which once blocked Elon Musk's xAI from its models, now leases the 300-megawatt Colossus 1 data center - controlled by Musk’s AI and space interests - to get Claude back online. The deal provides inference capacity ideal for running existing models, not training new ones.
Nathaniel Whittemore of AI Daily Brief frames the crunch as systemic. OpenAI CFO Sarah Fryer describes a 'vertical wall of demand' where compute is the primary bottleneck, ending the era of 'all-you-can-eat' AI subsidies. GitHub and Microsoft are pivoting to usage-based billing. Supply is so tight that even second-tier labs are sold out of tokens.
"This marks the first known instance of the executive branch restricting a model rollout based on policy considerations rather than technical failures."
- Nathaniel Whittemore, The AI Daily Brief
The scarcity reshapes power. Whittemore notes Washington recently blocked the broad release of Anthropic’s Mythos model, citing national security and compute cannibalization concerns - an 'informal, highly improvised licensing regime.' Policy expert Dean Ball says the state is now an active gatekeeper of AI capability, not just a customer.
Garry Tan, on Tetragrammaton, sees a different escape route: decentralization. He argues corporate AI models like ChatGPT and Claude function as surveillance tools, and the real shift will come when users run local agents on their own hardware. Tan aligns with 'Team Pirate,' advocating for personal liberty over a 'unipolar' world controlled by one corporation or government.
The frontier is moving off-planet. Brett Winton of FYI expects SpaceX to begin launching modular AI clusters into low Earth orbit around 2028. If Starship launch costs hit $300/kg, deploying a gigawatt of compute in space could cost $7.5 billion, beating terrestrial infrastructure's $20 billion price tag. For Anthropic and others, getting hardware online today is worth any premium.
The compute crisis is no longer a scaling challenge - it's a strategic cage.



