Elon Musk leases SpaceX supercomputer to Anthropic ending model war

May 16, 2026Saturday, May 16, 2026Summary from 4 contributing podcasts.

Elon Musk is leasing the Colossus data center to rival Anthropic, ending the AI model race to become a compute utility.
Anthropic’s engineering teams now rely on AI agents that manage memory and self-correct, making manual coding obsolete.
Compute scarcity is so severe that Anthopic routes users to inferior chips while hoarding Nvidia GPUs for research.

The AI industry’s strategic pivot happened this week. Elon Musk’s SpaceX and xAI are leasing their Colossus 1 supercomputer - a 220,000 GPU facility - to Anthropic. The move solves Anthropic’s acute compute shortage and signals Musk’s retreat from the frontier model race. As Frank Downing noted on FYI, Musk is becoming a compute kingmaker, a utility provider like Nvidia, while Anthropic gets the raw power it needs to stop throttling its users.

The deal underscores a brutal new reality: hardware, not software, is the ultimate bottleneck. Nathaniel Whittemore’s analysis highlighted that Anthropic had the best coding model but couldn’t run it reliably. Musk had the world’s largest AI cluster but a mediocre model. The partnership is a marriage of convenience dictated by scarcity.

“Anthropic is aggressively routing public users away from high-performance Nvidia H100 clusters and onto inferior chips like Google’s TPUs and Amazon’s Trainium.”
- Theo, Nerd Snipe

This hardware triage has a direct user cost. Theo and Ben cited an internal AMD audit showing the tokens required for identical Claude Code tasks increased 170-fold from January to March, with costs exploding from $26 to over $42,000. Anthropic is sacrificing public performance to preserve its research pipeline, creating a two-tier system where employees use superior internal versions.

The shift in focus is evident in Anthropic’s product launches. At its recent ‘Code with Claude’ event, the company didn’t announce a new model. Instead, it unveiled ‘Managed Agents’ with features like Dreaming, which allows AI to review its own work, and Outcomes, which uses a grading agent to iterate until a rubric is met. Boris Cherny, creator of Claude Code, stated that manual coding has effectively disappeared within Anthropic’s own teams.

The endgame is an AI that never stops learning. Anthropic CEO Dario Amodei revealed the company is targeting ‘infinite’ context windows. As Whittemore explained, if a model’s context window never clears, it gains a persistent, evolving memory, reducing the need for constant retraining. The distinction between reasoning and processing is collapsing.

“If rented out as infrastructure-as-a-service, a gigawatt generates roughly $15 billion per year. If used by the model builder itself, like OpenAI or Anthropic, that same capacity can drive $30 billion in annual revenue.”
- Frank Downing, FYI

The economics justify the hardware arms race. Building a one-gigawatt data center costs $60 billion but can generate $30 billion annually for a vertically integrated AI company. This payoff is why the next scaling frontier may be space. Downing’s co-host Sam Korus argued that orbital data centers, launched by SpaceX’s Starship, could deploy compute faster and cheaper than fighting for terrestrial power and land.

Ben Horowitz’s framework explains the strategic mindset. On The a16z Podcast, he argued a founder’s only job is ‘right product, right time.’ Everything else is support. For Musk, the right product is now compute infrastructure. For Anthropic, it’s agentic workflows. Both are focusing on what they can uniquely deliver amidst a shortage - be it power or product - and outsourcing the rest. The model war is over. The utility war has begun.

Source Intelligence

- Deep dive into what was said in the episodes

The a16z Show

Ben Horowitz - "Your ONLY job is Right Product, Right Time" • May 14

Also from this episode: (8)

Startups (5)

Horowitz says the founder's only job is delivering the right product at the right time; all other activities like hiring or fundraising are just support for that singular goal.
A company's story is its strategy; Horowitz argues founders must continually refine and articulate this 'why' in writing to guide recruitment, investment, and internal execution.
Horowitz advises founders to identify opportunities by solving tangible shortages like power transformers, electricity, chips, or cooling, where unlimited demand meets constrained supply.
Horowitz views pivots as difficult and late-stage pivots as nearly impossible; he says founders constantly adjust assumptions, but a wholesale pivot should only occur if there is no other choice.
For fundraising, Horowitz tells founders to craft a story that convinces themselves to rejoin their own company, rather than tailoring a pitch to guess what investors want to hear.

AI & Tech (3)

In an AI era, Horowitz says creativity and relationship-building skills are undervalued but increasingly critical, as AI excels at grind tasks but cannot generate original ideas or maintain high-fidelity human connections.
Horowitz wrote 'Good Product Manager, Bad Product Manager' to define the role: the product leader's sole responsibility is ensuring 'right product, right time,' not writing requirements or pitching customers.
Horowitz argues defensibility in AI can still come from solving hard technical problems, possessing the customer, or building a strong brand, citing OpenAI's consumer base as a key asset despite potential model parity.

#Startups #VC #Models #Energy

Nerd Snipe with Theo and Ben

Anthropic solved their compute problem by buying it from Elon? • May 14

Theo argues Anthropic's Claude Opus 4.7 is not a meaningful improvement over prior models and comes with user experience regressions due to overly strict safety system prompts.
Figma's stock dropped 5% after a competitive announcement from Anthropic, contributing to an 85% decline since its IPO. Ben cites this as evidence of Anthropic's negative market impact.
Theo reveals Anthropic confirmed a routing error last year where 0.8% to 16% of Sonnet requests were sent to a dumber, 1M-context version, establishing a precedent for performance regressions via model versioning.
Theo's primary conspiracy is that Anthropic now forces all Claude Code users onto the dumber 1M-context model version to route traffic away from scarce Nvidia GPUs and onto partners like AWS and Google TPUs, explaining the performance drop.
The hosts claim Anthropic employees use different, superior internal versions of Claude and its tools, creating a disconnect where employees don't experience the external product's failures and dismiss user complaints.

Also from this episode: (5)

AI & Tech (5)

Anthropic is aggressively banning third-party tools like T3 Code and OpenClaw that interface with Claude Code. Theo attributes this to a compute crisis and poor caching implementations that increase costs.
Theo states OpenClaw's heartbeat function, which polls for tasks, costs him $4.31 daily without active use, extrapolating to roughly $120 monthly in wasted API spend.
Ben cites a GitHub issue from an AMD AI head showing a massive spike in Claude Code usage and cost at their company. Input tokens increased 170x and costs jumped from $26 to over $42,000 monthly after model updates.
Theo and Ben argue Anthropic's engineering is incompetent, citing recent changes like a new tokenizer, a 5-minute cache TTL, and hidden reasoning data that complicate their stack across three hardware providers, leading to reliability and intelligence issues.
Theo attributes Anthropic's problems to a research-first, safety-obsessed culture that devalues engineering and product reliability, creating opaque policies and a 'holier-than-thou' attitude that frustrates developers.

#Models #Chips #Enterprise #AI Infrastructure #Big Tech

FYI — For Your Innovation (ARK Invest)

SpaceX And Anthropic Partnership | The Brainstorm EP 131 • May 13

Anthropic has been supply-constrained on compute, restricting user tokens and cutting off research compute, forcing them to lease Colossus 1 - a 300MW, 220,000 GPU data center - from SpaceX to lift capacity limits.
Anthropic previously enforced a policy to cut off competitors like XAI from using its models, but the compute deal has reopened communication between them.
SpaceX’s vertical integration from chip fabrication to model inference reduces its risk when leasing out compute, as it can more rapidly rebuild capacity.
Building a gigawatt-scale AI data center costs roughly $60B: $19B for the facility, $30B for GPUs, and $11B for other IT equipment like CPUs and networking.
A gigawatt of leased compute infrastructure generates about $15B annually; a model provider operating on that scale can generate $30B in revenue.
OpenAI’s revenue scaled with compute; they generated $20B on roughly 2 gigawatts early this year, implying about $20B per effective gigawatt for inference.
Revenue per watt is increasing as model utility and enterprise willingness to pay rise, creating pricing power for both infrastructure providers and model companies.
Space-based AI compute economics hinge on Starship launch costs; at $300/kg, launching a gigawatt costs $7.5B, beating terrestrial facility costs.
Manufacturing satellites on a production line offers cost efficiencies over building unique terrestrial data centers, where $5B of the $19B facility cost is labor.
Compute scarcity on Earth means AI companies like Anthropic would pay a premium for space-based watts even before launch costs break parity, valuing velocity over price.
ARK analysts project SpaceX could begin scaling space-based AI compute in 2028-2029, reaching tens of gigawatts per year in the early 2030s.
Small modular nuclear reactors are more likely to scale in the US than gigawatt plants, fitting incremental demand like offsetting retired coal plants.

#AI Infrastructure #Models #Enterprise #Big Tech

The AI Daily Brief: Artificial Intelligence News and Analysis

Nathaniel Whittemore

How to Build an AI Native Team with Mike Cannon-Brookes • May 9

Anthropic is developing future models with higher judgment, 'infinite' context windows, and multi-agent coordination, with research head Diane Penn suggesting infinite context could enable continual learning.
Anthropic and SpaceX announced a partnership granting Anthropic full use of XAI's Colossus 1 data center, containing 220,000 Nvidia GPUs operating at 300 megawatts.
Elon Musk explained his decision to lease Colossus 1 after meeting with Anthropic's team and concluding their work was 'good for humanity'.
Nathaniel Whittemore argues Elon Musk's pivot from model builder to compute provider represents AI play 3.0, aligning with his strength in scaling known-but-hard infrastructure projects.
Chamath Palihapitiya predicted power constraints would force AI labs to negotiate for compute, creating an opportunity for SpaceX's excess capacity.
Anthropic will dissolve XAI as a separate company, integrating it as SpaceX AI, and will initially raise API rate limits and eliminate peak hour reductions for Claude Code users.

Also from this episode: (7)

AI & Tech (5)

Anthropic's Code with Claude event centered on agent harnesses and workflow tools rather than a major model release, reflecting the shift from model to ecosystem competition.
Anthropic launched Dreaming, a scheduled memory review system for agents that extracts patterns from past sessions to improve performance over time.
Anthropic's Outcomes feature uses a separate grading agent with a user-defined rubric to automatically evaluate and iterate on agent outputs, improving document quality by 8.4% for Word and 10.1% for PowerPoint.
Managed Agents now support multi-agent orchestration, allowing a lead agent to delegate tasks to specialists with shared context and auditable execution traces.
Anthropic released a suite of 10 predefined financial service agents for Claude Finance, including pitch builders and market researchers, alongside a cookbook for customization.

Enterprise (1)

Anthropic CEO Dario Amodei revealed the company saw 80x annualized growth in revenue and usage during the first quarter of 2026.

Coding (1)

Claude Code creator Boris Cherny disavowed the term 'vibe coding', stating his company uses coordinated AI agents over Slack with automated testing and no manually written code.

#AI Infrastructure #Agents #Big Tech #Startups #Reasoning

Elon Musk leases SpaceX supercomputer to Anthropic ending model war

Source Intelligence

Related Stories