What happened with hyperscalers burn cash to dominate ai's physical bottleneck?

Amazon, Google, and Microsoft are crushing free cash flow with a trillion-dollar buildout for physical AI infrastructure.

What happened with hyperscalers burn cash to dominate ai's physical bottleneck?

AI labs trade massive equity stakes to hyperscalers in exchange for guaranteed compute and power capacity.

What happened with hyperscalers burn cash to dominate ai's physical bottleneck?

The supply crunch for AI tokens is structural, shifting competition from software to grid access and chip fabs.

BUSINESS

Hyperscalers burn cash to dominate AI's physical bottleneck

Sunday, May 3, 2026 · from 6 podcasts, 8 episodes

All-In AI Daily Brief Moonshots with Peter Diam…The Economist This Week in Startups The a16z Show

Amazon, Google, and Microsoft are crushing free cash flow with a trillion-dollar buildout for physical AI infrastructure.
AI labs trade massive equity stakes to hyperscalers in exchange for guaranteed compute and power capacity.
The supply crunch for AI tokens is structural, shifting competition from software to grid access and chip fabs.

The profit engines of Big Tech are being dismantled for kilowatts. Amazon's free cash flow is down 97% this year, with Microsoft, Google, and Meta each down at least 8%, as they pivot from returning cash to shareholders to a $700+ billion capital expenditure sprint on data centers and power contracts.

This isn't just a chip race. According to Chamath Palihapitiya on the All-In podcast, hyperscalers are signing long-term power purchase agreements at more than double the spot rate to lock in supply, morphing into highly leveraged industrial businesses. The strategic bottleneck has shifted from code to the physical grid, with the White House invoking the Defense Production Act to fast-track transformer manufacturing as data center demand is set to double US electricity use by 2030.

"The asset-light era of software is ending."
- Chamath Palihapitiya, All-In

AI labs are trading their futures for access to this scarce infrastructure. Anthropic has signed $73 billion in combined deals with Google and Amazon, locking in capacity that rivals Microsoft's entire 2024 global data center footprint of about 6 gigawatts. As Nathaniel Whittemore notes on The AI Daily Brief, these are compute-for-equity swaps that bind startups to specific clouds for a decade, securing the gigawatts needed to stay in the race.

Microsoft and OpenAI unwound their exclusive partnership to adapt to this new reality. Microsoft dropped its exclusive sales rights, allowing OpenAI to sell on AWS and Google Cloud, but secured its 27% equity stake and revenue share through 2030. This polyamorous pivot, as Jason Calacanis called it on This Week in Startups, lets Microsoft capitalize on OpenAI's wider growth while hedging its own infrastructure bets.

The demand for AI is not the constraint. Semi Analysis analyst Dylan Patel notes token demand has officially outpaced global compute supply, leading to rationing. Anthropic's Opus 4.7 is currently "compute-gated," and labs are sold out of capacity. This creates a new competitive moat: sheer physical ability to serve tokens. As Ben Horowitz argues on the a16z Show, capital can now bridge technical gaps, making physical bottlenecks - electricity, chips, land - the decisive factors.

"The historical rule that you can't throw money at a software problem to catch up is now false. Capital can accelerate progress."
- Ben Horowitz, The a16z Show

The outcome is a brutal capital race where the hyperscalers, owning the physical stack, extract massive premiums. Google’s new AI-designed TPU chips offer 80% better performance-per-dollar for internal workloads, and the company now controls an estimated 25% of the planet's AI compute. The game is no longer about who has the smartest model, but who controls the factories that produce the tokens.

Source Intelligence

- Deep dive into what was said in the episodes

All-In with Chamath, Jason, Sacks & Friedberg

OpenAI Misses Targets, Codex vs Claude, Elon vs Sam Trial, Big Hyperscaler Beats, Peptide Craze • May 1

OpenAI missed its target of reaching one billion weekly active users for ChatGPT by the end of 2025 and is still short of that milestone in 2026. It also missed its 2025 revenue target.
OpenAI has $600 billion in spending commitments for compute, a figure roughly equal to its secondary market valuation. CFO Sarah Friar is reportedly concerned about revenue growth keeping pace with these expenses.
David Sacks argues OpenAI's recent product release, GPT 5.5, has received strong reviews from developers and is taking coding market share from Anthropic's Opus 4.7, which users complain is compute-constrained and buggy.
Polymarket shows a 32% chance OpenAI goes public by the end of 2026, down from 60% in December, reflecting market skepticism about its IPO timeline.
Chamath Palihapitiya contends the fundamental constraint for AI companies is power supply, not demand, and that supply chain delays for grid infrastructure will hurt OpenAI and Anthropic while benefiting hyperscalers like Oracle, Amazon, Meta, Microsoft, and Google.
David Friedberg cites a BCG 'Rule of Three' theory, predicting mature AI markets will evolve to a 4:2:1 market share ratio between the top three players, with OpenAI and Google currently leading in consumer and Google leading in enterprise.
Friedberg highlights an MIT paper on neural network pruning, showing models can be reduced in size by 90% for the same accuracy, potentially cutting inference costs by 10x and dramatically improving energy efficiency.
In the Elon Musk vs. OpenAI trial, excerpts from Greg Brockman's diary suggest the OpenAI team wanted to transition to a for-profit structure and remove Musk as an investor, which Musk claims breaches charitable trust.
Polymarket odds give Elon Musk a 42-43% chance of winning his lawsuit against OpenAI, with speculation that a win might only result in him being credited back his initial $40 million investment.
Major hyperscalers announced massive 2026 capital expenditure guidance: Amazon at $200 billion, Microsoft and Google at $190 billion each, and Meta at $145 billion, signaling a trillion-dollar infrastructure buildout driven by AI and cloud demand.
This CAPEX surge is crushing free cash flow: Amazon's is down 97%, while Google, Microsoft, and Meta are down 12%, 12%, and 8% respectively, as companies pivot from shareholder returns to heavy infrastructure investment.
Chamath Palihapitiya predicts hyperscalers will become highly leveraged, debt-heavy industrial businesses within five years as they lock in long-term power contracts at rates more than 2x the prevailing spot price.

Also from this episode: (4)

AI & Tech (2)

Sacks notes OpenAI released GPT 5.5 Cyber, which matches Anthropic's Mythos in completing multi-step cyber attack simulations, and is likely the first such model commercially available due to OpenAI's superior compute capacity.
Sacks argues AI cyber models like Mythos or GPT 5.5 Cyber don't create vulnerabilities but discover existing bugs, and their proliferation will lead to a one-time upgrade cycle as systems are hardened, followed by a new equilibrium between offense and defense.

Science (1)

David Friedberg cites phase three trial data for Eli Lilly's Retatrutide, a triple agonist peptide, showing an average weight loss of 37 pounds in 40 weeks, an 80% reduction in liver fat, and significant drops in A1C, non-HDL cholesterol, and triglycerides.

Regulation (1)

Friedberg attended the Supreme Court hearing for Monsanto (Bayer) vs. Hardeman, a case testing federal preemption of pesticide labels under FIFRA against state failure-to-warn laws, with Bayer having paid out $10 billion and reserving another $10 billion for 90,000 outstanding cases.

#AI Infrastructure #Enterprise #Markets #Models #Energy

The AI Daily Brief: Artificial Intelligence News and Analysis

Nathaniel Whittemore

The Week AI Grew Up • May 1

GPU rental prices rose 40% over the last six months, driven by real token demand, not hype. The top two AI labs now generate almost $60 billion in aggregate annual revenue, signaling fundamental strength.
A 'vertical wall of demand' exists where every producible AI token will be sold, according to OpenAI CFO Sarah Fryer. Compute, not model quality, is the current bottleneck for the industry.
GitHub Copilot is shifting to usage-based billing. Microsoft's Satya Nadella stated all per-user services will evolve into per-user plus usage models, reflecting the intensity of AI consumption.
Big Tech cloud earnings showed explosive AI-driven growth: AWS revenue was up 28% YoY, Microsoft Azure grew 40% YoY, and Google Cloud beat estimates with 63% YoY growth, triggering a record market cap jump.
Anthropic is negotiating a funding round targeting a valuation exceeding $90 billion, potentially surpassing OpenAI's $82.5 billion valuation from March. Some secondary market trades already imply a $1 trillion valuation.
Microsoft and OpenAI restructured their deal, granting Microsoft non-revenue-share access to OpenAI's models for five more years and removing the AGI clause. OpenAI is now free to sell models on AWS and Google Cloud.

Also from this episode: (5)

AI & Tech (5)

The White House is considering rescinding Anthropic's supply-chain risk designation to allow government use of its models, but some officials oppose a broader rollout of Mythos due to national security and compute capacity concerns.
Cursor launched an SDK and OpenAI updated Codex for non-developers, asking users to define their role. This signals a battle over interface philosophy: Claude separates technical and non-technical work, while Codex bets on a unified tool for all.
A viral MS Paint-style prompt instructs AI to redraw images in a 'clumsy, scribbly, and utterly pathetic way,' exemplifying a cultural trend toward low-fidelity, humorous outputs that contrast with the industry's growing maturity.
OpenAI's Codex model developed an unexplained fixation on mentioning goblins, gremlins, and other creatures. The company traced it to personality reinforcement learning, where a 'nerdy' training preference spilled over, highlighting how quirks can propagate in model-based training.
A New York Times op-ed predicts a 'permanent underclass' from AI. Nathaniel Whittemore argues Silicon Valley builders often misjudge AI's real-world economic impact, citing economist Kevin Bryan's view that economists largely reject this permanent underclass thesis.

#Enterprise #Regulation #Big Tech #Models

AI Lab Power Rankings • Apr 29

Nathaniel Whittemore reports Microsoft and OpenAI unwound major parts of their partnership. Microsoft remains OpenAI's primary but non-exclusive cloud partner, loses a revenue share, and retains a 27% equity stake through 2032.
OpenAI's GPT-5.4 is now available as a limited preview on AWS, with GPT-5.5 coming soon, a direct result of the amended partnership ending Microsoft's exclusivity.
An aggregated AI assessment ranked Google first, OpenAI second, Microsoft third, Anthropic fourth, Amazon fifth, Meta sixth, XAI seventh, and Apple eighth. Whittemore's personal scores were significantly harsher.
Whittemore argues incumbency in the enterprise is worth less than many think for AI adoption, giving Anthropic and Microsoft equal scores. He ranks Google behind both OpenAI and Anthropic for enterprise AI.
He gives Google a low momentum score of 3 out of 10, citing its struggle to break into the agentic and coding-led conversation of 2026, despite strong narrative positioning at the year's start.
Whittemore scores XAI highest on the X-Factor category due to Elon Musk, arguing they have the most room to rise in the next 6 to 12 months despite currently ranking seventh.
He cites analyst Miles Brundage to counter zero-sum thinking, arguing the rapidly expanding AI pie means multiple labs can succeed, with Semi Analysis's Dylan Patel noting token demand outpaces supply.
Whittemore cautions that data from the pre-agentic era, like a WSJ report on OpenAI missing revenue targets, is now a lagging indicator and won't reflect the current structural shift in the industry.

Also from this episode: (3)

AI & Tech (3)

Whittemore's AI lab power ranking uses nine categories: compute, enterprise positioning, platform control, consumer positioning, model leverage, momentum, branded narrative, wedge, and X-Factor. Compute and enterprise are weighted highest.
Amazon launched Quik, an agentic desktop assistant that connects to calendars, email, Slack, and Jira, which The Information framed as a major play for the agentic era.
Claude announced new integrations with creative and professional tools including Adobe Creative Cloud, Blender, Ableton, and Autodesk.

#Big Tech #Agents #Models #AI Infrastructure

How DeepSeek V4 Connects to the US Power Grid • Apr 27

Nathaniel Whittemore notes Google plans a $40 billion investment in Anthropic with $10B upfront and $30B contingent on commercial milestones, building on earlier $3B investments.
Whittemore cites Mireille Securities’ view that Anthropic's deal with Amazon and Google trades equity for compute, with each gigawatt of capacity equated to a full-scale nuclear reactor.
The transcript notes Microsoft's entire global data center footprint in 2024 was around 6 gigawatts, while Anthropic is locking in capacity rivaling that scale.
Anthropic reportedly reached $30 billion in Annual Recurring Revenue, and Amazon's earlier deal required Anthropic to spend $100 billion with AWS over a decade.
OpenAI expects to build 30 gigawatts of capacity by 2030, with 8 gigawatts already identified, partnering with Oracle, data center developers, and smaller clouds.
Amazon invested $50 billion in OpenAI in February, and Microsoft holds an estimated 50% stake, according to the Mireille Securities note.
Meta forecasts up to $135 billion for its AI build-out this year and rents Graviton 5 CPUs from Amazon for agentic workloads while maintaining deals with NVIDIA, AMD, Google, CoreWeave, and Nebius.
The S&P 500 is up 12.5% over the past month, erasing the drawdown from the Iran war, with 82 stocks up more than 10% almost exclusively AI-related.
NVIDIA became the world's first $5 trillion company, nine months after reaching $4 trillion, on the back of AI demand.
Goldman Sachs projected data centers' share of U.S. electricity demand would rise from 6% today to 11% by 2030, identifying power grid constraints as AI's next bottleneck.
The White House invoked the Defense Production Act for grid infrastructure, declaring a national emergency to expand domestic production of transformers, transmission lines, and related components.
DeepSeek V4 Pro is priced at $174 per million input tokens and $348 per million output tokens, undercutting Opus 4.6 and GPT-5.4 by more than 75%.

Also from this episode: (2)

Models (1)

DeepSeek released its V4 model family with a 1 trillion parameter Pro version and a smaller Flash version, both featuring a 1 million token context window.

China (1)

China plans to curb U.S. investment in domestic tech firms like Moonshot and StepFund, following the blocked $2B Meta acquisition of Manus on national security grounds.

#AI Infrastructure #China #Energy #Startups

Moonshots with Peter Diamandis

Google Invests $40B Into Anthropic, GPT 5.5 Drops, and Google Cloud Dominates | EP #252 • Apr 30

Google is investing $40 billion in Anthropic, providing five gigawatts of TPU compute over five years, signifying a major commitment to the AI frontier.
Amazon is investing $33 billion in Anthropic, committing to $25 billion in new funds on top of a previous $8 billion, with Anthropic pledging to spend over $100 billion on AWS services.
The AI industry faces a significant bottleneck at TSMC, limiting the availability of chips, which Elon Musk highlights as the fundamental constraint to AI development.
Google Cloud unveiled its eighth generation of TPUs (8T for training, 8I for inference), offering three times faster training performance and 80% better performance per dollar.
Alex reports that Frontier Math Tier 4 benchmark, a proxy for professional-level math, shows approximately 1% monthly gains from frontier AIs, suggesting all such problems could be solved in four to five years.
Alex argues that Anthropic's projects, including 'Project Deal' (running a marketplace), are driven by a strategy to maximize the economic value per token generated by their models.
The trial between Elon Musk and OpenAI in Oakland Federal Court has begun with jury selection, unveiling details through discovery that are likely to be publicly aired.
Startup CEOs are engaging in 'token maxing,' spending heavily on AI compute, which Dave views as a necessary step for learning and getting into the AI race, despite it appearing as a vanity metric.

Also from this episode: (13)

Models (1)

OpenAI released GPT 5.5, seven weeks after GPT 5.4, showcasing a 37-point increase in long-context reasoning and a 60% reduction in hallucinations compared to 5.4.

AI & Tech (10)

Moonshot AI launched Kimi K2.6, a trillion-parameter open-source model that costs 30 times less than closed models, trained for $4.6 million, demonstrating significant cost-efficiency.
Dave notes that current AI models can manage dozens or hundreds of other models successfully, enabling consumers to ask a coordinator model to install software or build things without needing technical expertise.
OpenAI's Chronicle builds memories from periodic screenshots of user activity, raising privacy concerns despite its potential to offer 'telepathy-like' assistance.
Salim states that 44% of Gen Z workers are sabotaging AI automation efforts by providing incorrect data, highlighting a significant workplace resistance to AI.
World ID verification is integrating into Zoom to combat deepfake fraud, which caused $130 million in losses between 2019-2023, and is projected to reach $40 billion by 2027.
Dr. David Lutske's viral post demonstrates Grok creating a realistic AI Frenchwoman with a reflective ID, suggesting video ID verification may soon be unreliable.
Sheikh Mohammed announced that the UAE will launch an agentic AI government model within two years, with 50% of all government service operations run by AI.
OpenAI released ChatGPT for clinicians, a free AI co-pilot that outperforms human doctors on health benchmarks, scoring 59 compared to 43.7 for human clinicians.
Project Top Heart by NYU and Stanford uses AI to analyze 20 variables, aiming to increase the number of viable donor hearts for transplant by an additional 500 per year.
The FDA-approved blood pressure medication Candesartan has been repurposed using AI to stop and inhibit MRSA infections, which affect 2.8 million people and kill 35,000 annually in the U.S.

Science (2)

MRNA vaccines for pancreatic cancer show lasting results, with 87.5% of patients who generated a strong immune response still alive after six years, compared to a historical 13% survival rate.
A single-shot CAR-T infusion achieved 100% cancer-free status in all 20 melanoma patients within two months of treatment, with no recurrence after a median follow-up of 15.3 months.

#Models #AI Infrastructure #Agents #Big Tech

The Intelligence from The Economist

Power ranges: AI faces supply crunch • Apr 29

OpenAI shut down its Sora video generation tool to allocate scarce computing resources toward more lucrative ventures, reflecting an industry-wide AI compute shortage.
Weekly AI token processing on Open Router quadrupled from January to March 2024, illustrating surging AI demand that hardware cannot match.
Five major U.S. cloud providers, including Amazon, Meta, and Microsoft, will spend close to $700 billion on AI data center buildouts this year.
Data center construction faces local opposition over electricity, land, and water usage, causing project delays amid the urgent AI capacity push.
NVIDIA supplies over two-thirds of the world's AI processing power, but its chips are sold out, forcing companies to use older 2-3 year old hardware.
TSMC is the sole manufacturer for most advanced AI chips. Its capital expenditures are increasing by $60 billion this year, but capacity remains constrained.
Elon Musk's proposed 'TerraFab' aims to exceed all current chip fabrication capacity by 2030, a project analysts estimate would cost $5 to $13 trillion.
A prolonged AI supply crunch could reverse the trend of falling inference prices, leading to higher costs for users and potentially slowing AI adoption.
Millennial-focused direct-to-consumer brands like Allbirds face pressure from rising interest rates, expensive online ad markets, and competition from larger, established companies.

Also from this episode: (5)

AI & Tech (5)

A sophisticated spyware attack in Indonesia used a fake tax app to steal biometric data and drain over $26,000 from a charity accountant's bank accounts.
Criminal groups now operate a 'malware as a service' model, buying and selling stolen data and malicious software on platforms like Telegram to execute rapid, personalized attacks.
The global cybercrime industry is estimated to generate $500 billion annually, a scale comparable to the global illicit drug trade.
Security firm Infoblox identified a software cluster targeting victims in over 20 countries, with criminals integrating AI chatbots and deepfake tools to enhance attacks.
Allbirds is abandoning its footwear business, selling all shoe assets and rebranding as Newbird AI to pivot towards AI compute infrastructure.

#AI Infrastructure #Chips #Macro #Social Media #Enterprise

This Week in Startups

To the left of this of thumbnail i want to add a chinese flag to the left destroying the meta logo. because the title is China Kills Meta / Manus Deal (Story Of The Year) | E2281 • Apr 28

Microsoft and OpenAI restructured their partnership, ending Microsoft's exclusive sales rights and allowing OpenAI to pursue deals with Amazon AWS and others.
Microsoft's key concession was removing a revenue share it paid to OpenAI. In exchange, Microsoft secured its revenue share from OpenAI until 2030, regardless of AGI progress, and remains the primary cloud provider.
Microsoft owns 27% of OpenAI, a stake analysts value in the hundreds of billions if OpenAI achieves multi-trillion dollar status.
China's National Development and Reform Commission blocked Meta's acquisition of the AI startup Manus, ordering the deal's cancellation.
The Manus deal was partially executed, with capital transferred and some employees already moved to Meta's Singapore HQ before the Chinese government's intervention.
The move signals the end of 'Singapore washing,' where Chinese entrepreneurs relocated headquarters to access foreign capital, as China now asserts control over companies formed within its borders.
A purported dashcam video from Chongqing on March 9, 2026, shows a Harmony OS self-driving car hitting a toddler, though its authenticity is questioned.

Also from this episode: (6)

AI & Tech (2)

AGI attainment will now be independently verified by a third party, removing OpenAI's sole authority to declare its arrival.
A viral AI thread demonstrates using ChatGPT to analyze palm photos and generate palm prints, raising significant biometric privacy concerns.

Startups (1)

Benchmark invested $75 million in Manus in May 2025, which facilitated the startup's move from China to Singapore.

China (1)

Hosts speculate the Chinese government could escalate by pressuring Singapore for extradition or seizing assets, citing historical actions like the disappearance of Alibaba's Jack Ma.

Autonomous Vehicles (2)

The Epoch Times reported the car accelerated from 24 km/h to 31 km/h before the incident, and that the child was a local shopkeeper's toddler.
Jason Calacanis argues self-driving cars will be judged by superhuman, not human, reaction times, and a fatal pedestrian incident in the US would end the industry.

#Startups #Big Tech #China #Models

The a16z Show

Ben Horowitz on Venture Capital and AI • Apr 27

Also from this episode: (12)

VC (3)

Ben Horowitz says the traditional venture capital model was built for an era where only about 15 companies per year reached $100 million in revenue.
Horowitz says a16z's first system innovation was sharing economics with partners but centralizing control. He argues shared control makes organizations impossible to change.
To bootstrap its network, a16z used unconventional tactics. The firm redirected management fees into network-building and gained corporate introductions by leveraging its HP briefing center connection.

Startups (3)

Horowitz argues that with software's rise, the number of potential $100M-revenue companies would increase dramatically. He and Marc Andreessen believed the figure could be 200, not 15.
The firm viewed itself as a network to be bootstrapped. Horowitz explains successful network bootstrapping is the hardest part, using the analogy of selling the first telephone with no one to call.
Horowitz explains their first controversial investment - a quarter of the $300M Fund I into the Skype buyout from eBay - generated a 4x return in 18 months. This validated their approach to LPs.

AI & Tech (3)

Horowitz says AI has fundamentally changed venture capital. The historical rule that you can't throw money at a software problem to catch up is now false. Capital can accelerate progress.
He argues that with AI, code and user interface are no longer significant moats. The key questions for startups are now defining new barriers to entry.
He argues the biggest AI danger is over-regulation and a U.S. failure to compete, which could lead to China achieving superintelligence first, creating a dangerous power imbalance.

Culture (2)

Horowitz redefines company culture as a specific set of actions and behaviors, not just beliefs or values. He cites the Bushido concept that culture is a set of actions.
He argues companies are dictatorships, not democracies, because dictatorships are more efficient in competition. Countries need resiliency to bad leadership, but companies must optimize for speed.

Enterprise (1)

Horowitz has chosen not to pursue leveraged buyouts despite their profitability with AI. He says the LBO culture of firing people to extract efficiency is opposite to venture capital’s growth mindset.

#AI Infrastructure #Business #Startups #Macro

Hyperscalers burn cash to dominate AI's physical bottleneck

Source Intelligence

Related Stories