Price:

AI & TECH

Anthropic chokes AI devs as Claude Code's SaaS war escalates

Monday, May 18, 2026 · from 3 podcasts, 4 episodes
  • Developers report a 'dumber' Claude, with AMD audit showing a 170x spike in token costs.
  • Anthropic is routing public users to inferior hardware to preserve Nvidia GPUs for internal research.
  • The AI 'second moment' is here: Claude Code now earns $2.5B annually as a full-stack competitor.

The AI coding boom hit a wall. Developers are flooding GitHub with complaints that Claude Code, the agent that triggered a 'SaaS apocalypse,' is becoming unusably slow and expensive. An internal audit at AMD reveals the scale: between January and March, input tokens for the same tasks increased 170-fold, and one project’s estimated cost exploded from $26 to over $42,000.

Nathaniel Whittemore of The AI Daily Brief frames this as the 'second moment' for AI - a shift from assistants to autonomous builders. Claude Code grew from $1 billion to $2.5 billion in annualized revenue in just two months, its success upending the traditional software stack. Companies like Figma, down 85% since its IPO, are feeling the pressure.

“You realize this is a suicide run... routing users away from hardware they like, to hardware they want their researchers to use.”

- Theo, Nerd Snipe

Theo and Ben on Nerd Snipe argue Anthropic’s problems stem from a hardware crisis. They claim the company is aggressively rerouting public users from optimized Nvidia H100 clusters onto Google TPUs and Amazon Trainium chips to hoard scarce GPUs for internal research. This infrastructure arbitrage explains why Anthropic eliminated the price premium for its 1-million-token context window - it forces traffic onto the cheaper, non-Nvidia hardware where performance degrades.

Anthropic’s engineering moves compound the issue. To prevent competitors from distilling its reasoning, the company began redacting 'thinking' traces from API responses. This requires perfect mapping of thread IDs to hidden server-side data - a brittle system that, when it fails, leaves the model disoriented. A recent cache timeout reduction from 60 minutes to five exacerbates the 'dumbness,' as the model forgets its own logic mid-task.

“The era of using AI to save minutes is over; the era of using it to invent new business models has begun.”

- Nathaniel Whittemore, The AI Daily Brief

The broader market is moving faster than Anthropic’s stability. Whittemore notes enterprise focus has pivoted from time-saving 'efficiency AI' to revenue-generating 'opportunity AI.' The capability overhang is widening: 40% of enterprises are predicted to have working agents by year’s end, while others flounder on data quality. Anthropic’s internal culture, described by Theo as research-first and safety-obsessed, now faces the operational reality of serving millions of paying developers who expect reliability.

The cracks in Claude Code are a stress test for the entire agentic future. As AI shifts from a feature to a foundational competitor, the platform that wins won't just be the smartest - it will be the one that doesn’t break under its own success.

Source Intelligence

- Deep dive into what was said in the episodes

AI InequalityMay 17

  • Nathaniel Whittemore defines Q2 2026 as the AI 'second moment,' shifting from chatbot assistants to workable agentic systems, with stakes marked by billions of weekly users and $650 billion in planned capex.
  • Enterprise AI shifted from pilots to production, with 40% of enterprises predicted to have working agents by end of 2026. Pulsia, a fully agentic company, reached $6 million annualized revenue with a single founder.
  • Anthropic and the Pentagon clashed over terms for Claude's use, leading to Anthropic being designated a supply chain risk and a subsequent lawsuit. ChatGPT's agreement with the Department of War triggered a 775% surge in one-star reviews.
Also from this episode: (8)

Enterprise (1)

  • Claude Code revenue grew from $1 billion to $2.5 billion annualized in two months. Anthropic's enterprise share reached 70% of first-time buyers, and its overall revenue run rate hit $19 billion.

Models (1)

  • The quarter saw rapid frontier model releases: GPT-5.2 Codex, Genie 3, Opus 4.6, GPT-5.3 Codex, Sonnet 4.6, Gemini 3.1 Pro, Nano Banana 2, and GPT-5.4, with no single benchmark winner across common tests.

Open Source (1)

  • OpenClaw became the most starred open-source project on GitHub. Nvidia launched Nemo Claw as an enterprise-grade wrapper, and Anthropic integrated its features into Claude Code and Claude Co-work.

AI & Tech (5)

  • Survey data shows 71% of practitioners used vibe coding, 62% used agentic automation, and the average respondent uses 3.5 models. ROI shifted from time savings (13.6% of use cases) to increased output and new capabilities.
  • Customer service AI adoption is mature with 91% of businesses experimenting, but 64% of customers prefer no AI in interactions. Legal AI adoption lags, with only 15% of tasks using AI despite 80% capability.
  • HR AI deployment grew 320% in 12 months, from 19% to 61%. Finance AI adoption faces data quality obstacles, with 91% of firms reporting low impact.
  • Generative Engine Optimization (GEO) market was under $1 billion in 2025, projected to reach $34 billion by 2034. Sales AI use cases are mature, with 63% categorized as 'primetime' for most organizations.
  • Nathaniel Whittemore argues the capability overhang - the gap between AI's potential and deployed value - is widening, increasing the disparity between leading and lagging companies.

Towards AI That Can Actually InteractMay 12

  • Claude Code's annualized revenue grew from $1 billion to $2.5 billion in about two months during Q1 2026, while its use expanded beyond coding to preview broader agentic trends.
  • Cursor doubled its annualized revenue to $2 billion this quarter, Lovable hit $400 million ARR with a $100M monthly jump, and Replit projects $1 billion ARR by end of 2026.
  • Anthropic reached a $19 billion run rate and captured 70% of first-time enterprise AI buyers per Ramp data, while OpenAI held 25% and about $25 billion in annualized revenue.
  • AI usage surveys show 71% of practitioners 'vibe coded' in the past month, and 62% used automation or agentic AI, with the average user employing 3.5 different models.
  • The Pentagon designated Anthropic a supply chain risk after a dispute over using Claude in military operations, while OpenAI's subsequent DoD deal triggered a 775% surge in one-star ChatGPT reviews.
Also from this episode: (7)

AI & Tech (4)

  • Whittemore argues Q1 2026 marked AI's second moment, shifting from viable chatbots to workable agentic systems with higher economic and corporate stakes.
  • Industry AI capex is projected to reach $650 billion in 2026, triple the spending from a couple of years ago and exceeding the inflation-adjusted cost of the US interstate highway buildout.
  • The Generative Engine Optimization market was valued under $1 billion in 2025 but is projected to reach nearly $34 billion by 2034 as AI referrals convert better than traditional search.
  • Whittemore notes a widening capability overhang where the gap between potential AI value and actual deployed value increases costs, creating a larger divide between leading and lagging companies.

AI Infrastructure (1)

  • Gartner predicts 40% of enterprises will have working agents in production by end of 2026, fueled by products like agent credit cards from Ramp and Stripe.

Enterprise (2)

  • Survey data indicates a shift from efficiency AI to opportunity AI, with time-saving use cases dropping from 19.9% to 13.6% of ROI between January and February 2026.
  • HR AI deployment grew 320% in 12 months from 19% to 61% adoption, while seven states have enacted AI employment regulations.

Can We Reverse Aging?May 17

  • Billionaires are the primary backers of longevity science. Sam Altman invested roughly $200 million in Retro Biosciences. Jeff Bezos is a major investor in Altos Labs, the largest biotech startup launch in history.
  • Domonius notes billionaires investing in longevity R&D hope to profit, which requires treatments becoming widely available. She acknowledges this mirrors the pharmaceutical industry's profit-driven model, but the convergence with tech billionaires raises uncharted questions about control and privacy.
Also from this episode: (10)

Biology (5)

  • Longevity science focuses on cellular rejuvenation - the idea that aged cells can be made to function like younger cells. Susan Domonius points to embryos as proof, noting they shed inherited aging markers shortly after fertilization.
  • Shinya Yamanaka won the Nobel Prize in 2006 for reverting aged mouse skin cells to embryonic form using powerful genes. Early attempts to apply these Yamanaka factors to mice caused monstrous tumors, as cells became unspecialized and developed fatal teratomas.
  • Juan Carlos Izpisua Belmonte tweaked the Yamanaka formula, applying a reduced dose to fast-aging mice. The mice lived longer, looked younger with less gray fur, had stronger muscles, healed faster, and became friscier - lab technicians thought they were replaced.
  • David Sinclair, a Harvard genetics professor, is a controversial figure in longevity research. Domonius notes colleagues criticize him for overselling promises, circulating unsupported claims about reversing aging in dogs, and co-founding a wellness platform with non-mainstream practices.
  • Sinclair’s key breakthrough involved dropping the most cancer-prone Yamanaka factor and using only three to restore vision in blinded mice without causing cancer. Life Biosciences, his biotech, has FDA approval for human safety trials targeting glaucoma and nystagmus.

Longevity (4)

  • Altos Labs recruited top scientists like Belmonte by offering million-dollar salaries, triggering a major academic migration to private industry. The company focuses on predictive research using human organoids and AI-driven virtual cells to bypass unreliable mouse models.
  • Hal Barron, Altos Labs CEO, distances the company from extreme longevity promises like living to 150. Domonius says Barron aims for reasonable goals, such as extending human health by a few years or preserving ovary function, which he considers revolutionary.
  • Extending lifespan raises economic and philosophical issues. Social Security isn't designed for people living to 110. Domonius says if people are healthier longer, they might remain active and fill roles amid population decline, reducing pressure on young caregivers.
  • Hal Barron stated we already know how to reverse aging through diet, exercise, sleep, and sociability. Domonius points to GLP-1 drugs as evidence that behavioral solutions are insufficient, arguing accessible medical interventions could transform health for those in food deserts or under stress.

Health (1)

  • The realistic expectation for cellular rejuvenation is treating specific diseases like glaucoma within a decade, not dramatically expanding lifespan. Domonius argues the goal is curing disease to reduce suffering and extend healthy years, not achieve radical longevity.

Anthropic solved their compute problem by buying it from Elon?May 14

  • Theo argues Anthropic's Claude Opus 4.7 is not a meaningful improvement over prior models and comes with user experience regressions due to overly strict safety system prompts.
  • Figma's stock dropped 5% after a competitive announcement from Anthropic, contributing to an 85% decline since its IPO. Ben cites this as evidence of Anthropic's negative market impact.
  • Theo states OpenClaw's heartbeat function, which polls for tasks, costs him $4.31 daily without active use, extrapolating to roughly $120 monthly in wasted API spend.
  • Ben cites a GitHub issue from an AMD AI head showing a massive spike in Claude Code usage and cost at their company. Input tokens increased 170x and costs jumped from $26 to over $42,000 monthly after model updates.
  • Theo reveals Anthropic confirmed a routing error last year where 0.8% to 16% of Sonnet requests were sent to a dumber, 1M-context version, establishing a precedent for performance regressions via model versioning.
  • Theo and Ben argue Anthropic's engineering is incompetent, citing recent changes like a new tokenizer, a 5-minute cache TTL, and hidden reasoning data that complicate their stack across three hardware providers, leading to reliability and intelligence issues.
  • The hosts claim Anthropic employees use different, superior internal versions of Claude and its tools, creating a disconnect where employees don't experience the external product's failures and dismiss user complaints.
Also from this episode: (3)

AI & Tech (3)

  • Anthropic is aggressively banning third-party tools like T3 Code and OpenClaw that interface with Claude Code. Theo attributes this to a compute crisis and poor caching implementations that increase costs.
  • Theo's primary conspiracy is that Anthropic now forces all Claude Code users onto the dumber 1M-context model version to route traffic away from scarce Nvidia GPUs and onto partners like AWS and Google TPUs, explaining the performance drop.
  • Theo attributes Anthropic's problems to a research-first, safety-obsessed culture that devalues engineering and product reliability, creating opaque policies and a 'holier-than-thou' attitude that frustrates developers.