Price:

AI & TECH

Anthropic warns Alibaba cloned US AI models

Friday, June 26, 2026 · from 3 podcasts, 4 episodes
  • Anthropic accuses Alibaba of harvesting 29M model outputs to clone US frontier AI, calling it a national security threat.
  • OpenAI’s lead has evaporated as Google and Anthropic match or surpass GPT-4 in speed and reasoning.
  • Foundation models risk becoming low-margin utilities, with value shifting to apps and industry-specific use cases.

The AI arms race just entered a new phase - one defined not by breakthroughs, but by cloning, cost, and control. Six weeks after OpenAI declared a 'code red' to stabilize ChatGPT, the competitive landscape has fractured. Google’s Gemini 3 and Anthropic’s Claude Opus 4.5 have erased OpenAI’s performance edge, forcing a retreat from experimental agents to core product defense.

The real threat may not be competition, but replication. On June 25, Nathaniel Whittemore reported that Anthropic filed a formal complaint with the Senate Banking Committee, accusing Alibaba of a massive 'distillation attack.' Using 25,000 fake accounts, Alibaba allegedly accessed Anthropic’s models nearly 30 million times in two months - harvesting outputs and reasoning traces to shortcut its own R&D.

"This is industrial-scale harvesting. By recording the outputs and reasoning traces of Claude, Alibaba can train its own models for a fraction of the R&D cost."

- Nathaniel Whittemore, The AI Daily Brief

The claim reframes data scraping as economic warfare. While Alibaba denies wrongdoing, the Pentagon has already designated it an affiliate of the Chinese military. Bipartisan legislation is now advancing to blacklist Chinese labs caught distilling US models - a move that could reshape global AI trade.

Meanwhile, the foundation model business model is cracking. Benedict Evans on the a16z Show argues that frontier models are becoming commoditized plumbing, like cellular networks. Despite $700B in annual capex from tech giants, history suggests the value flows to apps - not infrastructure. Without unique network effects, labs face margin collapse as models converge on capability.

OpenAI is responding by building vertically. It unveiled 'Jalapeño,' a custom ASIC developed with Broadcom, cutting inference costs. But even with AI-designed chips and 800M weekly users, OpenAI faces a cash crunch: trillion-dollar compute commitments depend on revenue that may never materialize if models become utilities.

"Foundation models are not products. They are building blocks, and history is unkind to the people who build the plumbing."

- Benedict Evans, The a16z Show

The pivot is clear: from platform dominance to cost control and vertical specialization. Anthropic bets on enterprise trust, baking a 'soul doc' into Claude’s weights for consistent, humane interactions. Google leverages ad-scale distribution. And China advances - GLM 5.2, an open-weight model, stunned Vercel’s CEO with its website design skill, proving sovereign alternatives can match US performance.

The next breakthrough won’t come from better models - they’re already good enough. It will come from who best understands law, medicine, or finance, and embeds that knowledge into the workflow. The AI race is no longer about who computes first. It’s about who owns the next layer.

Source Intelligence

- Deep dive into what was said in the episodes

CEO-Led AI Gets 3X the ROIJun 25

  • Micron's Q1 earnings exceeded expectations with 445% year-over-year revenue growth and a 74% jump from last quarter, prompting a 22% hike in next quarter's revenue forecast.
  • The KPMG Quarterly Pulse Survey revealed that 76% of senior leaders report AI is driving meaningful business value, up from 64%, with organizations shifting from experimentation to adoption phases.
  • KPMG's survey indicated that organizations with clear CEO accountability for AI strategy were three times more likely to report established ROI and significantly higher confidence in AI value.
  • Only about one-third of organizations have full visibility and active monitoring of their AI operating costs, highlighting a major challenge for the shift to a token-efficiency era.
Also from this episode: (11)

Chips (2)

  • OpenAI unveiled its custom-designed chip, codenamed Jalapeño, produced in collaboration with Broadcom, positioning it as the first AI accelerator in a multi-generation compute platform.
  • Micron expects the memory market to be undersupplied for at least the next year, forecasting gross margins to expand to 86% in Q4, signaling a structural shift in memory demand driven by AI.

AI Infrastructure (1)

  • OpenAI President Greg Brockman stated that demand for compute from their customers is 'insatiable,' with Broadcom CEO Hock Tan concurring and forecasting elevated demand through at least 2028.

Models (6)

  • OpenAI upgraded its free GPT-5.5 Instant model, improving its ability to understand user intent, handle complex constraints, and provide more useful recommendations.
  • Nathaniel Whittemore reported that prediction market odds for Anthropic's Fable 5 return by July 1st skyrocketed from 15% to 63% on Wednesday, potentially due to insider knowledge.
  • Wired reported that the Trump administration, dissatisfied with Dario Amodei, is negotiating directly with Anthropic co-founder Tom Brown regarding Fable's reinstatement, with no timeline yet set.
  • Ashwin Goel and Andreessen Horowitz raised concerns that Anthropic's Claude Tag, while a convenient feature, could lead to significant vendor lock-in by deeply embedding organizational context.
  • Anthropic accused Alibaba of illicitly accessing its models 29 million times through 25,000 fraudulent accounts from April to June to distill Claude's capabilities, terming it the 'largest distillation attack ever detected.'
  • Senators Hagerty and King proposed a bipartisan bill to address AI model distillation, which if passed, would blacklist or sanction Chinese labs found distilling US AI models.

Regulation (1)

  • Alibaba sued the Department of Defense over its designation as an affiliate of the Chinese military, claiming no such ties and that the Pentagon acted unlawfully in blocking its business and lobbying activities.

Big Tech (1)

  • Google DeepMind continues to experience talent departures, with senior researchers Jonas Adler and Alexander Pritzel joining Anthropic, amidst a reported delay for Gemini 3.5 Pro to a July launch.

Why AI Users Are Raving About GLM 5.2Jun 22

  • The Economist reported that Senator Mark Warner claimed NSA Director General Joshua Rudd told him Mythos broke into almost all classified U.S. systems in hours, not weeks, on June 11, the same day Amazon reported the jailbreak that led to the Fable 5 ban.
  • Reporter Shashank Joshi clarified that the Mythos breach claim should not be taken literally, likely referring to a controlled test with caveats, not a real-world attack. Policy analyst Peter Weildford suggested more plausible scenarios, such as a red team exercise or Mythos being given prior access.
  • In a Saturday interview, President Trump stated he does not regard Anthropic or Dario Amodei as a current national security threat, does not want to shut the company down, and explicitly ruled out using the Defense Production Act to control AI.
  • Nobel laureate John Jumper left Google DeepMind for Anthropic, following the recent departure of VP Noam Shazeer to OpenAI, amid reports of plummeting morale and frustration over the lab's fall to third or fourth place in the AI race.
  • Leo at Synthwave reported DeepMind staff are demoralized by Z AI's GLM 5.2 overtaking Gemini 3.1 Pro and the lab's four-month gap without a flagship model release, with Gemini 3.5 Pro reportedly slated for June 30 and viewed internally as 'not the step change we need.'
  • Elon Musk debated the timeline for a Chinese model matching Mythos, predicting Q1 2026 for true usefulness, while Z AI's CEO suggested it would be sooner, and Box's Aaron Levie highlighted the strategic importance of open models reaching frontier performance.
Also from this episode: (5)

Models (5)

  • Analyst Andrew Curran reported a new, more capable version of Mythos has finished training, speculating it could be called Mythos 5.1 or 6, and noted that banning public models does not slow internal development.
  • Industry observers found evidence of an upcoming Claude Sonnet 5 release on an Anthropic partner provider, while GPT 5.6 appears in Codex and OpenAI's Codex lead hinted at major upcoming front-end capability improvements.
  • GLM 5.2 is being hailed as a 'DeepSeek R1 moment' for open models, with users like Vercel's Guillermo Rauch and Itamar Golan reporting it feels meaningfully close to frontier lab quality for coding and real tasks.
  • Design Arena's benchmark found GLM 5.2 beat Fable 5 at website design due to better starting templates, avoidance of common coding errors, and more intricate outputs, though it lagged in game dev and 3D design and produced 25% more code with double the generation time.
  • Theo notes GLM 5.2 is not cheap to run, as its high token usage makes it more expensive than Opus 48 and GPT-5.5 Medium, while Itamar Golan estimates proper local deployment requires eight H200 GPUs costing around $400k.

What’s Next for Consumer AI? | Josh Elman Joins a16zJun 23

Also from this episode: (15)

Other (15)

  • Benedict Evans argues that foundation models are not products but rather commodities, with the real value expected to emerge higher up the application stack.
  • Agentic coding has achieved significant product-market fit, leading to a massive focus within the tech industry and creating a supply crunch for AI capacity.
  • Determining the impact of agentic coding on engineering jobs and team structures is premature, as the market is in flux due to ongoing supply-demand imbalances and pricing issues.
  • Corporations are using LLMs for specific back-office automation, such as cash flow forecasting for commodities companies, a distinct application from general chatbot interactions.
  • The rapid growth in AI infrastructure spending, with mobile data traffic rising by 1,500 to 2,000 times since 2009-2010, parallels historical patterns of accelerating tech adoption.
  • Despite mobile networks generating around $1 trillion in revenue and spending $200 billion annually on capex, they captured little value; most accrued to applications built on top.
  • Foundation models lack the network effects of platforms like Windows or iOS, making their long-term ability to differentiate or exert pricing power uncertain.
  • Benedict Evans projects three to six companies will develop frontier models, investing $200 billion to $2 trillion annually, which suggests future commoditization.
  • The high capex of major tech companies (Google, Meta, Microsoft spending over 50% of revenue) is unsustainable long-term and indicates a future slowdown due to financial gravity.
  • Large tech firms face an existential FOMO, compelling heavy AI investment to avoid falling behind, even as CFOs question the long-term returns and sustainability of such spending.
  • Measuring AI's return on investment is currently challenging because many benefits, like improved analytics or customer support, are difficult to quantify financially.
  • AI enables automation, making previously impossible tasks affordable and unlocking new business models, similar to Spotify's impact on music distribution.
  • AI will significantly transform advertising and e-commerce by enabling systems to deeply understand products and consumer behavior, leading to improved recommendations and conversion rates.
  • The ultimate challenge for AI is expanding beyond software development productivity into the broader economy, finding diverse applications for general users and other industries.
  • In 20 years, AI will likely be perceived as commonplace "magic," similar to how current computing and mobile technologies are seamlessly integrated and taken for granted.
Hard Fork
Hard Fork

Casey Newton

‘Hard Fork’ Live, Part 3: Differing Visions of an A.I. FutureJun 19

  • Kevin Roose says OpenAI declared a 'code red' emergency due to worrying trends in ChatGPT usage, prompting a reallocation of engineering resources to core product improvements and a delay of side projects like AI agents and the Pulse digest feature.
  • Roose and Casey Newton identify Google's Gemini 3 and Anthropic's Claude Opus 4.5 as the competitive catalysts for OpenAI's crisis. They argue these rival models challenge OpenAI's core pillars and threaten its subscription revenue moat.
  • Newton states OpenAI is massively leveraged with trillion-dollar spending commitments dependent on future revenue, and its product organization is unfocused, exemplified by the Sora video generator departure.
  • Roose notes a belief within AI circles that OpenAI has not had a successful pre-training run recently, a harder and more expensive problem to fix than post-training, indicating a fundamental research challenge.
  • Newton says OpenAI is training new models codenamed Garlic and Shallot, with internal optimism they could restore or advance the state-of-the-art frontier.
  • Kevin Roose finds Gemini 3 faster than competitors, a critical factor for frequent usage, and notes Google's potential to subsidize and drive down costs for AI services given its massive revenue base.
  • Casey Newton reports Google claims 650 million monthly Gemini users, while OpenAI reports over 800 million weekly ChatGPT users. Newton questions the intent behind Google's bundled distribution numbers.
  • Both hosts praise Claude Opus 4.5 for superior style transfer in writing and a consistent, humane interaction model. Newton says it passed a key test by writing in his stylistic voice, while Roose appreciates its empathetic but non-sycophantic tone.
  • Newton argues Anthropic's enterprise-first business model and aversion to ads or engagement optimization gives Claude a different, potentially more stable trajectory compared to OpenAI and Google's consumer-focused models.
  • Newton cites Anthropic's revenue growth, expecting $9 billion in annualized revenue by year's end, up from under $1 billion at the start, primarily from enterprise API sales, which directly competes with OpenAI's projected $20 billion.
  • The hosts discuss the 'soul document' discovered in Claude's weights, a biographical text about the model confirmed by Anthropic's Amanda Askell, reflecting the company's philosophical preparation for potential AI consciousness.
  • Newton highlights key AI industry departures: Yann LeCun leaving Meta to start a world models startup skeptical of LLM-based AGI, and John Giannandrea stepping down as Apple's AI head, which Newton interprets as Apple effectively giving up on in-house AI development.
  • Kevin Roose estimates AI tools have saved him a year of research time on his book project, representing a fundamental shift in creative workflows he would not abandon.
  • Casey Newton describes the 'California view' of AI (what it can do) versus the 'New York view' (what it can't do), advocating for a default perspective grounded in firsthand use to appreciate the technology's real impact.
  • The hosts review AI slop examples: fake AI ads for a non-existent Buckingham Palace Christmas market misleading tourists, AI-generated cooking recipes harming food blogger traffic, and Whirlpool using a deepfaked politician's voice in a Brazilian ad that won and later forfeited Cannes Lions awards.
  • Newton distinguishes between harmful slop that displaces human creators and benign or satirical slop, like viral AI-generated videos for a non-existent 'Bird Game 3', which he views as clever parody of sequels.