What happened with ai's scaling masks a learning crisis?

AI learns a million times slower than humans but burns through data and cash to hide it.

What happened with ai's scaling masks a learning crisis?

Human experts secretly train models task by task, not general intelligence.

What happened with ai's scaling masks a learning crisis?

The real bottleneck isn’t compute - it’s learning efficiency.

AI & TECH

AI's scaling masks a learning crisis

Tuesday, June 23, 2026 · from 2 podcasts

This Week in Startups Dwarkesh Podcast

AI learns a million times slower than humans but burns through data and cash to hide it.
Human experts secretly train models task by task, not general intelligence.
The real bottleneck isn’t compute - it’s learning efficiency.

AI isn’t getting smarter. It’s just getting hungrier. While labs pour billions into larger models and more data, a quiet consensus is forming: progress is built on brute force, not breakthroughs.

Dwarkesh Patel on the Dwarkesh Podcast laid it bare. A child learns language from roughly 200 million words. GPT-4 was trained on 100 trillion. That millionfold gap isn’t noise - it’s the signal. We haven’t cracked efficient learning. We’ve just scaled the waste.

Scaling laws offer little comfort. Even under optimistic Chinchilla models, bigger parameters only reduce data needs tenfold. Humans learn complex skills from one or two examples. AI runs thousands of rollouts per task using GRPO, a reinforcement learning method that simulates trial and error at massive scale.

The industry’s dirty secret? Human experts are doing the real work. Companies like Scale AI and Surge pay lawyers, consultants, and specialists to generate labeled data - thousands of examples for every narrow task. This isn’t emergent intelligence. It’s hand-stitched automation.

"The intelligence we see is a collection of expert trajectories congealed in an RL environment."
- Dwarkesh Patel, Dwarkesh Podcast

This synthetic data generation is why open-source models catch up so fast. They distill outputs from public APIs, effectively reverse-engineering the human-labeled training sets. The moat isn’t architecture or code - it’s access to expensive, expert-generated data.

And the money is real. Patel estimates the labeling industry already generates billions in revenue and is headed toward decabillions. Labs don’t need efficient AI to automate white-collar work. They can amortize massive training costs across billions of inferences. Brute force wins in the market, even if it fails in the lab.

Alice Zhang of Verge Labs offers a different path. Her team built a dataset from over 12,000 human brains - the largest directly grounded in patient tissue. For neuroscience, where you can’t biopsy living brains, this is the LiDAR that anchors inference from blood samples.

"We learned predicting patient response is more valuable than just discovering a drug."
- Alice Zhang, This Week in Startups

Zhang’s moat isn’t bigger models. It’s a decade-long pipeline of biological truth. Where others scale data, she grounds it. Her multimodal transformer fuses blood markers with molecular autopsies, predicting drug efficacy before symptoms appear.

The divergence is stark. One path doubles down on data volume. The other bets on data quality. One hires armies of experts to simulate reasoning. The other builds infrastructure to capture ground truth.

The real test comes next. Can AI automate its own research? If not, the data black hole will keep widening - and the cost of pretending will only grow.

2 Sources:

This Week in Startups Dwarkesh Podcast

#AI & Tech #Science #Models

Dwarkesh Patel

Source Intelligence

- Deep dive into what was said in the episodes

This Week in Startups

The hottest running app has nothing to do with speed | E2303 • Jun 22

Alice Zhang says Verge Labs built one of the field's largest brain datasets directly from patients, comprising over 12,000 human brains from 6,000 patients.
Alice Zhang states Verge Labs learned a critical lesson from its own drug development experience: predicting patient response is more valuable than just discovering a drug.
Alice Zhang says Verge Labs uses a multimodal transformer-based world model, representing each patient as a 512-dimensional vector, to fuse data and infer missing information like brain activity from blood alone.
Alice Zhang cites a partnership with Eli Lilly where 83% of AI-derived targets validated in wet lab experiments, far exceeding Lilly's 20% expectation.
Alice Zhang argues drug development costs $5 billion on average primarily due to nine out of ten failures at the expensive last stage, implying AI-driven efficiency could dramatically lower prices.
Alice Zhang claims Verge Labs' moat is its decade-long infrastructure for collecting brain tissue data, a difficult task others avoid, not just model scaling.

Also from this episode: (8)

Startups (6)

Louie Phillips says Interval is a gamified running app where users run around the block to claim territory on a live global map.
Louie Phillips states Interval intentionally excludes speed metrics, focusing instead on volume-based competition to allow anyone, including a slow-moving neighbor, to capture territory.
Louie Phillips describes future features like arenas for speed-based competition with daily leaderboards and volume-based leaderboards for regular territory capture.
Louie Phillips says Interval has grown to about a million downloads and about 100,000 followers on Instagram through organic social media content without paid media.
Louie Phillips says Interval's paid ad funnel on Meta provides predictable growth, with a cost per trial start of about $12 and an average customer lifetime of about 17 months.
Jason Calacanis describes Agree's average time from contract sent to signature as under 7 hours and median time from invoice sent to paid as 36 hours.

Business (2)

Jason Calacanis notes Blackstone purchased Hamilton Island for 1.2 billion Australian dollars, roughly $804 million US.
Alice Zhang details Verge Labs' revenue models: custom partnerships, direct insights licensing, and platform-as-a-service formats.

#Models #Health #Startups #Macro #Biology

Dwarkesh Podcast

The data black hole at the center of AI • Jun 19

Dwarkesh argues intelligence is defined by sample efficiency. AI progress has primarily come from widening data distributions and increasing compute, not improving sample efficiency.
Dwarkesh describes reinforcement learning as synthetic data generation. It uses compute against a verifier to find correct rollouts, which the model learns to predict.
Dwarkesh asserts models need immense amounts of highly specific expert data to learn. He cites tasks like converting Word documents and writing legal filings.
Dwarkesh estimates the expert data labeling industry generates billions in revenue and will reach decabillions.
Dwarkesh claims models perform thousands of rollouts per task with GRPO. This contrasts with humans practicing a problem only once or twice.
Dwarkesh argues AI labs can automate white-collar work despite inefficiency. AI can absorb gigawatts of training and amortize skills across billions of sessions.
Dwarkesh predicts demand for human software engineers will increase in 2027 due to AI's complementary role, despite automation expectations.
Dwarkesh outlines AI labs' plan to use automated AI researchers to solve the remaining sample efficiency problem.

Also from this episode: (5)

Models (3)

Dwarkesh states frontier models are trained on tens to hundreds of trillions of tokens. Humans see about 200 million tokens from birth to adulthood.
Dwarkesh counters Karpathi's evolution pre-training argument. He states the human genome is 3 GB with only 1-2% protein coding.
Dwarkesh says scaling laws cannot solve AI's sample inefficiency. Increasing parameters infinitely only reduces required data by a factor of 10.

Robotics (1)

Dwarkesh notes AI robotics requires millions of hours of demonstrations but still fails at complex open-ended tasks, while humans learn robotic operation within hours.

AI Infrastructure (1)

Dwarkesh compares human learning to driving with 20 hours of practice. Tesla and Whimo use three to four orders of magnitude more data.

#Models #AI Infrastructure #Big Tech #Startups

AI's scaling masks a learning crisis

Source Intelligence

Related Stories