UPDATED JUNE 15, 2026
UPDATED JUNE 15, 2026

The Frontier

Your signal. Your price.

Include
Lookback
||
  • · 10d ago

    A Stanford study found law professors preferred AI-generated legal answers over human-written ones in 75% of cases, with Google's Gemini 2.5 Pro winning 175.9% of its matchups against instructors.

  • · 12d ago

    Theo argues the SWE-Bench Pro benchmark is flawed because it uses contaminated data and outdated prompts, resulting in unrealistic scores like Gemini 1.5 Pro at 46% and Claude Sonnet 3.5 at 54%.

  • · 13d ago

    Gumo Roush observes that for most industrial tasks, Gemini models offer the best performance-cost combination, but frontier models from OpenAI and Anthropic dominate coding and cutting-edge work.

  • · 14d ago

    Swihart argues the perception of Zcash as the 'compliant' privacy coin is memetic, not regulatory. Exchanges like Gemini support shielded withdrawals, proving compliance is possible with engineering work.

  • · 15d ago

    Evans suggests models may become low-margin commodity infrastructure, with value accruing to application-layer companies. He notes the lack of clear network effects or radical differentiation between major models like Gemini and ChatGPT for average users.

  • · 15d ago

    Evans argues distribution becomes a critical moat when products are commodities, citing Meta's AI integration across services. He notes Meta's AI usage was competitive with ChatGPT and Gemini before recent launches, despite being written off in tech circles.

  • · 17d ago

    A host notes that while open models see some use, frontier intelligence models dominate for coding. Gemini models excel at industrial tasks like support and browser automation.

  • · 17d ago

    The CFTC and Gemini jointly filed to reverse a $5 million settlement from January 2025. The CFTC admitted its original complaint was based on a non-credible whistleblower, calling Gemini a fraud victim and citing improper personnel influence.

  • · 17d ago

    Bennett speculates the CFTC's reversal may be linked to Gemini's pivot into prediction markets, a sector the CFTC is aggressively pursuing to regulate over state gaming authorities.

  • · 18d ago

    Theo argues Google is not a serious company, pointing to a year-plus period of no notable frontier releases from its AI labs since Gemini 1.5 Flash, which he describes as a disaster.

  • · 18d ago

    Ben reveals a private software engineering benchmark showing GPT-4o and Claude 3.5 Opus leading, with a steep drop to Sonnet 3.6 and Gemini 1.5 Flash, and a final cliff to Gemini 1.0 Pro at 10% performance.

  • · 18d ago

    Ben asserts Google's models fail at reasoning, citing their tendency to get stuck in loops or berate themselves in traces, and posits that adding reasoning was the moment Gemini fell apart competitively.

  • · 19d ago

    Saager claims the Trump administration supports prediction markets because Trump and his allies financially benefit; he cites a NYT piece linking CFTC approvals to Trump-linked entities like Gemini and 1789 Capital.

  • · 20d ago

    Milan states Google's Gemini 3.5 Flash is a top closed-source model because it is very fast and cheaper, while Opus 4.7 is expensive and slower.

  • · 21d ago

    The Pentagon released over 50 declassified UFO videos per Trump's directive, including audio from Gemini 7 astronauts, but experts say none prove alien life, only unexplained phenomena.

  • · 22d ago

    Google reported a 40% quarter-over-quarter surge in paid enterprise Gemini customers. The company's infrastructure now processes 16 billion tokens per minute, a 60% increase from the previous quarter.

  • · 23d ago

    Meta released Muse Spark, its first natively multimodal reasoning model designed primarily for personal agents. The model scored 86.4 on CharViC's reasoning benchmark, beating Gemini 3.1 Pro by six points.

  • · 23d ago

    Google introduced notebooks in Gemini, a feature consolidating resource management across its products. Josh Woodward described it as building a 'second brain' by integrating Notebook LM's capabilities.

  • · 24d ago

    Pichai reports internal Google usage of Gemini models has doubled every week, a growth pattern he describes as unprecedented, accelerating their ability to hill climb and improve the models.

  • · 24d ago

    Google's AI infrastructure now processes 3.2 quadrillion tokens monthly, a 7X jump from 480 trillion last year. Gemini has over 900 million monthly users, and AI Overviews has 2.5 billion monthly users.

  • · 24d ago

    Gemini Omni is Google's new multimodal AI family capable of generating videos from text, photos, videos, and audio. Dave notes that Google DeepMind is the only remaining American frontier lab seriously pursuing video as a modality.

  • · 24d ago

    Gemini 3.5 Flash is Google's new high-throughput model, four times faster than other frontier models in output tokens per second. Alex Weizenner argues it is solidly mid-tier in raw capability compared to GPT 5.5 High.

  • · 24d ago

    Google's new universal cart aggregates products from YouTube, search, Gemini, and Gmail across merchants like Nike and Target. Alex Weizenner sees this as Google's attempt to compete with Amazon in retail e-commerce.

  • · 24d ago

    Google's Audio Glasses, launching this fall with Samsung and eyewear partners, provide all-day Gemini assistance via private audio without a display. Dave notes this will force a societal rift over pervasive recording.

  • · 24d ago

    Google launched the $2 million Build with Gemini XPRIZE hackathon to solve real-world problems impacting at least 100,000 people. Peter Diamandis says the goal is teaching entrepreneurship over seeking traditional jobs.

  • · 25d ago

    Google plans Gemini Spark, an always-on personal AI agent leveraging user context from apps and logged-in websites.

  • · 25d ago

    Gemini 3.2 Flash reportedly achieves 92% of GPT-5.5 performance on coding tasks with 15-20x cheaper inference.

  • · 29d ago

    The quarter saw rapid frontier model releases: GPT-5.2 Codex, Genie 3, Opus 4.6, GPT-5.3 Codex, Sonnet 4.6, Gemini 3.1 Pro, Nano Banana 2, and GPT-5.4, with no single benchmark winner across common tests.

  • · 29d ago

    Alex notes that AI models like Grok, Gemini, ChatGPT, and Claude, when asked about the released UAP data, generally conclude they are normal phenomena or secret U.S. missions, not alien.

End of 30-day results — 29 results
29 results