BrowseComp vs The Bots that Bluff

The AI Future's Project Part 3: 2027

AI’s gone full sci-fi and it’s only January. In this episode, host Emily Laird walks us through the first half of 2027 in the AI Futures Project, where fictional megacorp OpenBrain builds Agent-2 (thi...

10 Apr 20256min

The AI Future's Project Part 2: 2026

It’s 2026 and AI isn’t just helping out, it’s taking over the whiteboard and your white-collar job. In this episode, host Emily Laird breaks down the AI Futures Project’s bold predictions for 2026: Op...

9 Apr 20256min

The AI Future's Project Part 1

AI agents have officially stopped playing assistant and started acting like your over-caffeinated junior dev—occasionally brilliant, mostly chaotic, and somehow costing $500 a month. In this episode, ...

8 Apr 20258min

AGI is Coming. There, I Said It.

AI’s not just playing Jeopardy anymore, it’s coming for the Mensa crowd. In this episode, host Emily Laird spirals (productively) over Humanity’s Last Exam, a monster test built to measure whether AI ...

7 Apr 20257min

Google's Gemini 2.5

Google’s Gemini 2.5 just set a new standard on Humanity’s Last Exam and flexed hard in the AI Fight Club, outpacing GPT-4.5 and Grok-3. It reasons, codes, remembers a million tokens of context, and ha...

1 Apr 20257min

Humanity's Last Exam

Standardized tests are getting torched like marshmallows at a bonfire, and AI's top students are flunking the new final. In this episode, Emily unpacks Humanity’s Last Exam, a brutal, brain-bending te...

31 Mar 20257min

a16z Top Gen AI Consumer Apps Report: Big Money & Big Scams

Mobile AI apps are raking in cash—or at least convincing you that your fern obsession is worth $20 a month. This episode dissects the Andreessen Horowitz Top Gen AI Apps report, revealing why populari...

20 Mar 20256min

a16z Top Gen AI Consumer Apps Report: The VibeCoder Revolution

Forget dark-mode IDEs and midnight debugging crises—AI tools are ushering in a vibecoder revolution. This episode unpacks the surge in agentic IDEs like Cursor (think Jarvis meets Clippy, but actually...

19 Mar 20256min