BrowseComp vs The Bots that Bluff

GDPval-AA & the AI Hunger Games for Your Job

Is AI just good at trivia, or can it actually take your job? In this episode, host Emily Laird breaks down GDPval-AA, the benchmark pitting models against humans across 1,320 real world tasks, scored ...

16 Feb 9min

Claude Opus 4.6

Host Emily Laird cracks open Claude Opus 4.6, Anthropic’s Feb 5, 2026 release that feels less like a chatbot and more like a full-time coworker who never blinks. This episode breaks down what “agentic...

12 Feb 11min

OpenAI’s Frontier & the Rise of Digital Middle Management

Host Emily Laird breaks down Frontier, OpenAI’s agent management platform that’s less about Skynet and more about spreadsheets. This isn’t AI with feelings, it’s AI filing TPS reports… with supervisio...

11 Feb 9min

What is Prompt Injection?

What do Renaissance poets, Reddit trolls, and your company’s chatbot have in common? They’re all vulnerable to prompt injection. Host Emily Laird breaks down how language alone can hijack your AI syst...

10 Feb 8min

Claude in Excel: Because AI's a Freak in the Sheets Too... Yikes.

Host Emily Laird pulls back the pivot table on Claude in Excel, the AI quietly rewriting how we do budgets, audits, and corporate CYA. This isn’t Clippy’s grandkid. It’s a junior analyst with zero ego...

9 Feb 10min

What is Moltbook?

Host Emily Laird peels back the digital curtain on Moltbook, the AI-only social network where bots quote Camus, roleplay Cold War diplomats, and occasionally spark security breaches with the elegance ...

5 Feb 8min

What is Claude's Constitution?

Host Emily Laird cracks open the eerily polite brain of Claude, Anthropic’s AI, and its freshly published constitution. Forget rules of engagement, this is a machine with moral homework. From jailbrea...

4 Feb 7min