We Taught AI to Play Games—Now It’s a $3.6 Million Company
AI and I16 Loka 2025

We Taught AI to Play Games—Now It’s a $3.6 Million Company

This episode is a little different from our usual fare: It’s a conversation with our head of AI training Alex Duffy about Good Start Labs, a company he incubated inside Every. Today, Good Start Labs is spinning out of Every as a separate company with $3.6 million in funding from General Catalyst, Inovia, Every, and a group of angel investors from top-tier AI labs like DeepMind. We get into how Alex learned some of his biggest lessons about the real world from games, starting with RuneScape, which taught him how markets work and how not to get scammed. He explains why the static benchmarks we use to evaluate LLMs today are breaking down, and how games like Diplomacy offer a richer, more dynamic way to test and train large language models. Finally, Alex shares where he sees the most promise in AI—software, life sciences, and education—and why he believes games can make the models we use smarter, while helping people understand and use AI more effectively.

If you found this episode interesting, please like, subscribe, comment, and share.


Want even more?

Sign up for Every to unlock our ultimate guide to prompting ChatGPT here: https://every.ck.page/ultimate-guide-to-prompting-chatgpt. It’s usually only for paying subscribers, but you can get it here for free.


To hear more from Dan Shipper:


Timestamps

00:00:00 - Start

00:01:48 - Introduction

00:04:14 - Why evals and benchmarks are broken

00:07:13 - The sneakiest LLMs in the market

00:13:00 - A competition that turns prompting into a sport

00:15:49 - Building a business around using games to make AI better

00:22:39 - Can language models learn how to be funny

00:25:31 - Why games are a great way to evaluate and train new models

00:26:58 - What child psychology tells us about games and AI

00:30:10 - Using games to unlock continual learning in AI

00:36:42 - Why Alex cares deeply about games

00:44:37 - Where Alex sees the most promise in AI

00:50:54 - Rethinking how young people start their careers in the age of AI


Links to resources mentioned in the episode:

Jaksot(98)

How We Built 'Claudie,' Our AI Project Manager (Full Walkthrough)

How We Built 'Claudie,' Our AI Project Manager (Full Walkthrough)

A few weeks ago, Natalia Quintero wouldn’t have called herself technical. But since the beginning of January, she has woken up at 6 a.m. to vibe code with Claude. The AI project manager she built save...

4 Helmi 47min

How Andrew Wilkinson Uses Opus 4.5 in His Work and Life

How Andrew Wilkinson Uses Opus 4.5 in His Work and Life

Entrepreneur Andrew Wilkinson used to sleep nine hours a night. Now he wakes up at 4 a.m. and goes straight to work—because he can’t wait to keep building with Anthropic’s latest model, Opus 4.5.Two y...

21 Tammi 1h 2min

Why Your AI Learning Projects Keep Fizzling Out

Why Your AI Learning Projects Keep Fizzling Out

LLMs have made it absurdly easy to go deep on almost any topic. So why haven’t we all used ChatGPT to earn college degrees we wished we had majored in or pursued a niche interest, like learning how to...

14 Tammi 55min

Vibe Check: Claude Cowork Is Claude Code for the Rest of Us

Vibe Check: Claude Cowork Is Claude Code for the Rest of Us

Anthropic just dropped Claude Cowork—essentially Claude Code for everyone, not just engineers—and we got to chat about it with a product engineer at Anthropic who helped build it.In this live Vibe Che...

13 Tammi 1h 32min

AI in 2026: Reid Hoffman’s Predictions on Agents, Work, and Creation

AI in 2026: Reid Hoffman’s Predictions on Agents, Work, and Creation

From cofounding LinkedIn to backing OpenAI early, Reid Hoffman is in the habit of being right about the future, so we wanted to know what he saw coming in 2026.In his third appearance on AI & I, Hoffm...

7 Tammi 59min

Four Predictions for How AI Will Change Software in 2026

Four Predictions for How AI Will Change Software in 2026

Tomorrow is the first day of 2026, and to give our listeners a view of the trends that’ll shape the year ahead, Dan Shipper had Every COO Brandon Gell on AI & I to discuss their predictions for what’s...

31 Joulu 202537min

Best of the Pod: Reid Hoffman on How AI Is Answering Our Biggest Questions

Best of the Pod: Reid Hoffman on How AI Is Answering Our Biggest Questions

Learn how to use philosophy to run your business more effectively. Reid Hoffman thinks a masters in philosophy will help you run your business better than an MBA. Reid is a founder, investor, podcaste...

24 Joulu 20251h 1min

Attaining A Jhana Live: How Anyone Can Achieve Super Wellbeing

Attaining A Jhana Live: How Anyone Can Achieve Super Wellbeing

We recorded someone guide himself into a Jhana live on our podcast. And he narrated the whole process from start to finish.Jhanas are meditative bliss states and they traditionally require thousands o...

17 Joulu 20251h 15min