We Taught AI to Play Games—Now It’s a $3.6 Million Company
AI and I16 Loka 2025

We Taught AI to Play Games—Now It’s a $3.6 Million Company

This episode is a little different from our usual fare: It’s a conversation with our head of AI training Alex Duffy about Good Start Labs, a company he incubated inside Every. Today, Good Start Labs is spinning out of Every as a separate company with $3.6 million in funding from General Catalyst, Inovia, Every, and a group of angel investors from top-tier AI labs like DeepMind. We get into how Alex learned some of his biggest lessons about the real world from games, starting with RuneScape, which taught him how markets work and how not to get scammed. He explains why the static benchmarks we use to evaluate LLMs today are breaking down, and how games like Diplomacy offer a richer, more dynamic way to test and train large language models. Finally, Alex shares where he sees the most promise in AI—software, life sciences, and education—and why he believes games can make the models we use smarter, while helping people understand and use AI more effectively.

If you found this episode interesting, please like, subscribe, comment, and share.


Want even more?

Sign up for Every to unlock our ultimate guide to prompting ChatGPT here: https://every.ck.page/ultimate-guide-to-prompting-chatgpt. It’s usually only for paying subscribers, but you can get it here for free.


To hear more from Dan Shipper:


Timestamps

00:00:00 - Start

00:01:48 - Introduction

00:04:14 - Why evals and benchmarks are broken

00:07:13 - The sneakiest LLMs in the market

00:13:00 - A competition that turns prompting into a sport

00:15:49 - Building a business around using games to make AI better

00:22:39 - Can language models learn how to be funny

00:25:31 - Why games are a great way to evaluate and train new models

00:26:58 - What child psychology tells us about games and AI

00:30:10 - Using games to unlock continual learning in AI

00:36:42 - Why Alex cares deeply about games

00:44:37 - Where Alex sees the most promise in AI

00:50:54 - Rethinking how young people start their careers in the age of AI


Links to resources mentioned in the episode:

Jaksot(98)

Best of the Pod: How to Prepare for AGI According to Reid Hoffman

Best of the Pod: How to Prepare for AGI According to Reid Hoffman

AGI is coming. Reid Hoffman just wrote the book on how to prepare.According to Reid, every major tech breakthrough (the written word, the printing press, the telephone) triggered mass fear. But, contr...

27 Elo 20251h 10min

Best of the Pod: She Built an AI Product Manager Bringing in Six Figures—As A Side Hustle

Best of the Pod: She Built an AI Product Manager Bringing in Six Figures—As A Side Hustle

**Automate 80% of your repetitive writing, thinking, and creative tasks****Try Spiral made by Dan Shipper & Every: https://spiral.computer?utm_source=youtube**Claire Vo built ChatPRD—an on-demand chie...

20 Elo 20251h 6min

Best of the Pod: Vercel's Guillermo Rauch on AI and the Future of Coding

Best of the Pod: Vercel's Guillermo Rauch on AI and the Future of Coding

Read Dan Shipper's essay on the allocation economy: https://every.to/chain-of-thought/the-knowledge-economy-is-over-welcome-to-the-allocation-economyGuillermo Rauch is one of the most prolific coders ...

13 Elo 202558min

Best of the Pod: Dwarkesh Patel’s Quest to Learn Everything

Best of the Pod: Dwarkesh Patel’s Quest to Learn Everything

Dwarkesh Patel is on a quest to know everything. He’s using LLMs to enhance how he reads, learns, thinks, and conducts interviews. Dwarkesh is a podcaster who’s interviewed a wide range of people, lik...

30 Heinä 202550min

Intentional Tech: Designing AI for Human Flourishing | Alex Komoroske, Cofounder and CEO of Common Tools

Intentional Tech: Designing AI for Human Flourishing | Alex Komoroske, Cofounder and CEO of Common Tools

The smallest technical decisions become humanity's biggest pivots:The same-origin policy—a well-intentioned browser security rule from the 1990s—accidentally created Facebook, Google, and every data m...

9 Heinä 20251h 11min

Arc Had Millions of Users. Why They Left It Behind for Dia. | Josh Miller and Hursh Agrawal, cofounders of The Browser Company

Arc Had Millions of Users. Why They Left It Behind for Dia. | Josh Miller and Hursh Agrawal, cofounders of The Browser Company

If you had millions of people using a product you spent years building, would you kill it?That’s exactly what The Browser Company did with Arc.The internet backlash was intense, but cofounders Josh Mi...

2 Heinä 20251h 24min

How We Built Our AI Email Assistant: A Behind-the-Scenes Look at Cora

How We Built Our AI Email Assistant: A Behind-the-Scenes Look at Cora

You don’t need to handle your inbox anymore. It’s Cora’s job now. Cora is the AI chief of staff we built for your email at Every. It’s been in private beta for the last 6 months and currently manages ...

26 Kesä 202546min

Inside OpenAI: Coaching the People Creating AGI | Joe Hudson, Founder of The Art of Accomplishment

Inside OpenAI: Coaching the People Creating AGI | Joe Hudson, Founder of The Art of Accomplishment

Joe Hudson is a coach who works with the executives building AGI at OpenAI. From inside OpenAI, he witnesses the full spectrum of human emotion that comes with bringing something new into the world—th...

18 Kesä 202553min