We Taught AI to Play Games—Now It’s a $3.6 Million Company
AI and I16 Okt 2025

We Taught AI to Play Games—Now It’s a $3.6 Million Company

This episode is a little different from our usual fare: It’s a conversation with our head of AI training Alex Duffy about Good Start Labs, a company he incubated inside Every. Today, Good Start Labs is spinning out of Every as a separate company with $3.6 million in funding from General Catalyst, Inovia, Every, and a group of angel investors from top-tier AI labs like DeepMind. We get into how Alex learned some of his biggest lessons about the real world from games, starting with RuneScape, which taught him how markets work and how not to get scammed. He explains why the static benchmarks we use to evaluate LLMs today are breaking down, and how games like Diplomacy offer a richer, more dynamic way to test and train large language models. Finally, Alex shares where he sees the most promise in AI—software, life sciences, and education—and why he believes games can make the models we use smarter, while helping people understand and use AI more effectively.

If you found this episode interesting, please like, subscribe, comment, and share.


Want even more?

Sign up for Every to unlock our ultimate guide to prompting ChatGPT here: https://every.ck.page/ultimate-guide-to-prompting-chatgpt. It’s usually only for paying subscribers, but you can get it here for free.


To hear more from Dan Shipper:


Timestamps

00:00:00 - Start

00:01:48 - Introduction

00:04:14 - Why evals and benchmarks are broken

00:07:13 - The sneakiest LLMs in the market

00:13:00 - A competition that turns prompting into a sport

00:15:49 - Building a business around using games to make AI better

00:22:39 - Can language models learn how to be funny

00:25:31 - Why games are a great way to evaluate and train new models

00:26:58 - What child psychology tells us about games and AI

00:30:10 - Using games to unlock continual learning in AI

00:36:42 - Why Alex cares deeply about games

00:44:37 - Where Alex sees the most promise in AI

00:50:54 - Rethinking how young people start their careers in the age of AI


Links to resources mentioned in the episode:

Avsnitt(98)

 Spiral: Designing an AI Ghostwriter With Taste

Spiral: Designing an AI Ghostwriter With Taste

Good writing has always been downstream of good thinking. The average language model can help you write faster—but can it help you think better?Danny Aziz wrestled with this question while building th...

22 Okt 20251h 7min

Box CEO Aaron Levie on Why AI Agents Won’t Take Your Job

Box CEO Aaron Levie on Why AI Agents Won’t Take Your Job

Aaron Levie is AI-pilled, but he’s one of the few CEOs who sees a future where AI agents work for us, instead of replacing us—helping us to do more than we could before.Aaron’s been the CEO of Box for...

8 Okt 202552min

MCP Servers: Teaching AI to Use the Internet Like Humans

MCP Servers: Teaching AI to Use the Internet Like Humans

If your MCP server has dozens of tools, it’s probably built wrong.You need tools that are specific and clear for each use case—but you also can’t have too many. This creates an almost impossible trade...

1 Okt 202551min

Cognition’s CEO on What Comes After Code

Cognition’s CEO on What Comes After Code

The future has a way of showing up early to some places. In software engineering, one of those places is Cognition—the startup that made headlines in early 2024 with Devin, the world’s first autonomou...

24 Sep 202553min

One Developer Got Thousands of Users Before His App Launched

One Developer Got Thousands of Users Before His App Launched

Naveen Naidu built an app that found product-market fit backwards.Most apps launch first and then try to find users. Monologue, Naveen’s AI voice dictation app that came out of beta yesterday, did the...

17 Sep 202557min

Claude Code Can Be Your Second Brain

Claude Code Can Be Your Second Brain

Noah Brier uses Claude Code as his second brain—it’s the coolest notetaking setup we’ve ever seen.He has Claude running on a server in his basement hooked up to a VPN. It stores, reads, and writes to ...

10 Sep 20251h 11min

This AI Makes a Video Game World in 40 Milliseconds

This AI Makes a Video Game World in 40 Milliseconds

We had Dean Leitersdorf on the pod and he did something no guest had ever done.Mid-sentence, he transformed from a startup founder in a black t-shirt to a wizard with light shooting from his hands. Th...

3 Sep 20251h 5min

Populärt inom Teknik

uppgang-och-fall
elbilsveckan
market-makers
rss-elektrikerpodden
bosse-bildoktorn-och-hasse-p
natets-morka-sida
bilar-med-sladd
rss-laddstationen-med-elbilen-i-sverige
skogsforum-podcast
rss-uppgang-och-fall
gubbar-som-tjotar-om-bilar
developers-mer-an-bara-kod
rss-veckans-ai
rss-technokratin
hej-bruksbil
bli-saker-podden
rss-it-sakerhetspodden
algoritmen
rss-heja-framtiden
rss-en-ai-till-kaffet