We Taught AI to Play Games—Now It’s a $3.6 Million Company
AI and I · 16 Oct 2025

This episode is a little different from our usual fare: It’s a conversation with our head of AI training Alex Duffy about Good Start Labs, a company he incubated inside Every. Today, Good Start Labs is spinning out of Every as a separate company with $3.6 million in funding from General Catalyst, Inovia, Every, and a group of angel investors from top-tier AI labs like DeepMind. We get into how Alex learned some of his biggest lessons about the real world from games, starting with RuneScape, which taught him how markets work and how not to get scammed. He explains why the static benchmarks we use to evaluate LLMs today are breaking down, and how games like Diplomacy offer a richer, more dynamic way to test and train large language models. Finally, Alex shares where he sees the most promise in AI—software, life sciences, and education—and why he believes games can make the models we use smarter, while helping people understand and use AI more effectively.

If you found this episode interesting, please like, subscribe, comment, and share.


Want even more?

Sign up for Every to unlock our ultimate guide to prompting ChatGPT here: https://every.ck.page/ultimate-guide-to-prompting-chatgpt. It’s usually only for paying subscribers, but you can get it here for free.


Timestamps

00:00:00 - Start

00:01:48 - Introduction

00:04:14 - Why evals and benchmarks are broken

00:07:13 - The sneakiest LLMs in the market

00:13:00 - A competition that turns prompting into a sport

00:15:49 - Building a business around using games to make AI better

00:22:39 - Can language models learn how to be funny?

00:25:31 - Why games are a great way to evaluate and train new models

00:26:58 - What child psychology tells us about games and AI

00:30:10 - Using games to unlock continual learning in AI

00:36:42 - Why Alex cares deeply about games

00:44:37 - Where Alex sees the most promise in AI

00:50:54 - Rethinking how young people start their careers in the age of AI


Episodes (98)

Do 60-Minute Coding Tasks in 60 Seconds—With AI - Ep. 41 with Steve Krouse

Here’s the most compelling benchmark of AI progress:  A task that took 60 minutes a year ago now takes 60 seconds. In January 2024, Geoffrey Litt and I spent an hour coaxing ChatGPT and Replit to buil...

4 Dec 2024 · 1h 1min

How We Incubate and Launch New Products With AI - Ep. 40 with Danny Aziz, Brandon Gell

Over the last few months at Every, we’ve launched two AI products, acquired tens of thousands of users, and released a new incubation in private alpha. The weird thing is: We’re a media company w...

27 Nov 2024 · 1h

His GPT Wrapper Has Half a Million Users—And Keeps Growing - Ep. 39 with Vicente Silveira

Everyone told Vicente Silveira that his startup—a GPT wrapper—would fail.  Instead, one year later, it’s thriving—with about 500,000 registered users, nearly 3,000 paying subscribers, and over 2 milli...

20 Nov 2024 · 1h 3min

How to Win With Prompt Engineering - Ep. 38 with Jared Zoneraich

Prompt engineering matters more than ever. But it’s evolving into something totally new:  A way for non-technical domain experts to solve complex problems with AI. I spent an hour talking to prompt wi...

13 Nov 2024 · 1h 2min

How Notion Cofounder Simon Last Builds AI for Millions of Users - Ep. 37 with Simon Last

This episode is sponsored by Notion. I’ve been using Notion to manage my professional and personal life for almost 10 years. As a company, they pay attention to the craft and ideas underlying the soft...

8 Nov 2024 · 55min

How Union Square Ventures Built an AI Brain for Venture Capital - Ep. 36 with Matt Cynamon

Union Square Ventures is building an AI operating system to support their investment team.  But it’s not what you think: It’s a constellation of AI tools that captures and synthesizes the firm's colle...

30 Oct 2024 · 1h 9min

Building AI That Builds Itself - Ep. 35 with Yohei Nakajima

Yohei Nakajima leads a double life.  By day, he’s a general partner of a small venture firm, Untapped Capital.  By night, he’s one of the most prolific internet tinkerers in AI. (He also sometimes wor...

23 Oct 2024 · 58min

How to Use AI to Become a Learning Machine - Ep. 34 with Simon Eskildsen

This episode is sponsored by Reflect. It’s the ultra-fast note-taking app that’s about to change the way you take notes. To boost your productivity with advanced features like custom prompts and voice...

11 Sep 2024 · 1h 13min
