We Taught AI to Play Games—Now It’s a $3.6 Million Company
AI and I · 16 Oct 2025

This episode is a little different from our usual fare: it’s a conversation with our head of AI training, Alex Duffy, about Good Start Labs, a company he incubated inside Every. Today, Good Start Labs is spinning out of Every as a separate company with $3.6 million in funding from General Catalyst, Inovia, Every, and a group of angel investors from top-tier AI labs like DeepMind. We get into how Alex learned some of his biggest lessons about the real world from games, starting with RuneScape, which taught him how markets work and how not to get scammed. He explains why the static benchmarks we use to evaluate LLMs today are breaking down, and how games like Diplomacy offer a richer, more dynamic way to test and train large language models. Finally, Alex shares where he sees the most promise in AI (software, life sciences, and education) and why he believes games can make the models we use smarter while helping people understand and use AI more effectively.

If you found this episode interesting, please like, subscribe, comment, and share.


Want even more?

Sign up for Every to unlock our ultimate guide to prompting ChatGPT here: https://every.ck.page/ultimate-guide-to-prompting-chatgpt. It’s usually only for paying subscribers, but you can get it here for free.


Timestamps

00:00:00 - Start

00:01:48 - Introduction

00:04:14 - Why evals and benchmarks are broken

00:07:13 - The sneakiest LLMs in the market

00:13:00 - A competition that turns prompting into a sport

00:15:49 - Building a business around using games to make AI better

00:22:39 - Can language models learn how to be funny?

00:25:31 - Why games are a great way to evaluate and train new models

00:26:58 - What child psychology tells us about games and AI

00:30:10 - Using games to unlock continual learning in AI

00:36:42 - Why Alex cares deeply about games

00:44:37 - Where Alex sees the most promise in AI

00:50:54 - Rethinking how young people start their careers in the age of AI


