#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rule of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self play in AlphaGo
1:06:12 – Lee Sedol retirement from Go play
1:08:57 – Garry Kasparov
1:14:10 – Alpha Zero and self play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life

Jaksot(491)

#466 – Jeffrey Wasserstrom: China, Xi Jinping, Trade War, Taiwan, Hong Kong, Mao

#466 – Jeffrey Wasserstrom: China, Xi Jinping, Trade War, Taiwan, Hong Kong, Mao

Jeffrey Wasserstrom is a historian of modern China. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep466-sc See below for timestamps, transcript, and to give feedbac...

24 Huhti 20253h 14min

#465 – Robert Rodriguez: Sin City, Desperado, El Mariachi, Alita, and Filmmaking

#465 – Robert Rodriguez: Sin City, Desperado, El Mariachi, Alita, and Filmmaking

Robert Rodriguez is a legendary filmmaker and creator of Sin City, El Mariachi, Desperado, Spy Kids, Machete, From Dusk Till Dawn, Alita: Battle Angel, The Faculty, and his newest venture Brass Knuckl...

17 Huhti 20253h 35min

#464 – Dave Smith: Israel, Ukraine, Epstein, Mossad, Conspiracies & Antisemitism

#464 – Dave Smith: Israel, Ukraine, Epstein, Mossad, Conspiracies & Antisemitism

Dave Smith is a comedian, libertarian, political commentator, and the host of Part of the Problem podcast. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep464-sc Se...

9 Huhti 20253h 26min

#463 – Douglas Murray: Putin, Zelenskyy, Trump, Israel, Netanyahu, Hamas & Gaza

#463 – Douglas Murray: Putin, Zelenskyy, Trump, Israel, Netanyahu, Hamas & Gaza

Douglas Murray is the author of On Democracies and Death Cults, The War on The West, and The Madness of Crowds. Thank you for listening ❤ Check out our sponsors: https://lexfridman.com/sponsors/ep463-...

30 Maalis 20253h 16min

#462 – Ezra Klein and Derek Thompson: Politics, Trump, AOC, Elon & DOGE

#462 – Ezra Klein and Derek Thompson: Politics, Trump, AOC, Elon & DOGE

Ezra Klein is one of the most influential voices representing the left-wing of American politics. He is a columnist for the NY Times and host of The Ezra Klein Show. Derek Thompson is a writer at The ...

26 Maalis 20250s

#461 – ThePrimeagen: Programming, AI, ADHD, Productivity, Addiction, and God

#461 – ThePrimeagen: Programming, AI, ADHD, Productivity, Addiction, and God

ThePrimeagen (aka Michael Paulson) is a programmer who has educated, entertained, and inspired millions of people to build software and have fun doing it. Thank you for listening ❤ Check out our spons...

22 Maalis 20255h 30min

#460 – Narendra Modi: Prime Minister of India – Power, Democracy, War & Peace

#460 – Narendra Modi: Prime Minister of India – Power, Democracy, War & Peace

Narendra Modi is the Prime Minister of India. On YouTube this episode is available in English, Hindi, Russian (and soon other languages). Captions and voice-over audio tracks are provided (for the mai...

16 Maalis 20253h 25min

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

Dylan Patel is the founder of SemiAnalysis, a research & analysis company specializing in semiconductors, GPUs, CPUs, and AI hardware. Nathan Lambert is a research scientist at the Allen Institute for...

3 Helmi 20255h 16min

Suosittua kategoriassa Tiede

rss-mita-tulisi-tietaa
rss-poliisin-mieli
tiedekulma-podcast
rss-duodecim-lehti
utelias-mieli
docemilia
rss-ammamafia
rss-laakaripodi
rss-mental-race
sotataidon-ytimessa
menologeja-tutkimusmatka-vaihdevuosiin
rss-vaasan-yliopiston-podcastit
rss-opeklubi
rss-ylistys-elaimille
rss-tervetta-skeptisyytta
rss-luontopodi-samuel-glassar-tutkii-luonnon-ihmeita
rss-lihavuudesta-podcast
rss-sosiopodi