#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rule of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self play in AlphaGo
1:06:12 – Lee Sedol retirement from Go play
1:08:57 – Garry Kasparov
1:14:10 – Alpha Zero and self play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life

Jaksot(491)

#378 – Anna Frebel: Origin and Evolution of the Universe, Galaxies, and Stars

#378 – Anna Frebel: Origin and Evolution of the Universe, Galaxies, and Stars

Anna Frebel is an astronomer and astrophysicist at MIT. Please support this podcast by checking out our sponsors: – Hexclad Cookware: https://hexclad.com/lex and use code LEX to get 10% off – Numerai:...

18 Touko 20232h 23min

#377 – Harvey Silverglate: Freedom of Speech

#377 – Harvey Silverglate: Freedom of Speech

Harvey Silverglate is a free speech advocate, co-founder of FIRE, the Foundation for Individual Rights in Expression, and author of several books on freedom of speech and criminal justice. He is runni...

16 Touko 20231h 55min

#376 – Stephen Wolfram: ChatGPT and the Nature of Truth, Reality & Computation

#376 – Stephen Wolfram: ChatGPT and the Nature of Truth, Reality & Computation

Stephen Wolfram is a computer scientist, mathematician, theoretical physicist, and the founder of Wolfram Research, a company behind Wolfram|Alpha, Wolfram Language, and the Wolfram Physics and Metama...

9 Touko 20234h 19min

#375 – David Pakman: Politics of Trump, Biden, Bernie, AOC, Socialism & Wokeism

#375 – David Pakman: Politics of Trump, Biden, Bernie, AOC, Socialism & Wokeism

David Pakman is a left-wing progressive political commentator and host of The David Pakman Show. Please support this podcast by checking out our sponsors: – Eight Sleep: https://www.eightsleep.com/lex...

6 Touko 20233h 36min

#374 – Robert Playter: Boston Dynamics CEO on Humanoid and Legged Robotics

#374 – Robert Playter: Boston Dynamics CEO on Humanoid and Legged Robotics

Robert Playter is CEO of Boston Dynamics, a legendary robotics company that over 30 years has created some of the most elegant, dextrous, and simply amazing robots ever built, including the humanoid r...

28 Huhti 20232h 32min

#373 – Manolis Kellis: Evolution of Human Civilization and Superintelligent AI

#373 – Manolis Kellis: Evolution of Human Civilization and Superintelligent AI

Manolis Kellis is a computational biologist at MIT. Please support this podcast by checking out our sponsors: – Eight Sleep: https://www.eightsleep.com/lex to get special savings – NetSuite: http://ne...

21 Huhti 20232h 35min

#372 – Simone Giertz: Queen of Sh*tty Robots, Innovative Engineering, and Design

#372 – Simone Giertz: Queen of Sh*tty Robots, Innovative Engineering, and Design

Simone Giertz is an inventor, designer, engineer, and roboticist famous for a combination of humor and brilliant creative design in the systems and products she creates. Please support this podcast by...

16 Huhti 20232h 4min

#371 – Max Tegmark: The Case for Halting AI Development

#371 – Max Tegmark: The Case for Halting AI Development

Max Tegmark is a physicist and AI researcher at MIT, co-founder of the Future of Life Institute, and author of Life 3.0: Being Human in the Age of Artificial Intelligence. Please support this podcast ...

13 Huhti 20232h 53min

Suosittua kategoriassa Tiede

rss-mita-tulisi-tietaa
tiedekulma-podcast
rss-poliisin-mieli
utelias-mieli
rss-duodecim-lehti
rss-laakaripodi
rss-opeklubi
rss-lihavuudesta-podcast
sotataidon-ytimessa
hippokrateen-vastaanotolla
rss-vaasan-yliopiston-podcastit
rss-ammamafia
rss-ylistys-elaimille