#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rule of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self play in AlphaGo
1:06:12 – Lee Sedol retirement from Go play
1:08:57 – Garry Kasparov
1:14:10 – Alpha Zero and self play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life

Jaksot(495)

Jeff Atwood: Stack Overflow and Coding Horror

Jeff Atwood: Stack Overflow and Coding Horror

Jeff Atwood is a co-founder of Stack Overflow and Stack Exchange, websites that are visited by millions of people every day. Much like with Wikipedia, it is difficult to understate the impact on globa...

29 Marras 20181h 20min

Guido van Rossum: Python

Guido van Rossum: Python

Guido van Rossum is the creator of Python, one of the most popular and impactful programming languages in the world. Video version is available on YouTube. If you would like to get more information ab...

22 Marras 20181h 26min

Vladimir Vapnik: Statistical Learning

Vladimir Vapnik: Statistical Learning

Vladimir Vapnik is the co-inventor of support vector machines, support vector clustering, VC theory, and many foundational ideas in statistical learning. His work has been cited over 170,000 times. He...

16 Marras 201854min

Yoshua Bengio: Deep Learning

Yoshua Bengio: Deep Learning

Yoshua Bengio, along with Geoffrey Hinton and Yann Lecun, is considered one of the three people most responsible for the advancement of deep learning during the 1990s, 2000s, and now. Cited 139,000 ti...

20 Loka 201842min

Steven Pinker: AI in the Age of Reason

Steven Pinker: AI in the Age of Reason

Steven Pinker is a professor at Harvard and before that was a professor at MIT. He is the author of many books, several of which have had a big impact on the way I see the world for the better. In par...

17 Loka 201838min

Christof Koch: Consciousness

Christof Koch: Consciousness

A conversation with Christof Koch as part of MIT course on Artificial General Intelligence. Video version is available on YouTube. He is the President and Chief Scientific Officer of the Allen Institu...

2 Syys 201859min

Max Tegmark: Life 3.0

Max Tegmark: Life 3.0

A conversation with Max Tegmark as part of MIT course on Artificial General Intelligence. Video version is available on YouTube. He is a Physics Professor at MIT, co-founder of the Future of Life Inst...

26 Elo 20181h 22min

Suosittua kategoriassa Tiede

tiedekulma-podcast
rss-mita-tulisi-tietaa
rss-poliisin-mieli
utelias-mieli
rss-metsantuntijat-podcast
rss-duodecim-lehti
mielipaivakirja
rss-luontopodi-samuel-glassar-tutkii-luonnon-ihmeita
docemilia
filocast-filosofian-perusteet
menologeja-tutkimusmatka-vaihdevuosiin
rss-bios-podcast
rss-astetta-parempi-elama-podcast
rss-tiedetta-vai-tarinaa
rss-lapsuuden-rakentajat-podcast
rss-sosiopodi
rss-miljonaarien-tasavalta