#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rule of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self play in AlphaGo
1:06:12 – Lee Sedol retirement from Go play
1:08:57 – Garry Kasparov
1:14:10 – Alpha Zero and self play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life

Jaksot(495)

Gavin Miller: Adobe Research

Gavin Miller: Adobe Research

Gavin Miller is the Head of Adobe Research. Adobe have empowered artists, designers, and creative minds from all professions working in the digital medium for over 30 years with software such as Photo...

10 Kesä 20191h 9min

Rajat Monga: TensorFlow

Rajat Monga: TensorFlow

Rajat Monga is an Engineering Director at Google, leading the TensorFlow team. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman...

3 Kesä 20191h 11min

Chris Lattner: Compilers, LLVM, Swift, TPU, and ML Accelerators

Chris Lattner: Compilers, LLVM, Swift, TPU, and ML Accelerators

Chris Lattner is a senior director at Google working on several projects including CPU, GPU, TPU accelerators for TensorFlow, Swift for TensorFlow, and all kinds of machine learning compiler magic goi...

13 Touko 20191h 13min

Oriol Vinyals: DeepMind AlphaStar, StarCraft, Language, and Sequences

Oriol Vinyals: DeepMind AlphaStar, StarCraft, Language, and Sequences

Oriol Vinyals is a senior research scientist at Google DeepMind. Before that he was at Google Brain and Berkeley. His research has been cited over 39,000 times. He is one of the most brilliant and imp...

29 Huhti 20191h 46min

Ian Goodfellow: Generative Adversarial Networks (GANs)

Ian Goodfellow: Generative Adversarial Networks (GANs)

Ian Goodfellow is the author of the popular textbook on deep learning (simply titled “Deep Learning”). He coined the term Generative Adversarial Networks (GANs) and with his 2014 paper is responsible ...

18 Huhti 20191h 8min

Elon Musk: Tesla Autopilot

Elon Musk: Tesla Autopilot

Elon Musk is the CEO of Tesla, SpaceX, Neuralink, and a co-founder of several other companies. Video version is available on YouTube. If you would like to get more information about this podcast go to...

12 Huhti 201932min

Greg Brockman: OpenAI and AGI

Greg Brockman: OpenAI and AGI

Greg Brockman is the Co-Founder and CTO of OpenAI, a research organization developing ideas in AI that lead eventually to a safe & friendly artificial general intelligence that benefits and empowers h...

3 Huhti 20191h 25min

Eric Weinstein: Revolutionary Ideas in Science, Math, and Society

Eric Weinstein: Revolutionary Ideas in Science, Math, and Society

Eric Weinstein is a mathematician, economist, physicist, and managing director of Thiel Capital. He formed the “intellectual dark web” which is a loosely assembled group of public intellectuals includ...

20 Maalis 20191h 21min

Suosittua kategoriassa Tiede

tiedekulma-podcast
rss-mita-tulisi-tietaa
rss-poliisin-mieli
utelias-mieli
rss-metsantuntijat-podcast
rss-duodecim-lehti
mielipaivakirja
rss-luontopodi-samuel-glassar-tutkii-luonnon-ihmeita
docemilia
filocast-filosofian-perusteet
menologeja-tutkimusmatka-vaihdevuosiin
rss-bios-podcast
rss-astetta-parempi-elama-podcast
rss-tiedetta-vai-tarinaa
rss-lapsuuden-rakentajat-podcast
rss-sosiopodi
rss-miljonaarien-tasavalta