#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rule of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self play in AlphaGo
1:06:12 – Lee Sedol retirement from Go play
1:08:57 – Garry Kasparov
1:14:10 – Alpha Zero and self play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life

Jaksot(495)

Leslie Kaelbling: Reinforcement Learning, Planning, and Robotics

Leslie Kaelbling: Reinforcement Learning, Planning, and Robotics

Leslie Kaelbling is a roboticist and professor at MIT. She is recognized for her work in reinforcement learning, planning, robot navigation, and several other topics in AI. She won the IJCAI Computers...

12 Maalis 20191h 1min

Kyle Vogt: Cruise Automation

Kyle Vogt: Cruise Automation

Kyle Vogt is the President and CTO of Cruise Automation, leading an effort in trying to solve one of the biggest robotics challenges of our time: vehicle autonomy. He is the co-founder of 2 successful...

7 Helmi 201955min

Tomaso Poggio: Brains, Minds, and Machines

Tomaso Poggio: Brains, Minds, and Machines

Tomaso Poggio is a professor at MIT and is the director of the Center for Brains, Minds, and Machines. Cited over 100,000 times, his work has had a profound impact on our understanding of the nature o...

19 Tammi 20191h 20min

Tuomas Sandholm: Poker and Game Theory

Tuomas Sandholm: Poker and Game Theory

Tuomas Sandholm is a professor at CMU and co-creator of Libratus, which is the first AI system to beat top human players at the game of Heads-Up No-Limit Texas Hold’em. He has published over 450 paper...

28 Joulu 20181h 6min

Juergen Schmidhuber: Godel Machines, Meta-Learning, and LSTMs

Juergen Schmidhuber: Godel Machines, Meta-Learning, and LSTMs

Juergen Schmidhuber is the co-creator of long short-term memory networks (LSTMs) which are used in billions of devices today for speech recognition, translation, and much more. Over 30 years, he has p...

23 Joulu 20181h 20min

Pieter Abbeel: Deep Reinforcement Learning

Pieter Abbeel: Deep Reinforcement Learning

Pieter Abbeel is a professor at UC Berkeley, director of the Berkeley Robot Learning Lab, and is one of the top researchers in the world working on how to make robots understand and interact with the ...

16 Joulu 201842min

Stuart Russell: Long-Term Future of AI

Stuart Russell: Long-Term Future of AI

Stuart Russell is a professor of computer science at UC Berkeley and a co-author of the book that introduced me and millions of other people to AI, called Artificial Intelligence: A Modern Approach. ...

9 Joulu 20181h 26min

Eric Schmidt: Google

Eric Schmidt: Google

Eric Schmidt was the CEO of Google from 2001 to 2011, and its executive chairman from 2011 to 2017, guiding the company through a period of incredible growth and a series of world-changing innovations...

4 Joulu 201833min

Suosittua kategoriassa Tiede

tiedekulma-podcast
rss-mita-tulisi-tietaa
rss-poliisin-mieli
utelias-mieli
rss-metsantuntijat-podcast
rss-duodecim-lehti
mielipaivakirja
rss-luontopodi-samuel-glassar-tutkii-luonnon-ihmeita
docemilia
filocast-filosofian-perusteet
menologeja-tutkimusmatka-vaihdevuosiin
rss-bios-podcast
rss-astetta-parempi-elama-podcast
rss-tiedetta-vai-tarinaa
rss-lapsuuden-rakentajat-podcast
rss-sosiopodi
rss-miljonaarien-tasavalta