#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind. He was lead researcher on AlphaGo and AlphaZero and co-lead on AlphaStar and MuZero, and has done a lot of other important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rules of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self-play in AlphaGo
1:06:12 – Lee Sedol's retirement from Go
1:08:57 – Garry Kasparov
1:14:10 – AlphaZero and self-play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life

This episode was added to Podme via an open RSS feed and is not Podme's own production. It may therefore contain advertising.

Episodes (497)

Rohit Prasad: Amazon Alexa and Conversational AI

Rohit Prasad is the vice president and head scientist of Amazon Alexa and one of its original creators. This conversation is part of the Artificial Intelligence podcast. If you would like to get more ...

14 Dec 2019, 1h 46min

Judea Pearl: Causal Reasoning, Counterfactuals, Bayesian Networks, and the Path to AGI

Judea Pearl is a professor at UCLA and a winner of the Turing Award, which is generally recognized as the Nobel Prize of computing. He is one of the seminal figures in the field of artificial intelligen...

11 Dec 2019, 1h 23min

Whitney Cummings: Comedy, Robotics, Neurology, and Love

Whitney Cummings is a stand-up comedian, actor, producer, writer, director, and the host of a new podcast called Good for You. Her most recent Netflix special called “Can I Touch It?” features in part...

5 Dec 2019, 1h 17min

Ray Dalio: Principles, the Economic Machine, Artificial Intelligence & the Arc of Life

Ray Dalio is the founder, Co-Chairman and Co-Chief Investment Officer of Bridgewater Associates, one of the world’s largest and most successful investment firms that is famous for the principles of ra...

2 Dec 2019, 1h 30min

Noam Chomsky: Language, Cognition, and Deep Learning

Noam Chomsky is one of the greatest minds of our time and is one of the most cited scholars in history. He is a linguist, philosopher, cognitive scientist, historian, social critic, and political acti...

29 Nov 2019, 36min

Gilbert Strang: Linear Algebra, Deep Learning, Teaching, and MIT OpenCourseWare

Gilbert Strang is a professor of mathematics at MIT and perhaps one of the most famous and impactful teachers of math in the world. His MIT OpenCourseWare lectures on linear algebra have been viewed m...

25 Nov 2019, 50min

Dava Newman: Space Exploration, Space Suits, and Life on Mars

Dava Newman is the Apollo Program professor of AeroAstro at MIT and the former Deputy Administrator of NASA and has been a principal investigator on four spaceflight missions. Her research interests a...

22 Nov 2019, 39min

Michael Kearns: Algorithmic Fairness, Bias, Privacy, and Ethics in Machine Learning

Michael Kearns is a professor at University of Pennsylvania and a co-author of the new book Ethical Algorithm that is the focus of much of our conversation, including algorithmic fairness, bias, priva...

19 Nov 2019, 1h 49min
