#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind. He was lead researcher on AlphaGo and AlphaZero and co-lead on AlphaStar, and has done a lot of other important work in reinforcement learning, including MuZero.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rules of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self play in AlphaGo
1:06:12 – Lee Sedol's retirement from Go play
1:08:57 – Garry Kasparov
1:14:10 – AlphaZero and self play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life
