#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind. He was lead researcher on AlphaGo and AlphaZero, co-lead on AlphaStar and MuZero, and has contributed a lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rules of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self-play in AlphaGo
1:06:12 – Lee Sedol's retirement from Go
1:08:57 – Garry Kasparov
1:14:10 – AlphaZero and self-play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life
