#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like to get more information about this podcast go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rule of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self play in AlphaGo
1:06:12 – Lee Sedol retirement from Go play
1:08:57 – Garry Kasparov
1:14:10 – Alpha Zero and self play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life

Episoder(495)

#422 – Mark Cuban: Shark Tank, DEI & Wokeism Debate, Elon Musk, Politics & Drugs

#422 – Mark Cuban: Shark Tank, DEI & Wokeism Debate, Elon Musk, Politics & Drugs

Mark Cuban is a businessman, investor, star of TV series Shark Tank, long-time principal owner of Dallas Mavericks, and founder of Cost Plus Drugs. Please support this podcast by checking out our spon...

29 Mar 20242h 23min

#421 – Dana White: UFC, Fighting, Khabib, Conor, Tyson, Ali, Rogan, Elon & Zuck

#421 – Dana White: UFC, Fighting, Khabib, Conor, Tyson, Ali, Rogan, Elon & Zuck

Dana White is the CEO and president of the UFC. Please support this podcast by checking out our sponsors: – LMNT: https://drinkLMNT.com/lex to get free sample pack – Notion: https://notion.com/lex – A...

25 Mar 20241h 36min

#420 – Annie Jacobsen: Nuclear War, CIA, KGB, Aliens, Area 51, Roswell & Secrecy

#420 – Annie Jacobsen: Nuclear War, CIA, KGB, Aliens, Area 51, Roswell & Secrecy

Annie Jacobsen is an investigative journalist and author of “Nuclear War: A Scenario” and many other books on war, weapons, government secrecy, and national security. Please support this podcast by ch...

22 Mar 20243h 12min

#419 – Sam Altman: OpenAI, GPT-5, Sora, Board Saga, Elon Musk, Ilya, Power & AGI

#419 – Sam Altman: OpenAI, GPT-5, Sora, Board Saga, Elon Musk, Ilya, Power & AGI

Sam Altman is the CEO of OpenAI, the company behind GPT-4, ChatGPT, Sora, and many other state-of-the-art AI technologies. Please support this podcast by checking out our sponsors: – Cloaked: https://...

18 Mar 20242h 2min

#418 – Israel-Palestine Debate: Finkelstein, Destiny, M. Rabbani & Benny Morris

#418 – Israel-Palestine Debate: Finkelstein, Destiny, M. Rabbani & Benny Morris

Norman Finkelstein and Benny Morris are historians. Mouin Rabbani is a Middle East analyst. Steven Bonnell (aka Destiny) is a political livestreamer. Please support this podcast by checking out our sp...

14 Mar 20245h 4min

#417 – Kimbal Musk: The Art of Cooking, Tesla, SpaceX, Zip2, and Family

#417 – Kimbal Musk: The Art of Cooking, Tesla, SpaceX, Zip2, and Family

Kimbal Musk is a chef, entrepreneur, and author of The Kitchen Cookbook: Cooking for Your Community. Please support this podcast by checking out our sponsors: – Eight Sleep: https://eightsleep.com/lex...

10 Mar 20241h 53min

#416 – Yann Lecun: Meta AI, Open Source, Limits of LLMs, AGI & the Future of AI

#416 – Yann Lecun: Meta AI, Open Source, Limits of LLMs, AGI & the Future of AI

Yann LeCun is the Chief AI Scientist at Meta, professor at NYU, Turing Award winner, and one of the most influential researchers in the history of AI. Please support this podcast by checking out our s...

7 Mar 20242h 54min

#415 – Serhii Plokhy: History of Ukraine, Russia, Soviet Union, KGB, Nazis & War

#415 – Serhii Plokhy: History of Ukraine, Russia, Soviet Union, KGB, Nazis & War

Serhii Plokhy is a Ukrainian historian at Harvard University, director of the Ukrainian Research Institute, and an author of many books on history of Eastern Europe, including his latest book The Russ...

4 Mar 20243h 27min

Populært innen Vitenskap

fastlegen
rekommandert
tingenes-tilstand
jss
rss-rekommandert
sinnsyn
forskningno
liberal-halvtime
fjellsportpodden
rss-nysgjerrige-norge
kvinnehelsepodden
nordnorsk-historie
villmarksliv
vett-og-vitenskap-med-gaute-einevoll
hva-er-greia-med
smart-forklart
nevropodden
tidlose-historier
aldring-og-helse-podden
rss-radium