#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

David Silver leads the reinforcement learning research group at DeepMind. He was lead researcher on AlphaGo and AlphaZero and co-lead on AlphaStar and MuZero, along with a lot of other important work in reinforcement learning.

Support this podcast by signing up with these sponsors:
– MasterClass: https://masterclass.com/lex
– Cash App – use code “LexPodcast” and download:
– Cash App (App Store): https://apple.co/2sPrUHe
– Cash App (Google Play): https://bit.ly/2MlvP5w

EPISODE LINKS:
Reinforcement learning (book): https://amzn.to/2Jwp5zG

This conversation is part of the Artificial Intelligence podcast. If you would like more information about this podcast, go to https://lexfridman.com/ai or connect with @lexfridman on Twitter, LinkedIn, Facebook, Medium, or YouTube, where you can watch the video versions of these conversations. If you enjoy the podcast, please rate it 5 stars on Apple Podcasts, follow on Spotify, or support it on Patreon.

Here’s the outline of the episode. On some podcast players you should be able to click the timestamp to jump to that time.

OUTLINE:
00:00 – Introduction
04:09 – First program
11:11 – AlphaGo
21:42 – Rules of the game of Go
25:37 – Reinforcement learning: personal journey
30:15 – What is reinforcement learning?
43:51 – AlphaGo (continued)
53:40 – Supervised learning and self-play in AlphaGo
1:06:12 – Lee Sedol's retirement from Go
1:08:57 – Garry Kasparov
1:14:10 – AlphaZero and self-play
1:31:29 – Creativity in AlphaZero
1:35:21 – AlphaZero applications
1:37:59 – Reward functions
1:40:51 – Meaning of life
