Tim & Heinrich — Democraticizing Reinforcement Learning Research

Tim & Heinrich — Democraticizing Reinforcement Learning Research

Since reinforcement learning requires hefty compute resources, it can be tough to keep up without a serious budget of your own. Find out how the team at Facebook AI Research (FAIR) is looking to increase access and level the playing field with the help of NetHack, an archaic rogue-like video game from the late 80s.

Links discussed:

The NetHack Learning Environment:

https://ai.facebook.com/blog/nethack-learning-environment-to-advance-deep-reinforcement-learning/

Reinforcement learning, intrinsic motivation:

https://arxiv.org/abs/2002.12292

Knowledge transfer:

https://arxiv.org/abs/1910.08210


Tim Rocktäschel is a Research Scientist at Facebook AI Research (FAIR) London and a Lecturer in the Department of Computer Science at University College London (UCL). At UCL, he is a member of the UCL Centre for Artificial Intelligence and the UCL Natural Language Processing group. Prior to that, he was a Postdoctoral Researcher in the Whiteson Research Lab, a Stipendiary Lecturer in Computer Science at Hertford College, and a Junior Research Fellow in Computer Science at Jesus College, at the University of Oxford.

https://twitter.com/_rockt


Heinrich Kuttler is an AI and machine learning researcher at Facebook AI Research (FAIR) and before that was a research engineer and team lead at DeepMind.

https://twitter.com/HeinrichKuttler

https://www.linkedin.com/in/heinrich-kuttler/


Topics covered:

0:00 a lack of reproducibility in RL

1:05 What is NetHack and how did the idea come to be?

5:46 RL in Go vs NetHack

11:04 performance of vanilla agents, what do you optimize for

18:36 transferring domain knowledge, source diving

22:27 human vs machines intrinsic learning

28:19 ICLR paper - exploration and RL strategies

35:48 the future of reinforcement learning

43:18 going from supervised to reinforcement learning

45:07 reproducibility in RL

50:05 most underrated aspect of ML, biggest challenges?


Get our podcast on these other platforms:

Apple Podcasts: http://wandb.me/apple-podcasts

Spotify: http://wandb.me/spotify

Google: http://wandb.me/google-podcasts

YouTube: http://wandb.me/youtube

Soundcloud: http://wandb.me/soundcloud


Tune in to our bi-weekly virtual salon and listen to industry leaders and researchers in machine learning share their research:

http://wandb.me/salon


Join our community of ML practitioners where we host AMA's, share interesting projects and meet other people working in Deep Learning:

http://wandb.me/slack


Our gallery features curated machine learning reports by researchers exploring deep learning techniques, Kagglers showcasing winning models, and industry leaders sharing best practices:

https://wandb.ai/gallery

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(136)

The Startup Powering The Data Behind AGI

The Startup Powering The Data Behind AGI

In this episode of Gradient Dissent, Lukas Biewald talks with the CEO & founder of Surge AI, the billion-dollar company quietly powering the next generation of frontier LLMs. They discuss Surge's orig...

16 Syys 202556min

Arvind Jain on Building Glean and the Future of Enterprise AI

Arvind Jain on Building Glean and the Future of Enterprise AI

In this episode of Gradient Dissent, Lukas Biewald sits down with Arvind Jain, CEO and founder of Glean. They discuss Glean's evolution from solving enterprise search to building agentic AI tools that...

5 Elo 202543min

How DeepL Built a Translation Powerhouse with AI with CEO Jarek Kutylowski

How DeepL Built a Translation Powerhouse with AI with CEO Jarek Kutylowski

In this episode of Gradient Dissent, Lukas Biewald talks with Jarek Kutylowski, CEO and founder of DeepL, an AI-powered translation company. Jarek shares DeepL’s journey from launching neural machine ...

8 Heinä 202542min

GitHub CEO Thomas Dohmke on Copilot and the Future of Software Development

GitHub CEO Thomas Dohmke on Copilot and the Future of Software Development

In this episode of Gradient Dissent, Lukas Biewald sits down with Thomas Dohmke, CEO of GitHub, to talk about the future of software engineering in the age of AI. They discuss how GitHub Copilot was b...

10 Kesä 20251h 9min

From Pharma to AGI Hype, and Developing AI in Finance: Martin Shkreli’s Journey

From Pharma to AGI Hype, and Developing AI in Finance: Martin Shkreli’s Journey

In this episode of Gradient Dissent, Lukas Biewald talks with Martin Shkreli — the infamous "pharma bro" turned founder — about his path from hedge fund manager and pharma CEO to convicted felon and n...

20 Touko 20251h 30min

Inside Cursor: The future of AI coding with Co-founder Sualeh Asif

Inside Cursor: The future of AI coding with Co-founder Sualeh Asif

In this episode of Gradient Dissent, host Lukas Biewald talks with Sualeh Asif, the CPO and co-founder of Cursor, one of the fastest-growing and most loved AI-powered coding platforms. Sualeh shares t...

29 Huhti 202549min

Inside the Dark Web, AI and Cybersecurity with Christopher Ahlberg CEO of Recorded Future

Inside the Dark Web, AI and Cybersecurity with Christopher Ahlberg CEO of Recorded Future

In this episode of Gradient Dissent, host Lukas Biewald talks with Christopher Ahlberg, CEO of Recorded Future, a pioneering cybersecurity company leveraging AI to provide intelligence insights. Chris...

8 Huhti 202550min

AI, autonomy, and the future of naval warfare with Captain Jon Haase, United States Navy

AI, autonomy, and the future of naval warfare with Captain Jon Haase, United States Navy

In this episode of Gradient Dissent, host Lukas Biewald speaks with Captain Jon Haase, United States Navy about real-world applications of AI and autonomy in defense. From underwater mine detection wi...

25 Maalis 20251h 1min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
psykopodiaa-podcast
rss-rahapodi
mimmit-sijoittaa
rss-oivalluksia-rahasta-elamasta
rss-rahamania
rss-sami-miettinen-neuvottelija
rss-startup-ministerio
asuntoasiaa-paivakirjat
rss-lahtijat
rahapuhetta
sijoituspodi
hyva-paha-johtaminen
rss-kaikki-koroista
rss-bisnesta-bebeja
rss-karon-grilli
rss-lentopaivakirjat
rss-set-for-life-sijoita-ja-vaurastu
rss-h-asselmoilanen
rss-paivystyspodi