Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore their Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online/offline and on/off policy model training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Episodes (782)

Unifying Vision and Language Models with Mohit Bansal - #636

Today we're joined by Mohit Bansal, Parker Professor, and Director of the MURGe-Lab at UNC, Chapel Hill. In our conversation with Mohit, we explore the concept of unification in AI models, highlightin...

3 Jul 2023, 48 min

Data Augmentation and Optimized Architectures for Computer Vision with Fatih Porikli - #635

Today we kick off our coverage of the 2023 CVPR conference joined by Fatih Porikli, a Senior Director of Technology at Qualcomm. In our conversation with Fatih, we covered quite a bit of ground, touch...

26 Jun 2023, 52 min

Mojo: A Supercharged Python for AI with Chris Lattner - #634

Today we’re joined by Chris Lattner, Co-Founder and CEO of Modular. In our conversation with Chris, we discuss Mojo, a new programming language for AI developers. Mojo is unique in this space and simp...

19 Jun 2023, 57 min

Stable Diffusion and LLMs at the Edge with Jilei Hou - #633

Today we’re joined by Jilei Hou, a VP of Engineering at Qualcomm Technologies. In our conversation with Jilei, we focus on the emergence of generative AI, and how they've worked towards providing thes...

12 Jun 2023, 40 min

Modeling Human Behavior with Generative Agents with Joon Sung Park - #632

Today we’re joined by Joon Sung Park, a PhD Student at Stanford University. Joon shares his passion for creating AI systems that can solve human problems and his work on the recent paper Generative Ag...

5 Jun 2023, 46 min

Towards Improved Transfer Learning with Hugo Larochelle - #631

Today we’re joined by Hugo Larochelle, a research scientist at Google DeepMind. In our conversation with Hugo, we discuss his work on transfer learning, understanding the capabilities of deep learning...

29 May 2023, 38 min

Language Modeling With State Space Models with Dan Fu - #630

Today we’re joined by Dan Fu, a PhD student at Stanford University. In our conversation with Dan, we discuss the limitations of state space models in language modeling and the search for alternative b...

22 May 2023, 28 min

Building Maps and Spatial Awareness in Blind AI Agents with Dhruv Batra - #629

Today we continue our coverage of ICLR 2023 joined by Dhruv Batra, an associate professor at Georgia Tech and research director of the Fundamental AI Research (FAIR) team at META. In our conversation,...

15 May 2023, 43 min
