Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore their Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online/offline and on/off policy model training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Avsnitt(781)

How a Global Energy Company Adopts ML & AI with Nicholas Osborn - TWiML Talk #150

How a Global Energy Company Adopts ML & AI with Nicholas Osborn - TWiML Talk #150

On today’s show I’m excited to share this interview with Nick Osborn, a longtime listener of the show and Leader of the Global Machine Learning Project Management Office at AES Corporation, a Fortune ...

14 Juni 201846min

Problem Formulation for Machine Learning with Romer Rosales - TWiML Talk #149

Problem Formulation for Machine Learning with Romer Rosales - TWiML Talk #149

In this episode, i'm joined by Romer Rosales, Director of AI at LinkedIn. We begin with a discussion of graphical models and approximate probability inference, and he helps me make an important connec...

11 Juni 201850min

AI for Materials Discovery with Greg Mulholland - TWiML Talk #148

AI for Materials Discovery with Greg Mulholland - TWiML Talk #148

In this episode I’m joined by Greg Mulholland, Founder and CEO of Citrine Informatics, which is applying AI to the discovery and development of new materials. Greg and I start out with an exploration ...

7 Juni 201842min

Data Innovation & AI at Capital One with Adam Wenchel - TWiML Talk #147

Data Innovation & AI at Capital One with Adam Wenchel - TWiML Talk #147

In this episode I’m joined by Adam Wenchel, vice president of AI and Data Innovation at Capital One, to discuss how Machine Learning & AI are being integrated into their day-to-day practices, and how ...

4 Juni 201845min

Deep Gradient Compression for Distributed Training with Song Han - TWiML Talk #146

Deep Gradient Compression for Distributed Training with Song Han - TWiML Talk #146

On today’s show I chat with Song Han, assistant professor in MIT’s EECS department, about his research on Deep Gradient Compression. In our conversation, we explore the challenge of distributed traini...

31 Maj 201846min

Masked Autoregressive Flow for Density Estimation with George Papamakarios - TWiML Talk #145

Masked Autoregressive Flow for Density Estimation with George Papamakarios - TWiML Talk #145

In this episode, University of Edinburgh Phd student George Papamakarios and I discuss his paper “Masked Autoregressive Flow for Density Estimation.” George walks us through the idea of Masked Autoreg...

28 Maj 201834min

Training Data for Computer Vision at Figure Eight with Qazaleh Mirsharif - TWiML Talk #144

Training Data for Computer Vision at Figure Eight with Qazaleh Mirsharif - TWiML Talk #144

For today’s show, the last in our TrainAI series, I'm joined by Qazaleh Mirsharif, a machine learning scientist working on computer vision at Figure Eight. Qazaleh and I caught up at the TrainAI confe...

25 Maj 201821min

Agile Data Science with Sarah Aerni - TWiML Talk #143

Agile Data Science with Sarah Aerni - TWiML Talk #143

Today we continue our TrainAI series with Sarah Aerni, Director of Data Science at Salesforce Einstein. Sarah and I sat down at the TrainAI conference to discuss her talk “Notes from the Field: The Pl...

24 Maj 201838min

Populärt inom Politik & nyheter

svenska-fall
aftonbladet-krim
p3-krim
rss-krimstad
spar
fordomspodden
flashback-forever
rss-sanning-konsekvens
aftonbladet-daily
rss-vad-fan-hande
motiv
rss-expressen-dok
rss-frandfors-horna
rss-krimreportrarna
dagens-eko
politiken
krimmagasinet
rss-flodet
rss-aftonbladet-krim
kungligt