Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore Facebook’s reinforcement learning platform, ReAgent (formerly Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions the team uses ReAgent to make, from ranking and recommendations to the eCommerce marketplace. Jason also walks us through the differences between online and offline training and between on-policy and off-policy training, and where ReAgent sits on this spectrum. Finally, we discuss the concept of counterfactual causality and how the team ensures the safety of its models’ results. The complete show notes for this episode can be found at twimlai.com/go/448.

Episodes (779)

Robotics at OpenAI with Jonas Schneider - TWiML Talk #76

The show is part of a series that I’m really excited about, in part because I’ve been working to bring them to you for quite a while now. The focus of the series is a sampling of the interesting work ...

1 Dec 2017, 45 min

AI Robustness and Safety with Dario Amodei - TWiML Talk #75

The show is part of a series that I’m really excited about, in part because I’ve been working to bring them to you for quite a while now. The focus of the series is a sampling of the interesting work ...

30 Nov 2017, 36 min

Towards Artificial General Intelligence with Greg Brockman - TWiML Talk #74

The show is part of a series that I’m really excited about, in part because I’ve been working to bring them to you for quite a while now. The focus of the series is a sampling of the interesting work ...

28 Nov 2017, 55 min

Explaining Black Box Predictions with Sam Ritchie - TWiML Talk #73

This week, we’ll be featuring a series of shows recorded from Strange Loop, a great developer-focused conference that takes place every year right in my backyard! The conference is a multi-disciplinar...

25 Nov 2017, 38 min

Experimental Creative Writing with the Vectorized Word - Allison Parrish - TWiML Talk #72

This week, we’ll be featuring a series of shows recorded from Strange Loop, a great developer-focused conference that takes place every year right in my backyard! The conference is a multi-disciplinar...

24 Nov 2017, 28 min

The Biological Path Towards Strong AI - Matthew Taylor - TWiML Talk #71

This week, we’ll be featuring a series of shows recorded from Strange Loop, a great developer-focused conference that takes place every year right in my backyard! The conference is a multi-disciplinar...

22 Nov 2017, 37 min

PyTorch: Fast Differentiable Dynamic Graphs in Python with Soumith Chintala - TWiML Talk #70

This week, we’ll be featuring a series of shows recorded from Strange Loop, a great developer-focused conference that takes place every year right in my backyard! The conference is a multi-disciplinar...

21 Nov 2017, 42 min

Accessible Machine Learning for the Enterprise Developer with Ryan Sevey & Jason Montgomery

This week, we’ll be featuring a series of shows recorded from Strange Loop, a great developer-focused conference that takes place every year right in my backyard! The conference is a multi-disciplinar...

20 Nov 2017, 45 min
