Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore Facebook's Reinforcement Learning platform, ReAgent (formerly Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions the team is using ReAgent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online/offline and on/off policy model training, and where ReAgent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how the team ensures safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Episodes (778)

(5/5) AlphaVertex - Creating a Worldwide Financial Knowledge Graph - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with AlphaVertex, a FinTech start...

7 April 2017 · 26 min

(4/5) Behold.ai - Increasing Efficiency of Healthcare Insurance Billing with NLP - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with Behold.ai, which uses comput...

7 April 2017 · 16 min

(3/5) Cambrian Intelligence - Using AI to Simplify the Programming of Robots - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with Cambrian Intelligence, a com...

7 April 2017 · 23 min

(2/5) Klustera - Location-Based Intelligence for Smarter Marketing - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with Klustera, a company applying...

7 April 2017 · 22 min

(1/5) HelloVera - AI-Powered Customer Support - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with HelloVera, a company applyin...

7 April 2017 · 25 min

Interactive Machine Learning Systems with Alekh Agarwal - TWiML Talk #17

This week my guest is Alekh Agarwal. Alekh is a researcher with Microsoft Research whose research is focused on Interactive Machine Learning. In our discussion, Alekh and I discuss various aspects of ...

31 March 2017 · 30 min

Machine Learning in Cybersecurity with Evan Wright - TWiML Talk #16

This week my guest is Evan Wright, principal data scientist at cybersecurity startup Anomali. In my interview with Evan, he and I discussed a number of topics surrounding the use of machine lear...

24 March 2017 · 1 h 4 min

Domain Knowledge in Machine Learning Models for Sustainability with Stefano Ermon - TWiML Talk #15

My guest this week is Stefano Ermon, Assistant Professor of Computer Science at Stanford University, and Fellow at Stanford’s Woods Institute for the Environment. Stefano and I met at the Re-Work Deep...

17 March 2017 · 54 min
