Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore their Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online/offline and on/off policy model training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Avsnitt(765)

This Week in Machine Learning & AI - 6/17/16: Apple's New ML APIs, IBM Brings Deep Learning Thunder

This Week in Machine Learning & AI - 6/17/16: Apple's New ML APIs, IBM Brings Deep Learning Thunder

This Week in Machine Learning & AI brings you the week’s most interesting and important stories from the world of machine learning and artificial intelligence. This week’s podcast digs into Apple's ML and AI announcements at WWDC, looks at IBM's new Deep Thunder offering, and discusses exciting new deep learning research from MIT, OpenAI and Google. Show notes available at https://twimlai.com/5.

18 Juni 201624min

This Week In Machine Learning & AI - 6/10/16: Self-Motivated AI, Plus A Kill-Switch for Rogue Bots

This Week In Machine Learning & AI - 6/10/16: Self-Motivated AI, Plus A Kill-Switch for Rogue Bots

This Week in Machine Learning & AI brings you the week’s most interesting and important stories from the world of machine learning and artificial intelligence. This week’s podcast looks at new research on intrinsic motivation for AI systems, a kill-switch for intelligent agents, "knu" chips for machine learning, a screenplay made by a neural net, and more. Show notes and subscribe links at https://cloudpul.se/twiml/4.

11 Juni 201624min

This Week In Machine Learning & AI - 6/3/16: Facebook's DeepText, ML & Art, Artificial Assistants

This Week In Machine Learning & AI - 6/3/16: Facebook's DeepText, ML & Art, Artificial Assistants

This Week in Machine Learning & AI brings you the week’s most interesting and important stories from the world of machine learning and artificial intelligence. This week’s podcast looks at Facebooks' new DeepText engine, creating music & art with deep learning and Google Magenta, how to build artificial assistants and bots, and applying economics to machine learning models. For show notes visit: https://cloudpul.se/posts/twiml-facebooks-deeptext-ml-art-artificial-assistants

4 Juni 201624min

This Week In Machine Learning & AI - 5/27/16: The White House on AI & Aggressive Self-Driving Cars

This Week In Machine Learning & AI - 5/27/16: The White House on AI & Aggressive Self-Driving Cars

This Week in Machine Learning & AI brings you the week's most interesting and important stories from the world of machine learning and artificial intelligence. This week's episode explores the White House workshops on AI, human bias in AI and machine learning models, a company working on machine learning for small datasets, plus the latest AI & ML news and a self-driving car that learned how to drive aggressively.

28 Maj 201625min

This Week In Machine Learning & AI - 5/20/16: AI at Google I/O, Amazon's Deep Learning DSSTNE

This Week In Machine Learning & AI - 5/20/16: AI at Google I/O, Amazon's Deep Learning DSSTNE

This Week In Machine Learning & AI - May 20, 2016. Google I/O, deep learning hardware and an AI to save you from conference call hell.

21 Maj 201619min

Populärt inom Politik & nyheter

svenska-fall
p3-krim
rss-krimstad
rss-viva-fotboll
fordomspodden
flashback-forever
aftonbladet-daily
rss-sanning-konsekvens
rss-vad-fan-hande
olyckan-inifran
dagens-eko
rss-frandfors-horna
krimmagasinet
rss-krimreportrarna
rss-expressen-dok
motiv
svd-dokumentara-berattelser-2
blenda-2
svd-nyhetsartiklar
spotlight