Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore their Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online/offline and on/off policy model training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Jaksot(778)

Expressive AI - Generated Music With Google's Performance RNN - Doug Eck - TWiML Talk #32

Expressive AI - Generated Music With Google's Performance RNN - Doug Eck - TWiML Talk #32

My guest for this second show in our O’Reilly AI series is Doug Eck of Google Brain. Doug did a keynote at the O’Reilly conference on Magenta, Google’s project for melding machine learning and the art...

5 Heinä 201746min

The Power Of Probabilistic Programming with Ben Vigoda - TWiML Talk #33

The Power Of Probabilistic Programming with Ben Vigoda - TWiML Talk #33

My guest for this third episode in the O'Reilly AI series is Ben Vigoda. Ben is the founder and CEO of Gamalon, a DARPA-funded startup working on Bayesian Program Synthesis. We dive into what exactly ...

5 Heinä 201742min

Video Object Detection At Scale with Reza Zadeh - TWiML Talk #34

Video Object Detection At Scale with Reza Zadeh - TWiML Talk #34

My guest for the fourth show in the O'Reilly AI Series is Reza Zadeh. Reza is an adjunct professor of computational mathematics at Stanford University and founder and CEO of the startup Matroid. Reza ...

5 Heinä 201752min

Enhancing Customer Experiences With Emotional AI with Rana El Kaliouby - TWiML Talk #35

Enhancing Customer Experiences With Emotional AI with Rana El Kaliouby - TWiML Talk #35

My guest for this show is Rana el Kaliouby. Rana is co-founder and CEO of Affectiva. Affectiva, as Rana puts it, "is on a mission to humanize technology by bringing in artificial emotional intelligenc...

5 Heinä 201733min

Natural Language Understanding for Amazon Alexa with Zornitsa Kozareva - TWiML Talk #30

Natural Language Understanding for Amazon Alexa with Zornitsa Kozareva - TWiML Talk #30

Our guest this week is Zornitsa Kozareva, Manager of Machine Learning with Amazon Web Services Deep Learning, where she leads a group focused on natural language processing and dialogue systems for pr...

29 Kesä 201755min

Robotic Perception and Control with Chelsea Finn - TWiML Talk #29

Robotic Perception and Control with Chelsea Finn - TWiML Talk #29

This week we continue our series on industrial applications of machine learning and AI with a conversation with Chelsea Finn, a PhD student at UC Berkeley. Chelsea’s research is focused on machine lea...

23 Kesä 201754min

Reinforcement Learning Deep Dive with Pieter Abbeel - TWiML Talk #28

Reinforcement Learning Deep Dive with Pieter Abbeel - TWiML Talk #28

This week our guest is Pieter Abbeel, Assistant Professor at UC Berkeley, Research Scientist at OpenAI, and Cofounder of Gradescope. Pieter has an extensive background in AI research, going way back t...

17 Kesä 201752min

Intelligent Autonomous Robots with Ilia Baranov - TWiML Talk #27

Intelligent Autonomous Robots with Ilia Baranov - TWiML Talk #27

Our first guest in the Industrial AI series is Ilia Baranov, engineering manager at Clearpath Robotics. Ilia is responsible for setting the engineering direction for all of Clearpath’s research platfo...

9 Kesä 201753min

Suosittua kategoriassa Politiikka ja uutiset

aikalisa
tervo-halme
rss-ootsa-kuullut-tasta
ootsa-kuullut-tasta-2
politiikan-puskaradio
viisupodi
rss-vaalirankkurit-podcast
rss-podme-livebox
et-sa-noin-voi-sanoo-esittaa
otetaan-yhdet
linda-maria
io-techin-tekniikkapodcast
rss-tasta-on-kyse-ivan-puopolo-verkkouutiset
rikosmyytit
rss-polikulaari-humanisti-vastaa-ja-muut-ts-podcastit
viela-yksi-sivu
rss-uusi-juttu
rss-aika-ankkuri
rss-kaikki-uusiksi
rss-merja-mahkan-rahat