Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore their Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online/offline and on/off policy model training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Episoder(779)

Machine Teaching for Better Machine Learning with Mark Hammond - TWiML Talk #43

Machine Teaching for Better Machine Learning with Mark Hammond - TWiML Talk #43

Today’s show, which concludes the first season of the Industrial AI Series, features my interview with Bonsai co-founder and CEO Mark Hammond. I sat down with Mark at Bonsai HQ a few weeks ago and we ...

21 Aug 20171h 5min

Marrying Physics-Based and Data-Driven ML Models with Josh Bloom - TWiML Talk #42

Marrying Physics-Based and Data-Driven ML Models with Josh Bloom - TWiML Talk #42

Recently I had a chance to catch up with a friend and friend of the show, Josh Bloom, vice president of data & analytics at GE Digital. If you’ve been listening for a while, you already know that Josh...

14 Aug 201752min

Data Pipelines at Zymergen with Airflow with Erin Shellman - TWiML Talk #41

Data Pipelines at Zymergen with Airflow with Erin Shellman - TWiML Talk #41

The show you’re listening to features my interview with Erin Shellman. Erin is a statistician and data science manager with Zymergen, a company using robots and machine learning to engineer better mic...

5 Aug 201735min

Cognitive Biases in Data Science with Drew Conway - TWiML Talk #39

Cognitive Biases in Data Science with Drew Conway - TWiML Talk #39

This show features my interview with Drew Conway, whose Wrangle keynote could have been called “Confessions of a CIA Data Scientist.” The focus of our interview, and of Drew’s presentation, is an inte...

5 Aug 201734min

Web Scale Engineering for Machine Learning with Sharath Rao - TWiML Talk #40

Web Scale Engineering for Machine Learning with Sharath Rao - TWiML Talk #40

The show you’re about to listen to features my interview with Sharath Rao, Tech Lead Manager & Machine Learning Engineer at Instacart I reached out to Sharath about being on the show and was blown awa...

4 Aug 201731min

Deep Learning for Warehouse Operations with Calvin Seward - TWiML Talk #38

Deep Learning for Warehouse Operations with Calvin Seward - TWiML Talk #38

This week, I’m happy to bring you my interview with Calvin Seward, a research scientist with Berlin, Germany based Zalando. While our American listeners might not know the name Zalando, they’re one of...

31 Jul 201746min

Deep Robotic Learning with Sergey Levine - TWiML Talk #37

Deep Robotic Learning with Sergey Levine - TWiML Talk #37

This week we continue our Industrial AI series with Sergey Levine, an Assistant Professor at UC Berkeley whose research focus is Deep Robotic Learning. Sergey is part of the same research team as a co...

24 Jul 201746min

Smart Buildings & IoT with Yodit Stanton - TWiML Talk #36

Smart Buildings & IoT with Yodit Stanton - TWiML Talk #36

After a brief hiatus, the Industrial AI Series is making its triumphant return! Our guest this week is Yodit Stanton, a self-described Data Nerd, and the Founder & CEO of Opensensors.io. OpenSensors.i...

17 Jul 201753min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
forklart
stopp-verden
popradet
det-store-bildet
dine-penger-pengeradet
rss-gukild-johaug
bt-dokumentar-2
lydartikler-fra-aftenposten
hanna-de-heldige
fotballpodden-2
nokon-ma-ga
e24-podden
frokostshowet-pa-p5
aftenbla-bla
rss-ness
rss-penger-polser-og-politikk
rss-dannet-uten-piano