Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore their Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online/offline and on/off policy model training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Avsnitt(781)

Agile Machine Learning with Jennifer Prendki - TWiML Talk #46

Agile Machine Learning with Jennifer Prendki - TWiML Talk #46

My guest this week is Jennifer Prendki. That name might sound familiar, as she was one of the great speakers from my Future of Data Summit back in May. At the time, Jennifer was senior data science ma...

5 Sep 201748min

LSTMs, Plus a Deep Learning History Lesson with Jürgen Schmidhuber - TWiML Talk #44

LSTMs, Plus a Deep Learning History Lesson with Jürgen Schmidhuber - TWiML Talk #44

This week we have a very special interview to share with you! Those of you who’ve been receiving my newsletter for a while might remember that while in Switzerland last month, I had the pleasure of in...

28 Aug 20171h 3min

Machine Teaching for Better Machine Learning with Mark Hammond - TWiML Talk #43

Machine Teaching for Better Machine Learning with Mark Hammond - TWiML Talk #43

Today’s show, which concludes the first season of the Industrial AI Series, features my interview with Bonsai co-founder and CEO Mark Hammond. I sat down with Mark at Bonsai HQ a few weeks ago and we ...

21 Aug 20171h 5min

Marrying Physics-Based and Data-Driven ML Models with Josh Bloom - TWiML Talk #42

Marrying Physics-Based and Data-Driven ML Models with Josh Bloom - TWiML Talk #42

Recently I had a chance to catch up with a friend and friend of the show, Josh Bloom, vice president of data & analytics at GE Digital. If you’ve been listening for a while, you already know that Josh...

14 Aug 201752min

Data Pipelines at Zymergen with Airflow with Erin Shellman - TWiML Talk #41

Data Pipelines at Zymergen with Airflow with Erin Shellman - TWiML Talk #41

The show you’re listening to features my interview with Erin Shellman. Erin is a statistician and data science manager with Zymergen, a company using robots and machine learning to engineer better mic...

5 Aug 201735min

Cognitive Biases in Data Science with Drew Conway - TWiML Talk #39

Cognitive Biases in Data Science with Drew Conway - TWiML Talk #39

This show features my interview with Drew Conway, whose Wrangle keynote could have been called “Confessions of a CIA Data Scientist.” The focus of our interview, and of Drew’s presentation, is an inte...

5 Aug 201734min

Web Scale Engineering for Machine Learning with Sharath Rao - TWiML Talk #40

Web Scale Engineering for Machine Learning with Sharath Rao - TWiML Talk #40

The show you’re about to listen to features my interview with Sharath Rao, Tech Lead Manager & Machine Learning Engineer at Instacart I reached out to Sharath about being on the show and was blown awa...

4 Aug 201731min

Deep Learning for Warehouse Operations with Calvin Seward - TWiML Talk #38

Deep Learning for Warehouse Operations with Calvin Seward - TWiML Talk #38

This week, I’m happy to bring you my interview with Calvin Seward, a research scientist with Berlin, Germany based Zalando. While our American listeners might not know the name Zalando, they’re one of...

31 Juli 201746min

Populärt inom Politik & nyheter

aftonbladet-krim
svenska-fall
rss-krimstad
p3-krim
rss-expressen-dok
fordomspodden
flashback-forever
rss-sanning-konsekvens
motiv
aftonbladet-daily
grans
rss-vad-fan-hande
rss-krimreportrarna
spar
rss-frandfors-horna
rss-flodet
blenda-2
krimmagasinet
olyckan-inifran
dagens-eko