Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore their Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online/offline and on/off policy model training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Avsnitt(781)

ML Use Cases at Think Big Analytics with Mo Patel and Laura Frølich - TWiML Talk #54

ML Use Cases at Think Big Analytics with Mo Patel and Laura Frølich - TWiML Talk #54

The show you’re about to hear is part of a series of shows recorded in San Francisco at the Artificial Intelligence Conference. This time around, I speak with Mo Patel, practice director of AI & deep ...

6 Okt 201745min

Intel Nervana Devcloud with Naveen Rao & Scott Apeland - TWiML Talk #51

Intel Nervana Devcloud with Naveen Rao & Scott Apeland - TWiML Talk #51

In this episode, I talk to Naveen Rao, VP and GM of Intel’s AI Products Group, and Scott Apeland, director of Intel’s Developer Network. It's been a few months since we last spoke to Naveen, so he giv...

6 Okt 201737min

Ray: A Distributed Computing Platform for Reinforcement Learning with Ion Stoica - TWiML Talk #55

Ray: A Distributed Computing Platform for Reinforcement Learning with Ion Stoica - TWiML Talk #55

The show you’re about to hear is part of a series of shows recorded in San Francisco at the Artificial Intelligence Conference. In this episode, I talk with Ion Stoica, professor of computer science &...

5 Okt 201728min

Topological Data Analysis with Gunnar Carlsson - TWiML Talk #53

Topological Data Analysis with Gunnar Carlsson - TWiML Talk #53

The show you’re about to hear is part of a series of shows recorded in San Francisco at the Artificial Intelligence Conference. My guest for this show is Gunnar Carlsson, professor emeritus of mathema...

3 Okt 201733min

Bayesian Optimization for Hyperparameter Tuning with Scott Clark - TWiML Talk #50

Bayesian Optimization for Hyperparameter Tuning with Scott Clark - TWiML Talk #50

As you all know, a few weeks ago, I spent some time in SF at the Artificial Intelligence Conference. While I was there, I had just enough time to sneak away and catch up with Scott Clark, Co-Founder a...

2 Okt 201747min

Symbolic and Sub-Symbolic Natural Language Processing with Jonathan Mugan - TWiML Talk #49

Symbolic and Sub-Symbolic Natural Language Processing with Jonathan Mugan - TWiML Talk #49

Like last week’s interview with Bruno Goncalves, this week’s interview was also recorded at the last O’Reilly AI Conference back in New York in June. Also like last week’s show, this week’s is also fo...

25 Sep 201743min

Word2Vec & Friends with Bruno Gonçalves - TWiML Talk #48

Word2Vec & Friends with Bruno Gonçalves - TWiML Talk #48

This week i'm bringing you an interview from Bruno Goncalves, a Moore-Sloan Data Science Fellow at NYU. As you’ll hear in the interview, Bruno is a longtime listener of the podcast. We were able to co...

19 Sep 201732min

Evolutionary Algorithms in Machine Learning with Risto Miikkulainen - TWiML Talk #47

Evolutionary Algorithms in Machine Learning with Risto Miikkulainen - TWiML Talk #47

My guest this week is Risto Miikkulainen, professor of computer science at UT-Austin and vice president of Research at Sentient Technologies. Risto came locked and loaded to discuss a topic that we've...

11 Sep 201758min

Populärt inom Politik & nyheter

aftonbladet-krim
svenska-fall
p3-krim
rss-krimstad
spar
fordomspodden
flashback-forever
rss-sanning-konsekvens
aftonbladet-daily
rss-vad-fan-hande
motiv
rss-expressen-dok
rss-frandfors-horna
dagens-eko
rss-krimreportrarna
politiken
blenda-2
rss-aftonbladet-krim
rss-flodet
olyckan-inifran