Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore Facebook’s Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online and offline training and between on-policy and off-policy training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Episodes (778)

Topological Data Analysis with Gunnar Carlsson - TWiML Talk #53

The show you’re about to hear is part of a series of shows recorded in San Francisco at the Artificial Intelligence Conference. My guest for this show is Gunnar Carlsson, professor emeritus of mathema...

3 Oct 2017, 33 min

Bayesian Optimization for Hyperparameter Tuning with Scott Clark - TWiML Talk #50

As you all know, a few weeks ago, I spent some time in SF at the Artificial Intelligence Conference. While I was there, I had just enough time to sneak away and catch up with Scott Clark, Co-Founder a...

2 Oct 2017, 47 min

Symbolic and Sub-Symbolic Natural Language Processing with Jonathan Mugan - TWiML Talk #49

Like last week’s interview with Bruno Gonçalves, this week’s interview was also recorded at the last O’Reilly AI Conference back in New York in June. Also like last week’s show, this week’s is also fo...

25 Sep 2017, 43 min

Word2Vec & Friends with Bruno Gonçalves - TWiML Talk #48

This week I’m bringing you an interview with Bruno Gonçalves, a Moore-Sloan Data Science Fellow at NYU. As you’ll hear in the interview, Bruno is a longtime listener of the podcast. We were able to co...

19 Sep 2017, 32 min

Evolutionary Algorithms in Machine Learning with Risto Miikkulainen - TWiML Talk #47

My guest this week is Risto Miikkulainen, professor of computer science at UT-Austin and vice president of Research at Sentient Technologies. Risto came locked and loaded to discuss a topic that we've...

11 Sep 2017, 58 min

Agile Machine Learning with Jennifer Prendki - TWiML Talk #46

My guest this week is Jennifer Prendki. That name might sound familiar, as she was one of the great speakers from my Future of Data Summit back in May. At the time, Jennifer was senior data science ma...

5 Sep 2017, 48 min

LSTMs, Plus a Deep Learning History Lesson with Jürgen Schmidhuber - TWiML Talk #44

This week we have a very special interview to share with you! Those of you who’ve been receiving my newsletter for a while might remember that while in Switzerland last month, I had the pleasure of in...

28 Aug 2017, 1 h 3 min

Machine Teaching for Better Machine Learning with Mark Hammond - TWiML Talk #43

Today’s show, which concludes the first season of the Industrial AI Series, features my interview with Bonsai co-founder and CEO Mark Hammond. I sat down with Mark at Bonsai HQ a few weeks ago and we ...

21 Aug 2017, 1 h 5 min
