Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Off-Line, Off-Policy RL for Real-World Decision Making at Facebook - #448

Today we’re joined by Jason Gauci, a Software Engineering Manager at Facebook AI. In our conversation with Jason, we explore their Reinforcement Learning platform, Re-Agent (Horizon). We discuss the role of decision making and game theory in the platform and the types of decisions they’re using Re-Agent to make, from ranking and recommendations to their eCommerce marketplace. Jason also walks us through the differences between online/offline and on/off policy model training, and where Re-Agent sits in this spectrum. Finally, we discuss the concept of counterfactual causality, and how they ensure safety in the results of their models. The complete show notes for this episode can be found at twimlai.com/go/448.

Episoder(781)

Trends in Deep Learning with Jeremy Howard - TWiML Talk #214

Trends in Deep Learning with Jeremy Howard - TWiML Talk #214

In this episode of our AI Rewind series, we’re bringing back one of your favorite guests of the year, Jeremy Howard, founder and researcher at Fast.ai. Jeremy joins us to discuss trends in Deep Learn...

24 Des 20181h 8min

Training Large-Scale Deep Nets with RL with Nando de Freitas - TWiML Talk #213

Training Large-Scale Deep Nets with RL with Nando de Freitas - TWiML Talk #213

Today we close out both our NeurIPS series joined by Nando de Freitas, Team Lead & Principal Scientist at Deepmind. In our conversation, we explore his interest in understanding the brain and working ...

20 Des 201855min

Making Algorithms Trustworthy with David Spiegelhalter - TWiML Talk #212

Making Algorithms Trustworthy with David Spiegelhalter - TWiML Talk #212

Today we’re joined by David Spiegelhalter, Chair of Winton Center for Risk and Evidence Communication at Cambridge University and President of the Royal Statistical Society. David, an invited speaker ...

20 Des 201823min

Designing Computer Systems for Software with Kunle Olukotun - TWiML Talk #211

Designing Computer Systems for Software with Kunle Olukotun - TWiML Talk #211

Today we’re joined by Kunle Olukotun, Professor in the department of EE and CS at Stanford University, and Chief Technologist at Sambanova Systems. Kunle was an invited speaker at NeurIPS this year, p...

18 Des 201855min

Operationalizing Ethical AI with Kathryn Hume - TWiML Talk #210

Operationalizing Ethical AI with Kathryn Hume - TWiML Talk #210

Today we conclude our Trust in AI series with this conversation with Kathryn Hume, VP of Strategy at Integrate AI. We discuss her newly released white paper “Responsible AI in the Consumer Enterprise,...

14 Des 201853min

Approaches to Fairness in Machine Learning with Richard Zemel - TWiML Talk #209

Approaches to Fairness in Machine Learning with Richard Zemel - TWiML Talk #209

Today we continue our exploration of Trust in AI with this interview with Richard Zemel, Professor in the department of Computer Science at the University of Toronto and Research Director at Vector In...

12 Des 201845min

Trust and AI with Parinaz Sobhani - TWiML Talk #208

Trust and AI with Parinaz Sobhani - TWiML Talk #208

In today’s episode we’re joined by Parinaz Sobhani, Director of Machine Learning at Georgian Partners. In our conversation, Parinaz and I discuss some of the main issues falling under the “trust” um...

11 Des 201846min

Unbiased Learning from Biased User Feedback with Thorsten Joachims - TWiML Talk #207

Unbiased Learning from Biased User Feedback with Thorsten Joachims - TWiML Talk #207

In the final episode of our re:Invent series, we're joined by Thorsten Joachims, Professor in the Department of Computer Science at Cornell University. We discuss his presentation “Unbiased Learning f...

7 Des 201840min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
forklart
stopp-verden
i-retten
popradet
lydartikler-fra-aftenposten
rss-gukild-johaug
det-store-bildet
nokon-ma-ga
dine-penger-pengeradet
fotballpodden-2
rss-ness
hanna-de-heldige
aftenbla-bla
frokostshowet-pa-p5
rss-dannet-uten-piano
rss-utenrikskomiteen-med-bogen-og-grasvik
ta-dokumentar