Goodhart's Law in Reinforcement Learning
Data Skeptic5 Maalis 2021

Goodhart's Law in Reinforcement Learning

Hal Ashton, a PhD student from the University College of London, joins us today to discuss a recent work Causal Campbell-Goodhart's law and Reinforcement Learning.

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

"Causal Campbell-Goodhart's law and Reinforcement Learning"by Hal AshtonBook

"The Book of Why"by Judea PearlPaper

Thanks to our sponsor!

When your business is ready to make that next hire, find the right person with LinkedIn Jobs. Just visit LinkedIn.com/DATASKEPTIC to post a job for free! Terms and conditions apply

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(601)

Recommender Systems Live from FARCON 2017

Recommender Systems Live from FARCON 2017

Recommender systems play an important role in providing personalized content to online users. Yet, typical data mining techniques are not well suited for the unique challenges that recommender systems...

15 Syys 201746min

[MINI] Long Short Term Memory

[MINI] Long Short Term Memory

Thanks to our sponsor brilliant.org/dataskeptics A Long Short Term Memory (LSTM) is a neural unit, often used in Recurrent Neural Network (RNN) which attempts to provide the network the capacity to st...

8 Syys 201715min

Zillow Zestimate

Zillow Zestimate

Zillow is a leading real estate information and home-related marketplace. We interviewed Andrew Martin, a data science Research Manager at Zillow, to learn more about how Zillow uses data science and ...

1 Syys 201737min

Cardiologist Level Arrhythmia Detection with CNNs

Cardiologist Level Arrhythmia Detection with CNNs

Our guest Pranav Rajpurkar and his coauthored recently published Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks, a paper in which they demonstrate the use of Convolutional ...

25 Elo 201732min

[MINI] Recurrent Neural Networks

[MINI] Recurrent Neural Networks

RNNs are a class of deep learning models designed to capture sequential behavior.  An RNN trains a set of weights which depend not just on new input but also on the previous state of the neural networ...

18 Elo 201717min

Project Common Voice

Project Common Voice

Thanks to our sponsor Springboard. In this week's episode, guest Andre Natal from Mozilla joins our host, Kyle Polich, to discuss a couple exciting new developments in open source speech recognition s...

11 Elo 201731min

[MINI] Bayesian Belief Networks

[MINI] Bayesian Belief Networks

A Bayesian Belief Network is an acyclic directed graph composed of nodes that represent random variables and edges that imply a conditional dependence between them. It's an intuitive way of encoding y...

4 Elo 201717min

pix2code

pix2code

In this episode, Tony Beltramelli of UIzard Technologies joins our host, Kyle Polich, to talk about the ideas behind his latest app that can transform graphic design into functioning code, as well as ...

28 Heinä 201726min

Suosittua kategoriassa Tiede

rss-mita-tulisi-tietaa
rss-poliisin-mieli
tiedekulma-podcast
utelias-mieli
docemilia
rss-duodecim-lehti
rss-tiedetta-vai-tarinaa
rss-totuuden-liepeilla
university-of-eastern-finland
filocast-filosofian-perusteet
rss-duokkari-ekstra
rss-laakaripodi
rss-ylistys-elaimille
rss-lapsuuden-rakentajat-podcast