Goodhart's Law in Reinforcement Learning
Data Skeptic5 Mar 2021

Goodhart's Law in Reinforcement Learning

Hal Ashton, a PhD student from the University College of London, joins us today to discuss a recent work Causal Campbell-Goodhart's law and Reinforcement Learning.

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

"Causal Campbell-Goodhart's law and Reinforcement Learning"by Hal AshtonBook

"The Book of Why"by Judea PearlPaper

Thanks to our sponsor!

When your business is ready to make that next hire, find the right person with LinkedIn Jobs. Just visit LinkedIn.com/DATASKEPTIC to post a job for free! Terms and conditions apply

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(601)

Data Provenance and Reproducibility with Pachyderm

Data Provenance and Reproducibility with Pachyderm

Versioning isn't just for source code. Being able to track changes to data is critical for answering questions about data provenance, quality, and reproducibility. Daniel Whitenack joins me this week ...

3 Feb 201740min

[MINI] Logistic Regression on Audio Data

[MINI] Logistic Regression on Audio Data

Logistic Regression is a popular classification algorithm. In this episode, we discuss how it can be used to determine if an audio clip represents one of two given speakers. It assumes an output varia...

27 Jan 201720min

Studying Competition and Gender Through Chess

Studying Competition and Gender Through Chess

Prior work has shown that people's response to competition is in part predicted by their gender. Understanding why and when this occurs is important in areas such as labor market outcomes. A well stru...

20 Jan 201734min

[MINI] Dropout

[MINI] Dropout

Deep learning can be prone to overfit a given problem. This is especially frustrating given how much time and computational resources are often required to converge. One technique for fighting overfit...

13 Jan 201715min

The Police Data and the Data Driven Justice Initiatives

The Police Data and the Data Driven Justice Initiatives

In this episode I speak with Clarence Wardell and Kelly Jin about their mutual service as part of the White House's Police Data Initiative and Data Driven Justice Initiative respectively. The Police D...

6 Jan 201749min

The Library Problem

The Library Problem

We close out 2016 with a discussion of a basic interview question which might get asked when applying for a data science job. Specifically, how a library might build a model to predict if a book will ...

30 Des 201635min

2016 Holiday Special

2016 Holiday Special

Today's episode is a reading of Isaac Asimov's Franchise.  As mentioned on the show, this is just a work of fiction to be enjoyed and not in any way some obfuscated political statement.  Enjoy, and h...

23 Des 201639min

[MINI] Entropy

[MINI] Entropy

Classically, entropy is a measure of disorder in a system. From a statistical perspective, it is more useful to say it's a measure of the unpredictability of the system. In this episode we discuss how...

16 Des 201616min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
rss-nysgjerrige-norge
sinnsyn
rekommandert
forskningno
villmarksliv
liberal-halvtime
rss-paradigmepodden
rss-zahid-ali-hjelper-deg
jss
tomprat-med-gunnar-tjomlid
kvinnehelsepodden
nordnorsk-historie
fjellsportpodden
rss-inn-til-kjernen-med-sunniva-rose
rss-rekommandert
tidlose-historier
rss-overskuddsliv
rss-bondevennen