Goodhart's Law in Reinforcement Learning
Data Skeptic5 Mar 2021

Goodhart's Law in Reinforcement Learning

Hal Ashton, a PhD student from the University College of London, joins us today to discuss a recent work Causal Campbell-Goodhart's law and Reinforcement Learning.

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

"Causal Campbell-Goodhart's law and Reinforcement Learning"by Hal AshtonBook

"The Book of Why"by Judea PearlPaper

Thanks to our sponsor!

When your business is ready to make that next hire, find the right person with LinkedIn Jobs. Just visit LinkedIn.com/DATASKEPTIC to post a job for free! Terms and conditions apply

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(601)

Deep Fakes

Deep Fakes

Digital videos can be described as sequences of still images and associated audio. Audio is easy to fake. What about video? A video can easily be broken down into a sequence of still images replayed r...

21 Sep 201830min

Fake News Midterm

Fake News Midterm

In this episode, Kyle reviews what we've learned so far in our series on Fake News and talks briefly about where we're going next.

14 Sep 201819min

Quality Score

Quality Score

Two weeks ago we discussed click through rates or CTRs and their usefulness and limits as a metric. Today, we discuss a related metric known as quality score. While that phrase has probably been used ...

7 Sep 201818min

The Knowledge Illusion

The Knowledge Illusion

Kyle interviews Steven Sloman, Professor in the school of Cognitive, Linguistic, and Psychological Sciences at Brown University. Steven is co-author of The Knowledge Illusion: Why We Never Think Alone...

31 Aug 201840min

Click Through Rates

Click Through Rates

A Click Through Rate (CTR) is the proportion of clicks to impressions of some item of content shared online. This terminology is most commonly used in digital advertising but applies just as well to c...

24 Aug 201831min

Algorithmic Detection of Fake News

Algorithmic Detection of Fake News

The scale and frequency with which information can be distributed on social media makes the problem of fake news a rapidly metastasizing issue. To do any content filtering or labeling demands an algor...

17 Aug 201846min

Ant Intelligence

Ant Intelligence

If you prepared a list of creatures regarded as highly intelligent, it's unlikely ants would make the cut. This is expected, as on an individual level, ants do not generally display behavior that most...

10 Aug 201828min

Human Detection of Fake News

Human Detection of Fake News

With publications such as "Prior exposure increases perceived accuracy of fake news", "Lazy, not biased: Susceptibility to partisan fake news is better explained by lack of reasoning than by motivated...

3 Aug 201828min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
rss-nysgjerrige-norge
forskningno
liberal-halvtime
rekommandert
rss-zahid-ali-hjelper-deg
sinnsyn
villmarksliv
rss-paradigmepodden
jss
tomprat-med-gunnar-tjomlid
fjellsportpodden
tidlose-historier
rss-overskuddsliv
dekodet-2
kvinnehelsepodden
rss-inn-til-kjernen-med-sunniva-rose
diagnose
nevropodden