Goodhart's Law in Reinforcement Learning
Data Skeptic, 5 March 2021

Hal Ashton, a PhD student at University College London, joins us today to discuss his recent paper, "Causal Campbell-Goodhart's law and Reinforcement Learning."

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

Paper: "Causal Campbell-Goodhart's law and Reinforcement Learning" by Hal Ashton

Book: "The Book of Why" by Judea Pearl

Episodes (601)

Deep Fakes

Digital videos can be described as sequences of still images and associated audio. Audio is easy to fake. What about video? A video can easily be broken down into a sequence of still images replayed r...

21 Sep 2018, 30 min

Fake News Midterm

In this episode, Kyle reviews what we've learned so far in our series on Fake News and talks briefly about where we're going next.

14 Sep 2018, 19 min

Quality Score

Two weeks ago we discussed click-through rates (CTRs) and their usefulness and limits as a metric. Today, we discuss a related metric known as quality score. While that phrase has probably been used ...

7 Sep 2018, 18 min

The Knowledge Illusion

Kyle interviews Steven Sloman, Professor in the school of Cognitive, Linguistic, and Psychological Sciences at Brown University. Steven is co-author of The Knowledge Illusion: Why We Never Think Alone...

31 Aug 2018, 40 min

Click Through Rates

A Click Through Rate (CTR) is the proportion of clicks to impressions of some item of content shared online. This terminology is most commonly used in digital advertising but applies just as well to c...

24 Aug 2018, 31 min

Algorithmic Detection of Fake News

The scale and frequency with which information can be distributed on social media make fake news a rapidly metastasizing problem. To do any content filtering or labeling demands an algor...

17 Aug 2018, 46 min

Ant Intelligence

If you prepared a list of creatures regarded as highly intelligent, it's unlikely ants would make the cut. This is expected, as on an individual level, ants do not generally display behavior that most...

10 Aug 2018, 28 min

Human Detection of Fake News

With publications such as "Prior exposure increases perceived accuracy of fake news", "Lazy, not biased: Susceptibility to partisan fake news is better explained by lack of reasoning than by motivated...

3 Aug 2018, 28 min
