Goodhart's Law in Reinforcement Learning
Data Skeptic5 Mar 2021

Goodhart's Law in Reinforcement Learning

Hal Ashton, a PhD student from the University College of London, joins us today to discuss a recent work Causal Campbell-Goodhart's law and Reinforcement Learning.

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

"Causal Campbell-Goodhart's law and Reinforcement Learning"by Hal AshtonBook

"The Book of Why"by Judea PearlPaper

Thanks to our sponsor!

When your business is ready to make that next hire, find the right person with LinkedIn Jobs. Just visit LinkedIn.com/DATASKEPTIC to post a job for free! Terms and conditions apply

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(601)

Shapley Values

Shapley Values

Kyle and Linhda discuss how Shapley Values might be a good tool for determining what makes the cut for a home renovation.

6 Mar 202020min

Anchors as Explanations

Anchors as Explanations

We welcome back Marco Tulio Ribeiro to discuss research he has done since our original discussion on LIME. In particular, we ask the question Are Red Roses Red? and discuss how Anchors provide high pr...

28 Feb 202037min

Mathematical Models of Ecological Systems

Mathematical Models of Ecological Systems

22 Feb 202036min

Adversarial Explanations

Adversarial Explanations

Walt Woods joins us to discuss his paper Adversarial Explanations for Understanding Image Classification Decisions and Improved Neural Network Robustness with co-authors Jack Chen and Christof Teusche...

14 Feb 202036min

ObjectNet

ObjectNet

Andrei Barbu joins us to discuss ObjectNet - a new kind of vision dataset. In contrast to ImageNet, ObjectNet seeks to provide images that are more representative of the types of images an autonomous ...

7 Feb 202038min

Visualization and Interpretability

Visualization and Interpretability

Enrico Bertini joins us to discuss how data visualization can be used to help make machine learning more interpretable and explainable. Find out more about Enrico at http://enrico.bertini.io/. More fr...

31 Jan 202035min

Interpretable One Shot Learning

Interpretable One Shot Learning

We welcome Su Wang back to Data Skeptic to discuss the paper Distributional modeling on a diet: One-shot word learning from text only.

26 Jan 202030min

Fooling Computer Vision

Fooling Computer Vision

Wiebe van Ranst joins us to talk about a project in which specially designed printed images can fool a computer vision system, preventing it from identifying a person.  Their attack targets the popula...

22 Jan 202025min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
rss-nysgjerrige-norge
forskningno
liberal-halvtime
rekommandert
rss-zahid-ali-hjelper-deg
sinnsyn
villmarksliv
rss-paradigmepodden
jss
tomprat-med-gunnar-tjomlid
fjellsportpodden
tidlose-historier
rss-overskuddsliv
dekodet-2
kvinnehelsepodden
rss-inn-til-kjernen-med-sunniva-rose
diagnose
nevropodden