Goodhart's Law in Reinforcement Learning
Data Skeptic5 Mar 2021

Goodhart's Law in Reinforcement Learning

Hal Ashton, a PhD student from the University College of London, joins us today to discuss a recent work Causal Campbell-Goodhart's law and Reinforcement Learning.

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

"Causal Campbell-Goodhart's law and Reinforcement Learning"by Hal AshtonBook

"The Book of Why"by Judea PearlPaper

Thanks to our sponsor!

When your business is ready to make that next hire, find the right person with LinkedIn Jobs. Just visit LinkedIn.com/DATASKEPTIC to post a job for free! Terms and conditions apply

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(601)

Spam Filtering with Naive Bayes

Spam Filtering with Naive Bayes

Today's spam filters are advanced data driven tools. They rely on a variety of techniques to effectively and often seamlessly filter out junk email from good email. Whitelists, blacklists, traffic ana...

27 Jul 201819min

The Spread of Fake News

The Spread of Fake News

How does fake news get spread online? Its not just a matter of manipulating search algorithms. The social platforms for sharing play a major role in the distribution of fake news. But how significant ...

20 Jul 201845min

Fake News

Fake News

This episode kicks off our new theme of "Fake News" with guests Robert Sheaffer and Brad Schwartz. Fake news is a new label for an old idea. For our purposes, we will define fake news information crea...

13 Jul 201838min

Dev Ops for Data Science

Dev Ops for Data Science

We revisit the 2018 Microsoft Build in this episode, focusing on the latest ideas in DevOps. Kyle interviews Cloud Developer Advocates Damien Brady, Paige Bailey, and Donovan Brown to talk about DevOp...

11 Jul 201838min

First Order Logic

First Order Logic

Logic is a fundamental of mathematical systems. It's roots are the values true and false and it's power is in what it's rules allow you to prove. Prepositional logic provides it's user variables. This...

6 Jul 201816min

Blind Spots in Reinforcement Learning

Blind Spots in Reinforcement Learning

An intelligent agent trained in a simulated environment may be prone to making mistakes in the real world due to discrepancies between the training and real-world conditions. The areas where an agent ...

29 Jun 201827min

Defending Against Adversarial Attacks

Defending Against Adversarial Attacks

In this week's episode, our host Kyle interviews Gokula Krishnan from ETH Zurich, about his recent contributions to defenses against adversarial attacks. The discussion centers around his latest paper...

22 Jun 201831min

Transfer Learning

Transfer Learning

On a long car ride, Linhda and Kyle record a short episode. This discussion is about transfer learning, a technique using in machine learning to leverage training from one domain to have a head start ...

15 Jun 201818min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
rss-nysgjerrige-norge
forskningno
rekommandert
sinnsyn
rss-zahid-ali-hjelper-deg
villmarksliv
rss-paradigmepodden
jss
liberal-halvtime
tomprat-med-gunnar-tjomlid
fjellsportpodden
tidlose-historier
kvinnehelsepodden
nevropodden
rss-overskuddsliv
nordnorsk-historie
dekodet-2
aldring-og-helse-podden