Goodhart's Law in Reinforcement Learning
Data Skeptic, 5 March 2021

Hal Ashton, a PhD student at University College London, joins us today to discuss his recent paper, "Causal Campbell-Goodhart's law and Reinforcement Learning".

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

Paper: "Causal Campbell-Goodhart's law and Reinforcement Learning" by Hal Ashton

Book: "The Book of Why" by Judea Pearl

Episodes (601)

MS Connect Conference

Cloud services are now ubiquitous in data science and more broadly in technology as well. This week, I speak to Mark Souza, Tobias Ternström, and Corey Sanders about various aspects of data at scale. ...

9 December 2016, 42 min

Causal Impact

Today's episode is all about Causal Impact, a technique for estimating the impact of a particular event on a time series. We talk to William Martin about his research into the impact releases have on ...

2 December 2016, 34 min

[MINI] The Bootstrap

The Bootstrap is a method of resampling a dataset to possibly refine its accuracy and produce useful metrics on the result. The bootstrap is a useful statistical technique and is leveraged in Bagging...

25 November 2016, 10 min

[MINI] Gini Coefficients

The Gini Coefficient (as it relates to decision trees) is one approach to determining the optimal split to introduce when partitioning your dataset as part of a decision tree. To pick the right feature ...

18 November 2016, 15 min

Unstructured Data for Finance

Financial analysis techniques for studying numeric, well-structured data are very mature. While using unstructured data in finance is not necessarily a new idea, the area is still very greenfield. On ...

11 November 2016, 33 min

[MINI] AdaBoost

AdaBoost is a canonical example of the class of AnyBoost algorithms that create ensembles of weak learners. We discuss how a complex problem like predicting restaurant failure (which is surely caused ...

4 November 2016, 10 min

Stealing Models from the Cloud

Platform as a service is a growing trend in data science where services like fraud analysis and face detection can be provided via APIs. Such services turn the actual model into a black box to the con...

28 October 2016, 37 min

[MINI] Calculating Feature Importance

For machine learning models created with the random forest algorithm, there is no obvious diagnostic to inform you which features are more important in the output of the model. Some straightforward bu...

21 October 2016, 13 min
