Goodhart's Law in Reinforcement Learning
Data Skeptic5 Mar 2021

Goodhart's Law in Reinforcement Learning

Hal Ashton, a PhD student from the University College of London, joins us today to discuss a recent work Causal Campbell-Goodhart's law and Reinforcement Learning.

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

"Causal Campbell-Goodhart's law and Reinforcement Learning"by Hal AshtonBook

"The Book of Why"by Judea PearlPaper

Thanks to our sponsor!

When your business is ready to make that next hire, find the right person with LinkedIn Jobs. Just visit LinkedIn.com/DATASKEPTIC to post a job for free! Terms and conditions apply

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(601)

[MINI] Conditional Independence

[MINI] Conditional Independence

In statistics, two random variables might depend on one another (for example, interest rates and new home purchases). We call this conditional dependence. An important related concept exists called co...

21 Jul 201714min

Estimating Sheep Pain with Facial Recognition

Estimating Sheep Pain with Facial Recognition

Animals can't tell us when they're experiencing pain, so we have to rely on other cues to help treat their discomfort. But it is often difficult to tell how much an animal is suffering. The sheep, for...

14 Jul 201727min

CosmosDB

CosmosDB

This episode collects interviews from my recent trip to Microsoft Build where I had the opportunity to speak with Dharma Shukla and Syam Nair about the recently announced CosmosDB. CosmosDB is a globa...

7 Jul 201733min

[MINI] The Vanishing Gradient

[MINI] The Vanishing Gradient

This episode discusses the vanishing gradient - a problem that arises when training deep neural networks in which nearly all the gradients are very close to zero by the time back-propagation has reach...

30 Jun 201715min

Doctor AI

Doctor AI

hen faced with medical issues, would you want to be seen by a human or a machine? In this episode, guest Edward Choi, co-author of the study titled Doctor AI: Predicting Clinical Events via Recurrent ...

23 Jun 201741min

[MINI] Activation Functions

[MINI] Activation Functions

In a neural network, the output value of a neuron is almost always transformed in some way using a function. A trivial choice would be a linear transformation which can only scale the data. However, o...

16 Jun 201714min

MS Build 2017

MS Build 2017

This episode recaps the Microsoft Build Conference.  Kyle recently attended and shares some thoughts on cloud, databases, cognitive services, and artificial intelligence.  The episode includes intervi...

9 Jun 201727min

[MINI] Max-pooling

[MINI] Max-pooling

Max-pooling is a procedure in a neural network which has several benefits. It performs dimensionality reduction by taking a collection of neurons and reducing them to a single value for future layers ...

2 Jun 201712min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
rss-nysgjerrige-norge
sinnsyn
rekommandert
forskningno
villmarksliv
liberal-halvtime
rss-paradigmepodden
rss-zahid-ali-hjelper-deg
jss
tomprat-med-gunnar-tjomlid
kvinnehelsepodden
nordnorsk-historie
fjellsportpodden
rss-inn-til-kjernen-med-sunniva-rose
rss-rekommandert
tidlose-historier
rss-overskuddsliv
rss-bondevennen