Goodhart's Law in Reinforcement Learning
Data Skeptic, 5 March 2021

Hal Ashton, a PhD student at University College London, joins us today to discuss his recent paper, "Causal Campbell-Goodhart's law and Reinforcement Learning."

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

Paper: "Causal Campbell-Goodhart's law and Reinforcement Learning" by Hal Ashton

Book: "The Book of Why" by Judea Pearl

Thanks to our sponsor!

When your business is ready to make that next hire, find the right person with LinkedIn Jobs. Just visit LinkedIn.com/DATASKEPTIC to post a job for free! Terms and conditions apply.

Episodes (601)

[MINI] Conditional Independence

In statistics, two random variables might depend on one another (for example, interest rates and new home purchases). We call this conditional dependence. An important related concept exists called co...

21 July 2017, 14 min

Estimating Sheep Pain with Facial Recognition

Animals can't tell us when they're experiencing pain, so we have to rely on other cues to help treat their discomfort. But it is often difficult to tell how much an animal is suffering. The sheep, for...

14 July 2017, 27 min

CosmosDB

This episode collects interviews from my recent trip to Microsoft Build where I had the opportunity to speak with Dharma Shukla and Syam Nair about the recently announced CosmosDB. CosmosDB is a globa...

7 July 2017, 33 min

[MINI] The Vanishing Gradient

This episode discusses the vanishing gradient - a problem that arises when training deep neural networks in which nearly all the gradients are very close to zero by the time back-propagation has reach...

30 June 2017, 15 min

Doctor AI

When faced with medical issues, would you want to be seen by a human or a machine? In this episode, guest Edward Choi, co-author of the study titled Doctor AI: Predicting Clinical Events via Recurrent ...

23 June 2017, 41 min

[MINI] Activation Functions

In a neural network, the output value of a neuron is almost always transformed in some way using a function. A trivial choice would be a linear transformation which can only scale the data. However, o...

16 June 2017, 14 min

MS Build 2017

This episode recaps the Microsoft Build Conference. Kyle recently attended and shares some thoughts on cloud, databases, cognitive services, and artificial intelligence. The episode includes intervi...

9 June 2017, 27 min

[MINI] Max-pooling

Max-pooling is a procedure in a neural network which has several benefits. It performs dimensionality reduction by taking a collection of neurons and reducing them to a single value for future layers ...

2 June 2017, 12 min
