Goodhart's Law in Reinforcement Learning
Data Skeptic5 Maalis 2021

Goodhart's Law in Reinforcement Learning

Hal Ashton, a PhD student from the University College of London, joins us today to discuss a recent work Causal Campbell-Goodhart's law and Reinforcement Learning.

"Only buy honey from a local producer." - Hal Ashton

Works Mentioned:

"Causal Campbell-Goodhart's law and Reinforcement Learning"by Hal AshtonBook

"The Book of Why"by Judea PearlPaper

Thanks to our sponsor!

When your business is ready to make that next hire, find the right person with LinkedIn Jobs. Just visit LinkedIn.com/DATASKEPTIC to post a job for free! Terms and conditions apply

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(601)

I LLM and You Can Too

I LLM and You Can Too

It took a massive financial investment for the first large language models (LLMs) to be created.  Did their corporate backers lock these tools away for all but the richest?  No.  They provided comodit...

23 Joulu 202323min

Q&A with Kyle

Q&A with Kyle

We celebrate episode 1000000000 with some Q&A from host Kyle Polich.  We boil this episode down to four key questions: 1) How do you find guests 2) What is Data Skeptic all about? 3) What is Kyle all ...

19 Joulu 202340min

LLMs for Data Analysis

LLMs for Data Analysis

In this episode, we are joined by Amir Netz, a Technical Fellow at Microsoft and the CTO of Microsoft Fabric. He discusses how companies can use Microsoft's latest tools for business intelligence. Ami...

12 Joulu 202329min

AI Platforms

AI Platforms

Our guest today is Eric Boyd, the Corporate Vice President of AI at Microsoft. Eric joins us to share how organizations can leverage AI for faster development. Eric shared the benefits of using natura...

4 Joulu 202333min

Deploying LLMs

Deploying LLMs

We are excited to be joined by Aaron Reich and Priyanka Shah. Aaron is the CTO at Avanade, while Priyanka leads their AI/IoT offering for the SEA Region. Priyanka is also the MVP for Microsoft AI. The...

27 Marras 202335min

A Survey Assessing Github Copilot

A Survey Assessing Github Copilot

In this episode, we are joined by Jenny Liang, a PhD student at Carnegie Mellon University, where she studies the usability of code generation tools. She discusses her recent survey on the usability o...

20 Marras 202326min

Program Aided Language Models

Program Aided Language Models

We are joined by Aman Madaan and Shuyan Zhou. They are both PhD students at the Language Technology Institute at Carnegie Mellon University. They join us to discuss their latest published paper, PAL: ...

13 Marras 202332min

Which Programming Language is ChatGPT Best At

Which Programming Language is ChatGPT Best At

In this episode, we have Alessio Buscemi, a software engineer at Lifeware SA. Alessio was a post-doctoral researcher at the University of Luxembourg. He joins us to discuss his paper, A Comparative St...

6 Marras 202340min

Suosittua kategoriassa Tiede

rss-poliisin-mieli
tiedekulma-podcast
rss-mita-tulisi-tietaa
docemilia
filocast-filosofian-perusteet
menologeja-tutkimusmatka-vaihdevuosiin
rss-duodecim-lehti
sotataidon-ytimessa
rss-tiedetta-vai-tarinaa
rss-lapsuuden-rakentajat-podcast
utelias-mieli
radio-antro
rss-bios-podcast
rss-ranskaa-raakana
rss-metsantuntijat-podcast
rss-luontopodi-samuel-glassar-tutkii-luonnon-ihmeita
rss-lihavuudesta-podcast
rss-sosiopodi