AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University, to discuss his recent works, “AI Agents That Matter” and “AI Snake Oil”. In “AI Agents That Matter”, we explore the range of agentic behaviors, the challenges in benchmarking agents, and the ‘capability and reliability gap’, which creates risks when deploying AI agents in real-world applications. We also discuss the importance of verifiers as a technique for safeguarding agent behavior. We then dig into the “AI Snake Oil” book, which uncovers examples of problematic and overhyped claims in AI. Arvind shares examples of failed applications of AI, outlines a taxonomy of AI risks, and shares his insights on AI’s catastrophic risks. We also touch on different approaches to LLM-based reasoning, his views on tech policy and regulation, and his work on CORE-Bench, a benchmark designed to measure AI agents' accuracy in computational reproducibility tasks. The complete show notes for this episode can be found at https://twimlai.com/go/704.

Episodes (779)

Making Algorithms Trustworthy with David Spiegelhalter - TWiML Talk #212

Today we’re joined by David Spiegelhalter, Chair of Winton Center for Risk and Evidence Communication at Cambridge University and President of the Royal Statistical Society. David, an invited speaker ...

20 Dec 2018, 23 min

Designing Computer Systems for Software with Kunle Olukotun - TWiML Talk #211

Today we’re joined by Kunle Olukotun, Professor in the department of EE and CS at Stanford University, and Chief Technologist at Sambanova Systems. Kunle was an invited speaker at NeurIPS this year, p...

18 Dec 2018, 55 min

Operationalizing Ethical AI with Kathryn Hume - TWiML Talk #210

Today we conclude our Trust in AI series with this conversation with Kathryn Hume, VP of Strategy at Integrate AI. We discuss her newly released white paper “Responsible AI in the Consumer Enterprise,...

14 Dec 2018, 53 min

Approaches to Fairness in Machine Learning with Richard Zemel - TWiML Talk #209

Today we continue our exploration of Trust in AI with this interview with Richard Zemel, Professor in the department of Computer Science at the University of Toronto and Research Director at Vector In...

12 Dec 2018, 45 min

Trust and AI with Parinaz Sobhani - TWiML Talk #208

In today’s episode we’re joined by Parinaz Sobhani, Director of Machine Learning at Georgian Partners. In our conversation, Parinaz and I discuss some of the main issues falling under the “trust” um...

11 Dec 2018, 46 min

Unbiased Learning from Biased User Feedback with Thorsten Joachims - TWiML Talk #207

In the final episode of our re:Invent series, we're joined by Thorsten Joachims, Professor in the Department of Computer Science at Cornell University. We discuss his presentation “Unbiased Learning f...

7 Dec 2018, 40 min

Language Parsing and Character Mining with Jinho Choi - TWiML Talk #206

Today we’re joined by Jinho Choi, assistant professor of computer science at Emory University. Jinho presented at the conference on ELIT, their cloud-based NLP platform. In our conversation, we discu...

5 Dec 2018, 47 min

re:Invent Roundup Roundtable 2018 with Dave McCrory and Val Bercovici - TWiML Talk #205

I’m excited to present our second annual re:Invent Roundtable Roundup. This year I’m joined by Dave McCrory, VP of Software Engineering at Wise.io at GE Digital, and Val Bercovici, Founder and CEO of ...

3 Dec 2018, 1 h 7 min
