AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University to discuss his recent works, AI Agents That Matter and AI Snake Oil. In “AI Agents That Matter”, we explore the range of agentic behaviors, the challenges in benchmarking agents, and the ‘capability and reliability gap’, which creates risks when deploying AI agents in real-world applications. We also discuss the importance of verifiers as a technique for safeguarding agent behavior. We then dig into the AI Snake Oil book, which uncovers examples of problematic and overhyped claims in AI. Arvind shares various use cases of failed applications of AI, outlines a taxonomy of AI risks, and shares his insights on AI’s catastrophic risks. Additionally, we also touched on different approaches to LLM-based reasoning, his views on tech policy and regulation, and his work on CORE-Bench, a benchmark designed to measure AI agents' accuracy in computational reproducibility tasks. The complete show notes for this episode can be found at https://twimlai.com/go/704.

Episoder(780)

ML for Understanding Satellite Imagery at Scale with Kyle Story - TWiML Talk #173

ML for Understanding Satellite Imagery at Scale with Kyle Story - TWiML Talk #173

Today we’re joined by Kyle Story, computer vision engineer at Descartes Labs. Kyle and I caught up after his recent talk at the Google Cloud Next Conference titled “How Computers See the Earth: A Mac...

16 Aug 201856min

Generating Ground-Level Images From Overhead Imagery Using GANs with Yi Zhu - TWiML Talk #172

Generating Ground-Level Images From Overhead Imagery Using GANs with Yi Zhu - TWiML Talk #172

Today we’re joined by Yi Zhu, a PhD candidate at UC Merced focused on geospatial image analysis. In our conversation, Yi and I take a look at his recent paper “What Is It Like Down There? Generating D...

13 Aug 201838min

Vision Systems for Planetary Landers and Drones with Larry Matthies - TWiML Talk #171

Vision Systems for Planetary Landers and Drones with Larry Matthies - TWiML Talk #171

Today we’re joined by Larry Matthies, Sr. Research Scientist and head of computer vision in the mobility and robotics division at JPL. In our conversation, we discuss two talks he gave at CVPR a few w...

9 Aug 201843min

Learning Semantically Meaningful and Actionable Representations with Ashutosh Saxena - TWiML Talk #170

Learning Semantically Meaningful and Actionable Representations with Ashutosh Saxena - TWiML Talk #170

In this episode i'm joined by Ashutosh Saxena, a veteran of Andrew Ng’s Stanford Machine Learning Group, and co-founder and CEO of Caspar.ai. Ashutosh and I discuss his RoboBrain project, a computatio...

6 Aug 201845min

AI Innovation for Clinical Decision Support with Joe Connor - TWiML Talk #169

AI Innovation for Clinical Decision Support with Joe Connor - TWiML Talk #169

In this episode I speak with Joe Connor, Founder of Experto Crede. In our conversation, we explore his experiences bringing AI powered healthcare projects to market in collaboration with the UK Natio...

2 Aug 201842min

Dynamic Visual Localization and Segmentation with Laura Leal-Taixé -TWiML Talk #168

Dynamic Visual Localization and Segmentation with Laura Leal-Taixé -TWiML Talk #168

In this episode I'm joined by Laura Leal-Taixé, Professor at the Technical University of Munich where she leads the Dynamic Vision and Learning Group. In our conversation, we discuss several of her r...

30 Jul 201844min

Conversational AI for the Intelligent Workplace with Gillian McCann - TWiML Talk #167

Conversational AI for the Intelligent Workplace with Gillian McCann - TWiML Talk #167

In this episode I'm joined by Gillian McCann, Head of Cloud Engineering and AI at Workgrid Software. In our conversation, which focuses on Workgrid’s use of cloud-based AI services, Gillian details so...

26 Jul 201836min

Computer Vision and Intelligent Agents for Wildlife Conservation with Jason Holmberg - TWiML Talk #166

Computer Vision and Intelligent Agents for Wildlife Conservation with Jason Holmberg - TWiML Talk #166

In this episode, I'm joined by Jason Holmberg, Executive Director and Director of Engineering at WildMe. Jason and I discuss Wildme's pair of open source computer vision based conservation projects, W...

22 Jul 201848min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
forklart
stopp-verden
popradet
i-retten
lydartikler-fra-aftenposten
det-store-bildet
rss-gukild-johaug
dine-penger-pengeradet
nokon-ma-ga
fotballpodden-2
rss-ness
hanna-de-heldige
aftenbla-bla
frokostshowet-pa-p5
rss-dannet-uten-piano
rss-penger-polser-og-politikk
unitedno