AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University to discuss his recent works, AI Agents That Matter and AI Snake Oil. In “AI Agents That Matter”, we explore the range of agentic behaviors, the challenges in benchmarking agents, and the ‘capability and reliability gap’, which creates risks when deploying AI agents in real-world applications. We also discuss the importance of verifiers as a technique for safeguarding agent behavior. We then dig into the AI Snake Oil book, which uncovers examples of problematic and overhyped claims in AI. Arvind shares various use cases of failed applications of AI, outlines a taxonomy of AI risks, and shares his insights on AI’s catastrophic risks. Additionally, we also touched on different approaches to LLM-based reasoning, his views on tech policy and regulation, and his work on CORE-Bench, a benchmark designed to measure AI agents' accuracy in computational reproducibility tasks. The complete show notes for this episode can be found at https://twimlai.com/go/704.

Jaksot(779)

Fighting Fraud with Machine Learning at Shopify with Solmaz Shahalizadeh - TWiML Talk #60

Fighting Fraud with Machine Learning at Shopify with Solmaz Shahalizadeh - TWiML Talk #60

The podcast you’re about to hear is the first of a series of shows recorded at the Georgian Partners Portfolio Conference last week in Toronto. My guest for this show is Solmaz Shahalizadeh, Director ...

30 Loka 201735min

Modeling Human Drivers for Autonomous Vehicles with Katie Driggs-Campbell - TWiML Talk #59

Modeling Human Drivers for Autonomous Vehicles with Katie Driggs-Campbell - TWiML Talk #59

We are back with our third show this week, episode 3 of our Autonomous Vehicles Series. My guest this time is Katie Driggs-Campbell, PostDoc in the Intelligent Systems Lab at Stanford University’s Dep...

27 Loka 201733min

Perception Models for Self-Driving Cars with Jianxiong Xiao - TWiML Talk #58

Perception Models for Self-Driving Cars with Jianxiong Xiao - TWiML Talk #58

We are back with our second show this week, episode 2 of our Autonomous Vehicles Series. This time around we are joined by Jianxiong Xiao of AutoX, a company building computer vision centric solutions...

25 Loka 201741min

Training Data for Autonomous Vehicles - Daryn Nakhuda - TWiML Talk #57

Training Data for Autonomous Vehicles - Daryn Nakhuda - TWiML Talk #57

The episode you are about to hear is the first of a new series of shows on Autonomous Vehicles. We all know that self-driving cars is one of the hottest topics in ML & AI, so we had to dig a little de...

23 Loka 201747min

Human Factors in Machine Intelligence with James Guszcza - TWiML Talk #56

Human Factors in Machine Intelligence with James Guszcza - TWiML Talk #56

As you all know, a few weeks ago, I spent some time in SF at the Artificial Intelligence Conference. I sat down with James Guszcza, US Chief Data Scientist at Deloitte Consulting to talk about human f...

16 Loka 201742min

AI-Powered Conversational Interfaces with Paul Tepper - TWiML Talk #52

AI-Powered Conversational Interfaces with Paul Tepper - TWiML Talk #52

The show you’re about to hear is part of a series of shows recorded in San Francisco at the Artificial Intelligence Conference. My guest for this show is Paul Tepper, worldwide head of cognitive innov...

6 Loka 201736min

ML Use Cases at Think Big Analytics with Mo Patel and Laura Frølich - TWiML Talk #54

ML Use Cases at Think Big Analytics with Mo Patel and Laura Frølich - TWiML Talk #54

The show you’re about to hear is part of a series of shows recorded in San Francisco at the Artificial Intelligence Conference. This time around, I speak with Mo Patel, practice director of AI & deep ...

6 Loka 201745min

Intel Nervana Devcloud with Naveen Rao & Scott Apeland - TWiML Talk #51

Intel Nervana Devcloud with Naveen Rao & Scott Apeland - TWiML Talk #51

In this episode, I talk to Naveen Rao, VP and GM of Intel’s AI Products Group, and Scott Apeland, director of Intel’s Developer Network. It's been a few months since we last spoke to Naveen, so he giv...

6 Loka 201737min

Suosittua kategoriassa Politiikka ja uutiset

aikalisa
tervo-halme
rss-ootsa-kuullut-tasta
ootsa-kuullut-tasta-2
politiikan-puskaradio
rss-vaalirankkurit-podcast
rss-podme-livebox
viisupodi
otetaan-yhdet
et-sa-noin-voi-sanoo-esittaa
rss-tasta-on-kyse-ivan-puopolo-verkkouutiset
io-techin-tekniikkapodcast
linda-maria
rikosmyytit
rss-polikulaari-humanisti-vastaa-ja-muut-ts-podcastit
rss-merja-mahkan-rahat
mtv-uutiset-polloraati
rss-aika-ankkuri
rss-kaikki-uusiksi
rss-raha-talous-ja-politiikka