AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University to discuss his recent works, AI Agents That Matter and AI Snake Oil. In “AI Agents That Matter”, we explore the range of agentic behaviors, the challenges in benchmarking agents, and the ‘capability and reliability gap’, which creates risks when deploying AI agents in real-world applications. We also discuss the importance of verifiers as a technique for safeguarding agent behavior. We then dig into the AI Snake Oil book, which uncovers examples of problematic and overhyped claims in AI. Arvind shares various use cases of failed applications of AI, outlines a taxonomy of AI risks, and shares his insights on AI’s catastrophic risks. Additionally, we also touched on different approaches to LLM-based reasoning, his views on tech policy and regulation, and his work on CORE-Bench, a benchmark designed to measure AI agents' accuracy in computational reproducibility tasks. The complete show notes for this episode can be found at https://twimlai.com/go/704.

Jaksot(779)

Geometric Statistics in Machine Learning w/ geomstats with Nina Miolane - TWiML Talk #196

Geometric Statistics in Machine Learning w/ geomstats with Nina Miolane - TWiML Talk #196

In this episode we’re joined by Nina Miolane, researcher and lecturer at Stanford University. Nina and I spoke about her work in the field of geometric statistics in ML, specifically the application o...

1 Marras 201843min

Milestones in Neural Natural Language Processing with Sebastian Ruder - TWiML Talk #195

Milestones in Neural Natural Language Processing with Sebastian Ruder - TWiML Talk #195

In this episode, we’re joined by Sebastian Ruder, PhD student studying NLP at National University of Ireland and Research Scientist at text analysis startup Aylien. We discuss recent milestones in neu...

29 Loka 20181h 1min

Natural Language Processing at StockTwits with Garrett Hoffman - TWiML Talk #194

Natural Language Processing at StockTwits with Garrett Hoffman - TWiML Talk #194

In this episode, we’re joined by Garrett Hoffman, Director of Data Science at Stocktwits. Stocktwits is a social network for the investing community which has its roots in the use of the $cashtag on T...

25 Loka 201850min

Advanced Reinforcement Learning & Data Science for Social Impact with Vukosi Marivate - TWiML Talk #193

Advanced Reinforcement Learning & Data Science for Social Impact with Vukosi Marivate - TWiML Talk #193

In the final episode of our Deep Learning Indaba series, we speak with Vukosi Marivate, Chair of Data Science at the University of Pretoria and a co-organizer of the Indaba. My conversation with Vuko...

23 Loka 201846min

AI Ethics, Strategic Decisioning and Game Theory with Osonde Osoba - TWiML Talk #192

AI Ethics, Strategic Decisioning and Game Theory with Osonde Osoba - TWiML Talk #192

In this episode of our Deep Learning Indaba Series, we’re joined by Osonde Osoba, Engineer at RAND Corporation. Osonde and I spoke on the heels of the Indaba, where he presented on AI Ethics and Poli...

18 Loka 201847min

Acoustic Word Embeddings for Low Resource Speech Processing with Herman Kamper - TWiML Talk #191

Acoustic Word Embeddings for Low Resource Speech Processing with Herman Kamper - TWiML Talk #191

In this episode of our Deep Learning Indaba Series, we’re joined by Herman Kamper, lecturer at Stellenbosch University in SA and a co-organizer of the Indaba. We discuss his work on limited- and zero...

16 Loka 20181h 1min

Learning Representations for Visual Search with Naila Murray - TWiML Talk #190

Learning Representations for Visual Search with Naila Murray - TWiML Talk #190

In this episode of our Deep Learning Indaba series, we’re joined by Naila Murray, Senior Research Scientist and Group Lead in the computer vision group at Naver Labs Europe. Naila presented at the In...

12 Loka 201841min

Evaluating Model Explainability Methods with Sara Hooker - TWiML Talk #189

Evaluating Model Explainability Methods with Sara Hooker - TWiML Talk #189

In this, the first episode of the Deep Learning Indaba series, we’re joined by Sara Hooker, AI Resident at Google Brain. I spoke with Sara in the run-up to the Indaba about her work on interpretabilit...

10 Loka 20181h 3min

Suosittua kategoriassa Politiikka ja uutiset

aikalisa
ootsa-kuullut-tasta-2
rss-ootsa-kuullut-tasta
tervo-halme
politiikan-puskaradio
rss-podme-livebox
et-sa-noin-voi-sanoo-esittaa
viisupodi
otetaan-yhdet
rss-vaalirankkurit-podcast
rss-asiastudio
the-ulkopolitist
radio-antro
io-techin-tekniikkapodcast
linda-maria
rss-mina-ukkola
rss-kaikki-uusiksi
rikosmyytit
rss-kiina-ilmiot
rss-tasta-on-kyse-ivan-puopolo-verkkouutiset