AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University to discuss his recent works, AI Agents That Matter and AI Snake Oil. In “AI Agents That Matter”, we explore the range of agentic behaviors, the challenges in benchmarking agents, and the ‘capability and reliability gap’, which creates risks when deploying AI agents in real-world applications. We also discuss the importance of verifiers as a technique for safeguarding agent behavior. We then dig into the AI Snake Oil book, which uncovers examples of problematic and overhyped claims in AI. Arvind shares various use cases of failed applications of AI, outlines a taxonomy of AI risks, and shares his insights on AI’s catastrophic risks. Additionally, we also touched on different approaches to LLM-based reasoning, his views on tech policy and regulation, and his work on CORE-Bench, a benchmark designed to measure AI agents' accuracy in computational reproducibility tasks. The complete show notes for this episode can be found at https://twimlai.com/go/704.

Jaksot(779)

Graph Analytic Systems with Zachary Hanif - TWiML Talk #188

Graph Analytic Systems with Zachary Hanif - TWiML Talk #188

In this, the final episode of our Strata Data Conference series, we’re joined by Zachary Hanif, Director of Machine Learning at Capital One’s Center for Machine Learning. We start our discussion wit...

8 Loka 201854min

Diversification in Recommender Systems with Ahsan Ashraf - TWiML Talk #187

Diversification in Recommender Systems with Ahsan Ashraf - TWiML Talk #187

In this episode of our Strata Data conference series, we’re joined by Ahsan Ashraf, data scientist at Pinterest. We discuss his presentation, “Diversification in recommender systems: Using topical var...

4 Loka 201844min

The Fastai v1 Deep Learning Framework with Jeremy Howard - TWiML Talk #186

The Fastai v1 Deep Learning Framework with Jeremy Howard - TWiML Talk #186

In today's episode we're presenting a special conversation with Jeremy Howard, founder and researcher at Fast.ai. This episode is being released today in conjunction with the company’s announcement of...

2 Loka 20181h 11min

Federated ML for Edge Applications with Justin Norman - TWiML Talk #185

Federated ML for Edge Applications with Justin Norman - TWiML Talk #185

In this episode we’re joined by Justin Norman, Director of Research and Data Science Services at Cloudera Fast Forward Labs. In my chat with Justin we start with an update on the company before diving...

27 Syys 201847min

Exploring Dark Energy & Star Formation w/ ML with Viviana Acquaviva - TWiML Talk #184

Exploring Dark Energy & Star Formation w/ ML with Viviana Acquaviva - TWiML Talk #184

In today’s episode of our Strata Data series, we’re joined by Viviana Acquaviva, Associate Professor at City Tech, the New York City College of Technology. In our conversation, we discuss an ongoing p...

26 Syys 201840min

Document Vectors in the Wild with James Dreiss - TWiML Talk #183

Document Vectors in the Wild with James Dreiss - TWiML Talk #183

In this episode of our Strata Data series we’re joined by James Dreiss, Senior Data Scientist at international news syndicate Reuters. James and I sat down to discuss his talk from the conference “Doc...

24 Syys 201840min

Applied Machine Learning for Publishers with Naveed Ahmad - TWiML Talk #182

Applied Machine Learning for Publishers with Naveed Ahmad - TWiML Talk #182

In today’s episode we’re joined by Naveed Ahmad, Senior Director of data engineering and machine learning at Hearst Newspapers. In our conversation, we discuss into the role of ML at Hearst, including...

20 Syys 201839min

Anticipating Superintelligence with Nick Bostrom - TWiML Talk #181

Anticipating Superintelligence with Nick Bostrom - TWiML Talk #181

In this episode, we’re joined by Nick Bostrom, professor at the University of Oxford and head of the Future of Humanity Institute, a multidisciplinary institute focused on answering big-picture questi...

17 Syys 201844min

Suosittua kategoriassa Politiikka ja uutiset

aikalisa
ootsa-kuullut-tasta-2
rss-ootsa-kuullut-tasta
tervo-halme
politiikan-puskaradio
rss-podme-livebox
et-sa-noin-voi-sanoo-esittaa
viisupodi
otetaan-yhdet
rss-vaalirankkurit-podcast
rss-asiastudio
the-ulkopolitist
radio-antro
io-techin-tekniikkapodcast
linda-maria
rss-mina-ukkola
rss-kaikki-uusiksi
rikosmyytit
rss-kiina-ilmiot
rss-tasta-on-kyse-ivan-puopolo-verkkouutiset