AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University, to discuss his recent works, AI Agents That Matter and AI Snake Oil. In “AI Agents That Matter”, we explore the range of agentic behaviors, the challenges in benchmarking agents, and the ‘capability and reliability gap’, which creates risks when deploying AI agents in real-world applications. We also discuss the importance of verifiers as a technique for safeguarding agent behavior. We then dig into the AI Snake Oil book, which uncovers examples of problematic and overhyped claims in AI. Arvind shares several examples of failed AI applications, outlines a taxonomy of AI risks, and shares his insights on AI’s catastrophic risks. We also touch on different approaches to LLM-based reasoning, his views on tech policy and regulation, and his work on CORE-Bench, a benchmark designed to measure AI agents' accuracy in computational reproducibility tasks. The complete show notes for this episode can be found at https://twimlai.com/go/704.

Episodes (779)

Knowledge Graphs and Expert Augmentation with Marisa Boston - TWiML Talk #204

Today we’re joined by Marisa Boston, Director of Cognitive Technology in KPMG’s Cognitive Automation Lab. We caught up to discuss some of the ways that KPMG is using AI to build tools that help augmen...

29 November 2018, 46 min

ML/DL for Non-Stationary Time Series Analysis in Financial Markets and Beyond with Stuart Reid - TWiML Talk #203

Today, we’re joined by Stuart Reid, Chief Scientist at NMRQL Research. NMRQL is an investment management firm that uses ML algorithms to make adaptive, unbiased, scalable, and testable trading decisi...

26 November 2018, 58 min

Industrializing Machine Learning at Shell with Daniel Jeavons - TWiML Talk #202

In this episode of our AI Platforms series, we’re joined by Daniel Jeavons, General Manager of Data Science at Shell. In our conversation, we explore the evolution of analytics and data science at Sh...

21 November 2018, 45 min

Resurrecting a Recommendations Platform at Comcast with Leemay Nassery - TWiML Talk #201

In this episode of our AI Platforms series, we’re joined by Leemay Nassery, Senior Engineering Manager and head of the recommendations team at Comcast. In our conversation, Leemay and I discuss just h...

19 November 2018, 47 min

Productive Machine Learning at LinkedIn with Bee-Chung Chen - TWiML Talk #200

In this episode of our AI Platforms series, we’re joined by Bee-Chung Chen, Principal Staff Engineer and Applied Researcher at LinkedIn. Bee-Chung and I caught up to discuss LinkedIn’s internal AI aut...

15 November 2018, 47 min

Scaling Deep Learning on Kubernetes at OpenAI with Christopher Berner - TWiML Talk #199

In this episode of our AI Platforms series, we’re joined by OpenAI’s Head of Infrastructure, Christopher Berner. In our conversation, we discuss the evolution of OpenAI’s deep learning platform, the co...

12 November 2018, 49 min

Bighead: Airbnb's Machine Learning Platform with Atul Kale - TWiML Talk #198

In this episode of our AI Platforms series, we’re joined by Atul Kale, Engineering Manager on the machine learning infrastructure team at Airbnb. In our conversation, we discuss Airbnb’s internal mac...

8 November 2018, 49 min

Facebook's FBLearner Platform with Aditya Kalro - TWiML Talk #197

In the kickoff episode of our AI Platforms series, we’re joined by Aditya Kalro, Engineering Manager at Facebook, to discuss their internal machine learning platform FBLearner Flow. FBLearner Flow is ...

6 November 2018, 38 min
