AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

AI Agents: Substance or Snake Oil with Arvind Narayanan - #704

Today, we're joined by Arvind Narayanan, professor of Computer Science at Princeton University to discuss his recent works, AI Agents That Matter and AI Snake Oil. In “AI Agents That Matter”, we explore the range of agentic behaviors, the challenges in benchmarking agents, and the ‘capability and reliability gap’, which creates risks when deploying AI agents in real-world applications. We also discuss the importance of verifiers as a technique for safeguarding agent behavior. We then dig into the AI Snake Oil book, which uncovers examples of problematic and overhyped claims in AI. Arvind shares various use cases of failed applications of AI, outlines a taxonomy of AI risks, and shares his insights on AI’s catastrophic risks. Additionally, we also touched on different approaches to LLM-based reasoning, his views on tech policy and regulation, and his work on CORE-Bench, a benchmark designed to measure AI agents' accuracy in computational reproducibility tasks. The complete show notes for this episode can be found at https://twimlai.com/go/704.

Episoder(781)

Hyperparameter Optimization through Neural Network Partitioning with Christos Louizos - #627

Hyperparameter Optimization through Neural Network Partitioning with Christos Louizos - #627

Today we kick off our coverage of the 2023 ICLR conference joined by Christos Louizos, an ML researcher at Qualcomm Technologies. In our conversation with Christos, we explore his paper Hyperparameter...

1 Mai 202333min

Are LLMs Overhyped or Underappreciated? with Marti Hearst - #626

Are LLMs Overhyped or Underappreciated? with Marti Hearst - #626

Today we’re joined by Marti Hearst, Professor at UC Berkeley. In our conversation with Marti, we explore the intricacies of AI language models and their usefulness in improving efficiency but also the...

24 Apr 202337min

Are Large Language Models a Path to AGI? with Ben Goertzel - #625

Are Large Language Models a Path to AGI? with Ben Goertzel - #625

Today we’re joined by Ben Goertzel, CEO of SingularityNET. In our conversation with Ben, we explore all things AGI, including the potential scenarios that could arise with the advent of AGI and his pr...

17 Apr 202359min

Open Source Generative AI at Hugging Face with Jeff Boudier - #624

Open Source Generative AI at Hugging Face with Jeff Boudier - #624

Today we’re joined by Jeff Boudier, head of product at Hugging Face 🤗. In our conversation with Jeff, we explore the current landscape of open-source machine learning tools and models, the recent shi...

11 Apr 202333min

Generative AI at the Edge with Vinesh Sukumar - #623

Generative AI at the Edge with Vinesh Sukumar - #623

Today we’re joined by Vinesh Sukumar, a senior director and head of AI/ML product management at Qualcomm Technologies. In our conversation with Vinesh, we explore how mobile and automotive devices hav...

3 Apr 202339min

Runway Gen-2: Generative AI for Video Creation with Anastasis Germanidis - #622

Runway Gen-2: Generative AI for Video Creation with Anastasis Germanidis - #622

Today we’re joined by Anastasis Germanidis, Co-Founder and CTO of RunwayML. Amongst all the product and model releases over the past few months, Runway threw its hat into the ring with Gen-1, a model ...

27 Mar 202349min

Watermarking Large Language Models to Fight Plagiarism with Tom Goldstein - 621

Watermarking Large Language Models to Fight Plagiarism with Tom Goldstein - 621

Today we’re joined by Tom Goldstein, an associate professor at the University of Maryland. Tom’s research sits at the intersection of ML and optimization and has previously been featured in the New Yo...

20 Mar 202351min

Does ChatGPT “Think”? A Cognitive Neuroscience Perspective with Anna Ivanova - #620

Does ChatGPT “Think”? A Cognitive Neuroscience Perspective with Anna Ivanova - #620

Today we’re joined by Anna Ivanova, a postdoctoral researcher at MIT Quest for Intelligence. In our conversation with Anna, we discuss her recent paper Dissociating language and thought in large langu...

13 Mar 202345min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
forklart
aftenpodden-usa
popradet
stopp-verden
lydartikler-fra-aftenposten
i-retten
rss-gukild-johaug
fotballpodden-2
det-store-bildet
dine-penger-pengeradet
nokon-ma-ga
rss-ness
hanna-de-heldige
aftenbla-bla
bt-dokumentar-2
e24-podden
frokostshowet-pa-p5
rss-dannet-uten-piano