AI Agents for Data Analysis with Shreya Shankar - #703

AI Agents for Data Analysis with Shreya Shankar - #703

Today, we're joined by Shreya Shankar, a PhD student at UC Berkeley to discuss DocETL, a declarative system for building and optimizing LLM-powered data processing pipelines for large-scale and complex document analysis tasks. We explore how DocETL's optimizer architecture works, the intricacies of building agentic systems for data processing, the current landscape of benchmarks for data processing tasks, how these differ from reasoning-based benchmarks, and the need for robust evaluation methods for human-in-the-loop LLM workflows. Additionally, Shreya shares real-world applications of DocETL, the importance of effective validation prompts, and building robust and fault-tolerant agentic systems. Lastly, we cover the need for benchmarks tailored to LLM-powered data processing tasks and the future directions for DocETL. The complete show notes for this episode can be found at https://twimlai.com/go/703.

Episoder(781)

Pytorch: Fast Differentiable Dynamic Graphs in Python with Soumith Chintala - TWiML Talk #70

Pytorch: Fast Differentiable Dynamic Graphs in Python with Soumith Chintala - TWiML Talk #70

This week, we’ll be featuring a series of shows recorded from Strange Loop, a great developer-focused conference that takes place every year right in my backyard! The conference is a multi-disciplinar...

21 Nov 201742min

Accessible Machine Learning for the Enterprise Developer with Ryan Sevey & Jason Montgomery

Accessible Machine Learning for the Enterprise Developer with Ryan Sevey & Jason Montgomery

This week, we’ll be featuring a series of shows recorded from Strange Loop, a great developer-focused conference that takes place every year right in my backyard! The conference is a multi-disciplinar...

20 Nov 201745min

Bridging the Gap Between Academic and Industry Careers with Ross Fadely - TWiML Talk #68

Bridging the Gap Between Academic and Industry Careers with Ross Fadely - TWiML Talk #68

We close out our NYU Future Labs AI Summit interview series with Ross Fadely, a New York based AI lead with Insight Data Science. Insight is an interesting company offering a free seven week post-doct...

16 Nov 201719min

The Limitations of Human-in-the-Loop AI with Dennis Mortensen - TWiML Talk #67

The Limitations of Human-in-the-Loop AI with Dennis Mortensen - TWiML Talk #67

We continue our NYU Future Labs AI Summit interview series with Dennis Mortensen, founder and CEO of X.ai, a company whose AI-based personal assistant Amy helps users with scheduling meetings. I caugh...

13 Nov 201735min

Nexus Lab Cohort 2 - Second Mind - TWiML Talk #66

Nexus Lab Cohort 2 - Second Mind - TWiML Talk #66

The podcast you’re about to hear is the fourth of a series of shows recorded at the NYU Future Labs AI Summit last week in New York City. In this show, I speak with Kul Singh, CEO and Founder of Secon...

9 Nov 201721min

Nexus Lab Cohort 2 - Bite.ai - TWiML Talk #65

Nexus Lab Cohort 2 - Bite.ai - TWiML Talk #65

The podcast you’re about to hear is the second of a series of shows recorded at the NYU Future Labs AI Summit last week in New York City.In this episode, you’ll hear from Bite.ai, a startup founded by...

8 Nov 201726min

Nexus Lab Cohort 2 - Bowtie - TWiML Talk #64

Nexus Lab Cohort 2 - Bowtie - TWiML Talk #64

The podcast you’re about to hear is the second of a series of shows recorded at the NYU Future Labs AI Summit last week in New York City. In this episode, I speak with Ron Fisher and Mike Wang, who, a...

7 Nov 201725min

AI Nexus Lab Cohort 2 - Mt. Cleverest - TWiML Talk #63

AI Nexus Lab Cohort 2 - Mt. Cleverest - TWiML Talk #63

The podcast you’re about to hear is the first of a series of shows recorded at the NYU Future Labs AI Summit last week in New York City. My guests this time around are James Villarrubia and Bernie Pra...

6 Nov 201732min

Populært innen Politikk og nyheter

aftenpodden
giver-og-gjengen-vg
lydartikler-fra-aftenposten
forklart
aftenpodden-usa
i-retten
popradet
stopp-verden
det-store-bildet
dine-penger-pengeradet
fotballpodden-2
rss-gukild-johaug
rss-ness
hanna-de-heldige
nokon-ma-ga
aftenbla-bla
e24-podden
bt-dokumentar-2
rss-dannet-uten-piano
frokostshowet-pa-p5