AI Agents for Data Analysis with Shreya Shankar - #703

AI Agents for Data Analysis with Shreya Shankar - #703

Today, we're joined by Shreya Shankar, a PhD student at UC Berkeley to discuss DocETL, a declarative system for building and optimizing LLM-powered data processing pipelines for large-scale and complex document analysis tasks. We explore how DocETL's optimizer architecture works, the intricacies of building agentic systems for data processing, the current landscape of benchmarks for data processing tasks, how these differ from reasoning-based benchmarks, and the need for robust evaluation methods for human-in-the-loop LLM workflows. Additionally, Shreya shares real-world applications of DocETL, the importance of effective validation prompts, and building robust and fault-tolerant agentic systems. Lastly, we cover the need for benchmarks tailored to LLM-powered data processing tasks and the future directions for DocETL. The complete show notes for this episode can be found at https://twimlai.com/go/703.

Jaksot(781)

Exploring Large Language Models with ChatGPT - #603

Exploring Large Language Models with ChatGPT - #603

Today we're joined by ChatGPT, the latest and coolest large language model developed by OpenAl. In our conversation with ChatGPT, we discuss the background and capabilities of large language models, t...

8 Joulu 202236min

Accelerating Intelligence with AI-Generating Algorithms with Jeff Clune - #602

Accelerating Intelligence with AI-Generating Algorithms with Jeff Clune - #602

Are AI-generating algorithms the path to artificial general intelligence(AGI)?  Today we’re joined by Jeff Clune, an associate professor of computer science at the University of British Columbia, and...

5 Joulu 202256min

Programmatic Labeling and Data Scaling for Autonomous Commercial Aviation with Cedric Cocaud - #601

Programmatic Labeling and Data Scaling for Autonomous Commercial Aviation with Cedric Cocaud - #601

Today we’re joined by Cedric Cocaud, the chief engineer of the Wayfinder Group at Acubed, the innovation center for aircraft manufacturer Airbus. In our conversation with Cedric, we explore some of th...

28 Marras 202254min

Engineering Production NLP Systems at T-Mobile with Heather Nolis - #600

Engineering Production NLP Systems at T-Mobile with Heather Nolis - #600

Today we’re joined by Heather Nolis, a principal machine learning engineer at T-Mobile. In our conversation with Heather, we explored her machine learning journey at T-Mobile, including their initial ...

21 Marras 202243min

Sim2Real and Optimus, the Humanoid Robot with Ken Goldberg - #599

Sim2Real and Optimus, the Humanoid Robot with Ken Goldberg - #599

Today we’re joined by return guest Ken Goldberg, a professor at UC Berkeley and the chief scientist at Ambi Robotics. It’s been a few years since our initial conversation with Ken, so we spent a bit o...

14 Marras 202247min

The Evolution of the NLP Landscape with Oren Etzioni - #598

The Evolution of the NLP Landscape with Oren Etzioni - #598

Today friend of the show and esteemed guest host John Bohannon is back with another great interview, this time around joined by Oren Etzioni, former CEO of the Allen Institute for AI, where he is curr...

7 Marras 202253min

Live from TWIMLcon! The Great MLOps Debate: End-to-End ML Platforms vs Specialized Tools - #597

Live from TWIMLcon! The Great MLOps Debate: End-to-End ML Platforms vs Specialized Tools - #597

Over the last few years, it’s been established that your ML team needs at least some basic tooling in order to be effective, providing support for various aspects of the machine learning workflow, fro...

31 Loka 202247min

Live from TWIMLcon! You're not Facebook. Architecting MLOps for B2B Use Cases with Jacopo Tagliabue - #596

Live from TWIMLcon! You're not Facebook. Architecting MLOps for B2B Use Cases with Jacopo Tagliabue - #596

Much of the way we talk and think about MLOps comes from the perspective of large consumer internet companies like Facebook or Google. If you work at a FAANG company, these approaches might work well ...

24 Loka 202249min

Suosittua kategoriassa Politiikka ja uutiset

uutiscast
aikalisa
rss-ootsa-kuullut-tasta
ootsa-kuullut-tasta-2
politiikan-puskaradio
tervo-halme
rss-vaalirankkurit-podcast
rss-podme-livebox
viisupodi
rss-asiastudio
et-sa-noin-voi-sanoo-esittaa
otetaan-yhdet
rikosmyytit
the-ulkopolitist
radio-antro
rss-kaikki-uusiksi
rss-hyvaa-huomenta-bryssel
rss-raha-talous-ja-politiikka
rss-tasta-on-kyse-ivan-puopolo-verkkouutiset
rss-girls-finish-f1rst