Closing the Loop Between AI Training and Inference with Lin Qiao - #742

Closing the Loop Between AI Training and Inference with Lin Qiao - #742

In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lessons from her time building PyTorch, Lin shares her perspective on the modern generative AI development lifecycle. She explains why aligning training and inference systems is essential for creating a seamless, fast-moving production pipeline, preventing the friction that often stalls deployment. We explore the strategic shift from treating models as commodities to viewing them as core product assets. Lin details how post-training methods, like reinforcement fine-tuning (RFT), allow teams to leverage their own proprietary data to continuously improve these assets. Lin also breaks down the complex challenge of what she calls "3D optimization"—balancing cost, latency, and quality—and emphasizes the role of clear evaluation criteria to guide this process, moving beyond unreliable methods like "vibe checking." Finally, we discuss the path toward the future of AI development: designing a closed-loop system for automated model improvement, a vision made more attainable by the exciting convergence of open and closed-source model capabilities. The complete show notes for this episode can be found at https://twimlai.com/go/742.

Episoder(782)

Video as a Universal Interface for AI Reasoning with Sherry Yang - #676

Video as a Universal Interface for AI Reasoning with Sherry Yang - #676

Today we’re joined by Sherry Yang, senior research scientist at Google DeepMind and a PhD student at UC Berkeley. In this interview, we discuss her new paper, "Video as the New Language for Real-World...

18 Mar 202449min

Assessing the Risks of Open AI Models with Sayash Kapoor - #675

Assessing the Risks of Open AI Models with Sayash Kapoor - #675

Today we’re joined by Sayash Kapoor, a Ph.D. student in the Department of Computer Science at Princeton University. Sayash walks us through his paper: "On the Societal Impact of Open Foundation Models...

11 Mar 202440min

OLMo: Everything You Need to Train an Open Source LLM with Akshita Bhagia - #674

OLMo: Everything You Need to Train an Open Source LLM with Akshita Bhagia - #674

Today we’re joined by Akshita Bhagia, a senior research engineer at the Allen Institute for AI. Akshita joins us to discuss OLMo, a new open source language model with 7 billion and 1 billion variants...

4 Mar 202432min

Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski - #673

Training Data Locality and Chain-of-Thought Reasoning in LLMs with Ben Prystawski - #673

Today we’re joined by Ben Prystawski, a PhD student in the Department of Psychology at Stanford University working at the intersection of cognitive science and machine learning. Our conversation cente...

26 Feb 202425min

Reasoning Over Complex Documents with DocLLM with Armineh Nourbakhsh - #672

Reasoning Over Complex Documents with DocLLM with Armineh Nourbakhsh - #672

Today we're joined by Armineh Nourbakhsh of JP Morgan AI Research to discuss the development and capabilities of DocLLM, a layout-aware large language model for multimodal document understanding. Armi...

19 Feb 202445min

Are Emergent Behaviors in LLMs an Illusion? with Sanmi Koyejo - #671

Are Emergent Behaviors in LLMs an Illusion? with Sanmi Koyejo - #671

Today we’re joined by Sanmi Koyejo, assistant professor at Stanford University, to continue our NeurIPS 2024 series. In our conversation, Sanmi discusses his two recent award-winning papers. First, we...

12 Feb 20241h 5min

AI Trends 2024: Reinforcement Learning in the Age of LLMs with Kamyar Azizzadenesheli - #670

AI Trends 2024: Reinforcement Learning in the Age of LLMs with Kamyar Azizzadenesheli - #670

Today we’re joined by Kamyar Azizzadenesheli, a staff researcher at Nvidia, to continue our AI Trends 2024 series. In our conversation, Kamyar updates us on the latest developments in reinforcement le...

5 Feb 20241h 10min

Building and Deploying Real-World RAG Applications with Ram Sriharsha - #669

Building and Deploying Real-World RAG Applications with Ram Sriharsha - #669

Today we’re joined by Ram Sriharsha, VP of engineering at Pinecone. In our conversation, we dive into the topic of vector databases and retrieval augmented generation (RAG). We explore the trade-offs ...

29 Jan 202435min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
lydartikler-fra-aftenposten
forklart
aftenpodden-usa
popradet
stopp-verden
i-retten
dine-penger-pengeradet
fotballpodden-2
det-store-bildet
rss-gukild-johaug
nokon-ma-ga
hanna-de-heldige
rss-ness
aftenbla-bla
e24-podden
rss-dannet-uten-piano
grasoner-den-nye-kalde-krigen
frokostshowet-pa-p5