Closing the Loop Between AI Training and Inference with Lin Qiao - #742

Closing the Loop Between AI Training and Inference with Lin Qiao - #742

In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lessons from her time building PyTorch, Lin shares her perspective on the modern generative AI development lifecycle. She explains why aligning training and inference systems is essential for creating a seamless, fast-moving production pipeline, preventing the friction that often stalls deployment. We explore the strategic shift from treating models as commodities to viewing them as core product assets. Lin details how post-training methods, like reinforcement fine-tuning (RFT), allow teams to leverage their own proprietary data to continuously improve these assets. Lin also breaks down the complex challenge of what she calls "3D optimization"—balancing cost, latency, and quality—and emphasizes the role of clear evaluation criteria to guide this process, moving beyond unreliable methods like "vibe checking." Finally, we discuss the path toward the future of AI development: designing a closed-loop system for automated model improvement, a vision made more attainable by the exciting convergence of open and closed-source model capabilities. The complete show notes for this episode can be found at https://twimlai.com/go/742.

Episoder(781)

Supporting Food Security in Africa Using ML with Catherine Nakalembe - #611

Supporting Food Security in Africa Using ML with Catherine Nakalembe - #611

Today we conclude our coverage of the 2022 NeurIPS series joined by Catherine Nakalembe, an associate research professor at the University of Maryland, and Africa Program Director under NASA Harvest. ...

9 Jan 20231h 6min

Service Cards and ML Governance with Michael Kearns - #610

Service Cards and ML Governance with Michael Kearns - #610

Today we conclude our AWS re:Invent 2022 series joined by Michael Kearns, a professor in the department of computer and information science at UPenn, as well as an Amazon Scholar. In our conversation,...

2 Jan 202339min

Reinforcement Learning for Personalization at Spotify with Tony Jebara - #609

Reinforcement Learning for Personalization at Spotify with Tony Jebara - #609

Today we continue our NeurIPS 2022 series joined by Tony Jebara, VP of engineering and head of machine learning at Spotify. In our conversation with Tony, we discuss his role at Spotify and how the co...

29 Des 202241min

Will ChatGPT take my job? - #608

Will ChatGPT take my job? - #608

More than any system before it, ChatGPT has tapped into our enduring fascination with artificial intelligence, raising in a more concrete and present way important questions and fears about what AI is...

26 Des 202237min

Geospatial Machine Learning at AWS with Kumar Chellapilla - #607

Geospatial Machine Learning at AWS with Kumar Chellapilla - #607

Today we continue our re:Invent 2022 series joined by Kumar Chellapilla, a general manager of ML and AI Services at AWS. We had the opportunity to speak with Kumar after announcing their recent additi...

22 Des 202236min

Real-Time ML Workflows at Capital One with Disha Singla - #606

Real-Time ML Workflows at Capital One with Disha Singla - #606

Today we’re joined by Disha Singla, a senior director of machine learning engineering at Capital One. In our conversation with Disha, we explore her role as the leader of the Data Insights team at Cap...

19 Des 202243min

Weakly Supervised Causal Representation Learning with Johann Brehmer - #605

Weakly Supervised Causal Representation Learning with Johann Brehmer - #605

Today we’re excited to kick off our coverage of the 2022 NeurIPS conference with Johann Brehmer, a research scientist at Qualcomm AI Research in Amsterdam. We begin our conversation discussing some of...

15 Des 202246min

Stable Diffusion & Generative AI with Emad Mostaque - #604

Stable Diffusion & Generative AI with Emad Mostaque - #604

Today we’re excited to kick off our 2022 AWS re:Invent series with a conversation with Emad Mostaque, Founder and CEO of Stability.ai. Stability.ai is a very popular name in the generative AI space at...

12 Des 202242min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
forklart
i-retten
popradet
stopp-verden
aftenpodden-usa
lydartikler-fra-aftenposten
rss-gukild-johaug
det-store-bildet
fotballpodden-2
dine-penger-pengeradet
nokon-ma-ga
rss-ness
hanna-de-heldige
aftenbla-bla
frokostshowet-pa-p5
rss-penger-polser-og-politikk
e24-podden
rss-utenrikskomiteen-med-bogen-og-grasvik