How EleutherAI Trains and Releases LLMs: Interview with Stella Biderman

On this episode, we’re joined by Stella Biderman, Executive Director at EleutherAI and Lead Scientist - Mathematician at Booz Allen Hamilton.

EleutherAI is a grassroots collective that enables open-source AI research and focuses on the development and interpretability of large language models (LLMs).

We discuss:

- How EleutherAI got its start and where it's headed.

- The similarities and differences between various LLMs.

- How to decide which model to use for your desired outcome.

- The benefits and challenges of reinforcement learning from human feedback.

- Details around pre-training and fine-tuning LLMs.

- Which types of GPUs are best when training LLMs.

- What separates EleutherAI from other companies training LLMs.

- Details around mechanistic interpretability.

- Why understanding what and how LLMs memorize is important.

- The importance of giving researchers and the public access to LLMs.

Stella Biderman - https://www.linkedin.com/in/stellabiderman/

EleutherAI - https://www.linkedin.com/company/eleutherai/

Resources:

- https://www.eleuther.ai/

Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.


#OCR #DeepLearning #AI #Modeling #ML
