How EleutherAI Trains and Releases LLMs: Interview with Stella Biderman

How EleutherAI Trains and Releases LLMs: Interview with Stella Biderman

On this episode, we’re joined by Stella Biderman, Executive Director at EleutherAI and Lead Scientist - Mathematician at Booz Allen Hamilton.

EleutherAI is a grassroots collective that enables open-source AI research and focuses on the development and interpretability of large language models (LLMs).

We discuss:

- How EleutherAI got its start and where it's headed.

- The similarities and differences between various LLMs.

- How to decide which model to use for your desired outcome.

- The benefits and challenges of reinforcement learning from human feedback.

- Details around pre-training and fine-tuning LLMs.

- Which types of GPUs are best when training LLMs.

- What separates EleutherAI from other companies training LLMs.

- Details around mechanistic interpretability.

- Why understanding what and how LLMs memorize is important.

- The importance of giving researchers and the public access to LLMs.

Stella Biderman - https://www.linkedin.com/in/stellabiderman/

EleutherAI - https://www.linkedin.com/company/eleutherai/

Resources:

- https://www.eleuther.ai/

Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.


#OCR #DeepLearning #AI #Modeling #ML

Episoder(134)

DeepSeek, Stargate and AI's $600 Billion Question with Sequoia's David Cahn

DeepSeek, Stargate and AI's $600 Billion Question with Sequoia's David Cahn

In this episode of Gradient Dissent, host Lukas Biewald sits down with David Cahn, partner at Sequoia Capital, for a compelling discussion on the dynamic world of AI investments. They dive into recent...

28 Jan 202558min

Building the future of collaborative AI development with Akshay Agrawal

Building the future of collaborative AI development with Akshay Agrawal

In this episode of Gradient Dissent, Akshay Agrawal, Co-Founder of Marimo, joins host Lukas Biewald to discuss the future of collaborative AI development. They dive into how Marimo is enabling develop...

7 Jan 202541min

Evaluating LLMs with Chatbot Arena and Joseph E. Gonzalez

Evaluating LLMs with Chatbot Arena and Joseph E. Gonzalez

In this episode of Gradient Dissent, Joseph E. Gonzalez, EECS Professor at UC Berkeley and Co-Founder at RunLLM, joins host Lukas Biewald to explore innovative approaches to evaluating LLMs.They discu...

17 Des 202455min

AI’s breakthrough in weather forecasting with Brightband’s Julian Green

AI’s breakthrough in weather forecasting with Brightband’s Julian Green

In this episode of Gradient Dissent, Julian Green, Co-founder & CEO of Brightband, joins host Lukas Biewald to discuss how AI is transforming weather forecasting and climate solutions.They explore Bri...

26 Nov 202449min

What’s the path to AGI? A conversation with Turing Co-founder and CEO Jonathan Siddharth

What’s the path to AGI? A conversation with Turing Co-founder and CEO Jonathan Siddharth

In this episode of Gradient Dissent, Jonathan Siddharth, CEO & Co-Founder of Turing, joins host Lukas Biewald to discuss the path to AGI.They explore how Turing built a "developer cloud" of 3.7 millio...

7 Nov 202454min

Vercel’s CEO & Founder Guillermo Rauch on the impact of AI on Web Development and Front End Engineering

Vercel’s CEO & Founder Guillermo Rauch on the impact of AI on Web Development and Front End Engineering

In this episode of Gradient Dissent, Guillermo Rauch, CEO & Founder of Vercel, joins host Lukas Biewald for a wide ranging discussion on how AI is changing web development and front end engineering. T...

24 Okt 202456min

Snowflake’s CEO Sridhar Ramaswamy on 700+ LLM enterprise use cases

Snowflake’s CEO Sridhar Ramaswamy on 700+ LLM enterprise use cases

In this episode of Gradient Dissent, Snowflake CEO Sridhar Ramaswamy joins host Lukas Biewald to explore how AI is transforming enterprise data strategies.They discuss Sridhar's journey from Google to...

10 Okt 202455min

Elevating ML Infrastructure with Modal Labs CEO Erik Bernhardsson

Elevating ML Infrastructure with Modal Labs CEO Erik Bernhardsson

In this episode of Gradient Dissent, Erik Bernhardsson, CEO & Founder of Modal Labs, joins host Lukas Biewald to discuss the future of machine learning infrastructure. They explore how Modal is enhanc...

26 Sep 202449min

Populært innen Business og økonomi

lydartikler-fra-aftenposten
stopp-verden
dine-penger-pengeradet
e24-podden
rss-penger-polser-og-politikk
rss-borsmorgen-okonominyhetene
pengepodden-2
pengesnakk
livet-pa-veien-med-jan-erik-larssen
finansredaksjonen
tid-er-penger-en-podcast-med-peter-warren
utbytte
stormkast-med-valebrokk-stordalen
morgenkaffen-med-finansavisen
rss-politisk-preik
liberal-halvtime
rss-markedspuls-2
rss-sunn-okonomi
lederpodden
rss-pa-konto