How EleutherAI Trains and Releases LLMs: Interview with Stella Biderman

How EleutherAI Trains and Releases LLMs: Interview with Stella Biderman

On this episode, we’re joined by Stella Biderman, Executive Director at EleutherAI and Lead Scientist - Mathematician at Booz Allen Hamilton.

EleutherAI is a grassroots collective that enables open-source AI research and focuses on the development and interpretability of large language models (LLMs).

We discuss:

- How EleutherAI got its start and where it's headed.

- The similarities and differences between various LLMs.

- How to decide which model to use for your desired outcome.

- The benefits and challenges of reinforcement learning from human feedback.

- Details around pre-training and fine-tuning LLMs.

- Which types of GPUs are best when training LLMs.

- What separates EleutherAI from other companies training LLMs.

- Details around mechanistic interpretability.

- Why understanding what and how LLMs memorize is important.

- The importance of giving researchers and the public access to LLMs.

Stella Biderman - https://www.linkedin.com/in/stellabiderman/

EleutherAI - https://www.linkedin.com/company/eleutherai/

Resources:

- https://www.eleuther.ai/

Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.


#OCR #DeepLearning #AI #Modeling #ML

Jaksot(134)

DeepSeek, Stargate and AI's $600 Billion Question with Sequoia's David Cahn

DeepSeek, Stargate and AI's $600 Billion Question with Sequoia's David Cahn

In this episode of Gradient Dissent, host Lukas Biewald sits down with David Cahn, partner at Sequoia Capital, for a compelling discussion on the dynamic world of AI investments. They dive into recent...

28 Tammi 202558min

Building the future of collaborative AI development with Akshay Agrawal

Building the future of collaborative AI development with Akshay Agrawal

In this episode of Gradient Dissent, Akshay Agrawal, Co-Founder of Marimo, joins host Lukas Biewald to discuss the future of collaborative AI development. They dive into how Marimo is enabling develop...

7 Tammi 202541min

Evaluating LLMs with Chatbot Arena and Joseph E. Gonzalez

Evaluating LLMs with Chatbot Arena and Joseph E. Gonzalez

In this episode of Gradient Dissent, Joseph E. Gonzalez, EECS Professor at UC Berkeley and Co-Founder at RunLLM, joins host Lukas Biewald to explore innovative approaches to evaluating LLMs.They discu...

17 Joulu 202455min

AI’s breakthrough in weather forecasting with Brightband’s Julian Green

AI’s breakthrough in weather forecasting with Brightband’s Julian Green

In this episode of Gradient Dissent, Julian Green, Co-founder & CEO of Brightband, joins host Lukas Biewald to discuss how AI is transforming weather forecasting and climate solutions.They explore Bri...

26 Marras 202449min

What’s the path to AGI? A conversation with Turing Co-founder and CEO Jonathan Siddharth

What’s the path to AGI? A conversation with Turing Co-founder and CEO Jonathan Siddharth

In this episode of Gradient Dissent, Jonathan Siddharth, CEO & Co-Founder of Turing, joins host Lukas Biewald to discuss the path to AGI.They explore how Turing built a "developer cloud" of 3.7 millio...

7 Marras 202454min

Vercel’s CEO & Founder Guillermo Rauch on the impact of AI on Web Development and Front End Engineering

Vercel’s CEO & Founder Guillermo Rauch on the impact of AI on Web Development and Front End Engineering

In this episode of Gradient Dissent, Guillermo Rauch, CEO & Founder of Vercel, joins host Lukas Biewald for a wide ranging discussion on how AI is changing web development and front end engineering. T...

24 Loka 202456min

Snowflake’s CEO Sridhar Ramaswamy on 700+ LLM enterprise use cases

Snowflake’s CEO Sridhar Ramaswamy on 700+ LLM enterprise use cases

In this episode of Gradient Dissent, Snowflake CEO Sridhar Ramaswamy joins host Lukas Biewald to explore how AI is transforming enterprise data strategies.They discuss Sridhar's journey from Google to...

10 Loka 202455min

Elevating ML Infrastructure with Modal Labs CEO Erik Bernhardsson

Elevating ML Infrastructure with Modal Labs CEO Erik Bernhardsson

In this episode of Gradient Dissent, Erik Bernhardsson, CEO & Founder of Modal Labs, joins host Lukas Biewald to discuss the future of machine learning infrastructure. They explore how Modal is enhanc...

26 Syys 202449min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
mimmit-sijoittaa
rss-rahapodi
psykopodiaa-podcast
ostan-asuntoja-podcast
herrasmieshakkerit
rss-seuraava-potilas
rahapuhetta
rss-rahamania
rss-40-ajatusta-aanesta
rss-porssipuhetta
rss-merja-mahkan-rahat
rss-lahtijat
rss-20-30-40-podcast
rss-levosta-kasin-yrittajyys
rss-draivi
rss-ma
raksapodi
rss-laakispodi
rss-paasipodi