How EleutherAI Trains and Releases LLMs: Interview with Stella Biderman

On this episode, we’re joined by Stella Biderman, Executive Director at EleutherAI and Lead Scientist - Mathematician at Booz Allen Hamilton.

EleutherAI is a grassroots collective that enables open-source AI research and focuses on the development and interpretability of large language models (LLMs).

We discuss:

- How EleutherAI got its start and where it's headed.

- The similarities and differences between various LLMs.

- How to decide which model to use for your desired outcome.

- The benefits and challenges of reinforcement learning from human feedback.

- Details around pre-training and fine-tuning LLMs.

- Which types of GPUs are best when training LLMs.

- What separates EleutherAI from other companies training LLMs.

- Details around mechanistic interpretability.

- Why understanding what and how LLMs memorize is important.

- The importance of giving researchers and the public access to LLMs.

Stella Biderman - https://www.linkedin.com/in/stellabiderman/

EleutherAI - https://www.linkedin.com/company/eleutherai/

Resources:

- https://www.eleuther.ai/

Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.


#OCR #DeepLearning #AI #Modeling #ML
