Neural Network Pruning and Training with Jonathan Frankle at MosaicML

Neural Network Pruning and Training with Jonathan Frankle at MosaicML

Jonathan Frankle, Chief Scientist at MosaicML and Assistant Professor of Computer Science at Harvard University, joins us on this episode. With comprehensive infrastructure and software tools, MosaicML aims to help businesses train complex machine-learning models using their own proprietary data.

We discuss:

- Details of Jonathan’s Ph.D. dissertation which explores his “Lottery Ticket Hypothesis.”

- The role of neural network pruning and how it impacts the performance of ML models.

- Why transformers will be the go-to way to train NLP models for the foreseeable future.

- Why the process of speeding up neural net learning is both scientific and artisanal.

- What MosaicML does, and how it approaches working with clients.

- The challenges for developing AGI.

- Details around ML training policy and ethics.

- Why data brings the magic to customized ML models.

- The many use cases for companies looking to build customized AI models.

Jonathan Frankle - https://www.linkedin.com/in/jfrankle/

Resources:

- https://mosaicml.com/

- The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks

Thanks for listening to the Gradient Dissent podcast, brought to you by Weights & Biases. If you enjoyed this episode, please leave a review to help get the word out about the show. And be sure to subscribe so you never miss another insightful conversation.

#OCR #DeepLearning #AI #Modeling #ML

Avsnitt(136)

The rise of AI agents

The rise of AI agents

In this episode of Gradient Dissent, host Lukas Biewald sits down with João Moura, CEO & Founder of CrewAI, one of the leading platforms enabling AI agents for enterprise applications. Joe shares insi...

25 Feb 202549min

R1, OpenAI’s o3, and the ARC-AGI Benchmark: Insights from Mike Knoop

R1, OpenAI’s o3, and the ARC-AGI Benchmark: Insights from Mike Knoop

In this episode of Gradient Dissent, host Lukas Biewald sits down with Mike Knoop, Co-founder and CEO of Ndea, a cutting-edge AI research lab. Mike shares his journey from building Zapier into a major...

4 Feb 20251h 12min

DeepSeek, Stargate and AI's $600 Billion Question with Sequoia's David Cahn

DeepSeek, Stargate and AI's $600 Billion Question with Sequoia's David Cahn

In this episode of Gradient Dissent, host Lukas Biewald sits down with David Cahn, partner at Sequoia Capital, for a compelling discussion on the dynamic world of AI investments. They dive into recent...

28 Jan 202558min

Building the future of collaborative AI development with Akshay Agrawal

Building the future of collaborative AI development with Akshay Agrawal

In this episode of Gradient Dissent, Akshay Agrawal, Co-Founder of Marimo, joins host Lukas Biewald to discuss the future of collaborative AI development. They dive into how Marimo is enabling develop...

7 Jan 202541min

Evaluating LLMs with Chatbot Arena and Joseph E. Gonzalez

Evaluating LLMs with Chatbot Arena and Joseph E. Gonzalez

In this episode of Gradient Dissent, Joseph E. Gonzalez, EECS Professor at UC Berkeley and Co-Founder at RunLLM, joins host Lukas Biewald to explore innovative approaches to evaluating LLMs.They discu...

17 Dec 202455min

AI’s breakthrough in weather forecasting with Brightband’s Julian Green

AI’s breakthrough in weather forecasting with Brightband’s Julian Green

In this episode of Gradient Dissent, Julian Green, Co-founder & CEO of Brightband, joins host Lukas Biewald to discuss how AI is transforming weather forecasting and climate solutions.They explore Bri...

26 Nov 202449min

What’s the path to AGI? A conversation with Turing Co-founder and CEO Jonathan Siddharth

What’s the path to AGI? A conversation with Turing Co-founder and CEO Jonathan Siddharth

In this episode of Gradient Dissent, Jonathan Siddharth, CEO & Co-Founder of Turing, joins host Lukas Biewald to discuss the path to AGI.They explore how Turing built a "developer cloud" of 3.7 millio...

7 Nov 202454min

Vercel’s CEO & Founder Guillermo Rauch on the impact of AI on Web Development and Front End Engineering

Vercel’s CEO & Founder Guillermo Rauch on the impact of AI on Web Development and Front End Engineering

In this episode of Gradient Dissent, Guillermo Rauch, CEO & Founder of Vercel, joins host Lukas Biewald for a wide ranging discussion on how AI is changing web development and front end engineering. T...

24 Okt 202456min

Populärt inom Business & ekonomi

framgangspodden
varvet
rss-jossan-nina
rss-svart-marknad
rss-borsens-finest
svd-tech-brief
badfluence
avanzapodden
uppgang-och-fall
bathina-en-podcast
fill-or-kill
rss-inga-dumma-fragor-om-pengar
lastbilspodden
rss-dagen-med-di
rss-kort-lang-analyspodden-fran-di
tabberaset
bilar-med-sladd
dynastin
24fragor
borsmorgon