DisTrO and the Quest for Community-Trained AI Models
AI + a16z27 Syys 2024

DisTrO and the Quest for Community-Trained AI Models

In this episode of AI + a16z, Bowen Peng and Jeffrey Quesnelle of Nous Research join a16z General Partner Anjney Midha to discuss their mission to keep open source AI research alive and activate the community of independent builders. The focus is on a recent project called DisTrO, which demonstrates it's possible to train AI models across the public internet much faster than previously thought possible. However, Nous is behind a number of other successful open source AI projects, including the popular Hermes family of "neutral" and guardrail-free language models.

Here's an excerpt of Jeffrey explaining how DisTrO was inspired by the possibility that major open source AI providers could turn their efforts back inward:

"What if we don't get Llama 4? That's like an actual existential threat because the closed providers will continue to get better and we would be dead in the water, in a sense.

"So we asked, 'Is there any real reason we can't make Llama 4 ourselves?' And there is a real reason, which is that we don't have 20,000 H100s. . . . God willing and the creek don't rise, maybe we will one day, but we don't have that right now.

"So we said, 'But what do we have?' We have a giant activated community who's passionate about wanting to do this and would be willing to contribute their GPUs, their power, to it, if only they could . . . but we don't have the ability to activate that willingness into actual action. . . . The only way people are connected is over the internet, and so anything that isn't sharing over the internet is not gonna work.

"And so that was the initial premise: What if we don't get Llama 4? And then, what do we have that we could use to create Llama 4? And, if we can't, what are the technical problems that, if only we slayed that one technical problem, the dam of our community can now flow and actually solve the problem?"

Learn more:

DisTrO paper

Nous Research

Nous Research GitHub

Follow everyone on X:

Bowen Peng

Jeffrey Quesnelle

Anjney Midha

Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Jaksot(82)

Augmenting Incident Response with LLMs

Augmenting Incident Response with LLMs

In this episode of the AI + a16z podcast, Command Zero cofounder and CTO Dean de Beer joins a16z's Joel de la Garza and Derrick Harris to discuss the benefits of training large language models on secu...

26 Heinä 202440min

Scaling AI for the Coming Data Deluge

Scaling AI for the Coming Data Deluge

In this episode of the AI + a16z podcast, Anyscale cofounder and CEO Robert Nishihara joins a16z's Jennifer Li and Derrick Harris to discuss the challenges of training and running AI models at scale; ...

19 Heinä 202437min

ARCHIVE: The Dream of AI Is Alive in AlphaGo

ARCHIVE: The Dream of AI Is Alive in AlphaGo

In this archive episode from 2015, a16z's Sonal Chokshi, Frank Chen, and Steven Sinofsky discuss DeepMind's breakthrough AlphaGo system, which mastered the ancient Chinese game Go and introduced the p...

5 Heinä 202433min

Beyond Language: Inside a Hundred-Trillion-Token Video Model

Beyond Language: Inside a Hundred-Trillion-Token Video Model

In this episode of the AI + a16z podcast, Luma Chief Scientist Jiaming Song joins a16z General Partner Anjney MIdha to discuss Jiaming's esteemed career in video models, culminating thus far in Luma's...

3 Heinä 20241h 5min

Developer Tool UX in the Age of Generative AI

Developer Tool UX in the Age of Generative AI

In this episode, design engineer Alasdair Monk joins a16z's Yoko Li and Derrick Harris to discuss how generative AI is changing how developers — and the those building for developers — interact with t...

21 Kesä 202437min

Building Production Workflows for AI Applications

Building Production Workflows for AI Applications

In this episode, Inngest cofounder and CEO Tony Holdstock-Brown joins a16z partner Yoko Li, as well as Derrick Harris, to discuss the reality and complexity of running AI agents and other multistep AI...

14 Kesä 202443min

The Future of Image Models Is Multimodal

The Future of Image Models Is Multimodal

In this episode, Ideogram CEO Mohammad Norouzi joins a16z General Partner Jennifer Li, as well as Derrick Harris, to share his story of growing up in Iran, helping build influential text-to-image mode...

7 Kesä 202437min

ARCHIVE: Open Models (with Arthur Mensch) and Video Models (with Stefano Ermon)

ARCHIVE: Open Models (with Arthur Mensch) and Video Models (with Stefano Ermon)

For this holiday weekend (in the United States) episode, we've stitched together two archived episodes from the a16z Podcast, both featuring General Partner Anjney Midha. In the first half, from Decem...

24 Touko 20241h 5min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
psykopodiaa-podcast
mimmit-sijoittaa
rss-rahapodi
rss-draivi
rss-lahtijat
oppimisen-psykologia
rss-rahamania
rss-porssipuhetta
taloudellinen-mielenrauha
rss-seuraava-potilas
rahapuhetta
rss-h-asselmoilanen
rss-paatos-podcast-suomen-kovimmat-paatoksentekijat-2
rss-paasipodi
rss-inderes
io-techin-tekniikkapodcast
pomojen-suusta
rss-viisas-raha-podi
rss-40-ajatusta-aanesta