How GPU Access Helps AI Startups Be Agile
AI + a16z23 Okt 2024

How GPU Access Helps AI Startups Be Agile

In this episode of AI + a16z, General Partner Anjney Midha explains the forces that lead to GPU shortages and price spikes, and how the firm mitigates these concerns for portfolio companies by supplying them with the GPUs they need through a program called Oxygen. The TL;DR version of the problem is that competition for GPU access favors large incumbents who can afford to outbid startups and commit to long contracts; when startups do buy or rent in bulk, they can be stuck with lots of GPUs and — absent training runs or ample customer demand for inference workloads — nothing to do with them.

Here is an excerpt of Anjney explaining how training versus inference workloads affect what level of resources a company needs at any given time:

"It comes down to whether the customer that's using them . . . has a use that can really optimize the efficiency of those chips. As an example, if you happen to be an image model company or a video model company and you put a long-term contract on H100s this year, and you trained and put out a really good model and a product that a lot of people want to use, even though you're not training on the best and latest cluster next year, that's OK. Because you can essentially swap out your training workloads for your inference workloads on those H100s.

"The H100s are actually incredibly powerful chips that you can run really good inference workloads on. So as long as you have customers who want to run inference of your model on your infrastructure, then you can just redirect that capacity to them and then buy new [Nvidia] Blackwells for your training runs.

"Who it becomes really tricky for is people who bought a bunch, don't have demand from their customers for inference, and therefore are stuck doing training runs on that last-generation hardware. That's a tough place to be."

Learn more:

Navigating the High Cost of GPU Compute

Chasing Silicon: The Race for GPUs

Remaking the UI for AI

Follow on X:

Anjney Midha

Derrick Harris

Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Episoder(90)

Patrick Collison on Stripe’s Early Choices, Smalltalk, and What Comes After Coding

Patrick Collison on Stripe’s Early Choices, Smalltalk, and What Comes After Coding

Michael Truell, CEO of Cursor, sits down with Patrick Collison, CEO of Stripe and an investor in Anysphere, to talk about Collison's history with Smalltalk and Lisp, the MongoDB and Ruby decisions Str...

24 Mar 52min

OpenClaw: Why the Internet Isn't Built for AI Agents

OpenClaw: Why the Internet Isn't Built for AI Agents

Yoko Li, Guido Appenzeller, and Joel de la Garza discuss OpenClaw, the open source personal AI assistant that's forcing a rethink of how identity, permissions, and security work on the internet. They ...

19 Mar 47min

What's Missing Between LLMs and AGI - Vishal Misra & Martin Casado

What's Missing Between LLMs and AGI - Vishal Misra & Martin Casado

Vishal Misra returns to explain his latest research on how LLMs actually work under the hood. He walks through experiments showing that transformers update their predictions in a precise, mathematical...

17 Mar 47min

Replit's CEO on Vibe Coding, Wealth Building, and What Most People Get Wrong About AI

Replit's CEO on Vibe Coding, Wealth Building, and What Most People Get Wrong About AI

Jack Neel speaks with Amjad Masad, CEO at Replit, about how AI is making it easier than ever to build and ship software without a technical background. They discuss Replit's rise from a browser-based ...

10 Mar 1h 39min

Jack Altman & Martin Casado on the Future of VC

Jack Altman & Martin Casado on the Future of VC

Jack Altman sits down with Martin Casado, General Partner at a16z, to unpack the shifting dynamics of venture capital and why media matters more than ever. They cover a16z’s evolution from generalists...

3 Mar 53min

AI’s Capital Flywheel: Models, Money, and the Future of Power

AI’s Capital Flywheel: Models, Money, and the Future of Power

a16z's Martin Casado and Sarah Wang join Latent Space hosts Alessio Fanelli and Swyx to discuss what makes this AI investment cycle unlike anything in the history of venture capital. They cover why th...

24 Feb 57min

Durable Execution and the Infrastructure Powering AI Agents

Durable Execution and the Infrastructure Powering AI Agents

Raghu Raghuram, Managing Partner at a16z, and Sarah Wang, General Partner at a16z, speak with Samar Abbas, CEO of Temporal, about how durable execution became the infrastructure layer behind some of t...

19 Feb 1h 3min

Evals, Feedback Loops, and the Engineering That Makes AI Work

Evals, Feedback Loops, and the Engineering That Makes AI Work

Martin Casado speaks with Ankur Goyal, founder and CEO of Braintrust, about where engineering actually matters in AI and where it doesn't. They cover the open source vs closed source model cycle, why ...

17 Feb 43min

Populært innen Business og økonomi

lydartikler-fra-aftenposten
stopp-verden
dine-penger-pengeradet
e24-podden
rss-penger-polser-og-politikk
rss-borsmorgen-okonominyhetene
livet-pa-veien-med-jan-erik-larssen
finansredaksjonen
pengepodden-2
pengesnakk
utbytte
rss-sunn-okonomi
morgenkaffen-med-finansavisen
stormkast-med-valebrokk-stordalen
liberal-halvtime
tid-er-penger-en-podcast-med-peter-warren
lederpodden
rss-politisk-preik
okonomiamatorene
rss-markedspuls-2