How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning
The a16z Show, 28 Nov 2025

In this episode, a16z GP Martin Casado sits down with Sherwin Wu, Head of Engineering for the OpenAI Platform, to break down how OpenAI organizes its platform across models, pricing, and infrastructure, and how it is shifting from a single general-purpose model to a portfolio of specialized systems, custom fine-tuning options, and node-based agent workflows.

They get into why developers tend to stick with a trusted model family, what builds that trust, and why the industry moved past the idea of one model that can do everything. Sherwin also explains the evolution from prompt engineering to context design and how companies use OpenAI’s fine-tuning and reinforcement fine-tuning (RFT) APIs to shape model behavior with their own data.

Highlights from the conversation include:

• How OpenAI balances a horizontal API platform with vertical products like ChatGPT
• The evolution from Codex to the Composer model
• Why usage-based pricing works and where outcome-based pricing breaks
• What the Harmonic Labs and Rockset acquisitions added to OpenAI’s agent work
• Why the new agent builder is deterministic and node-based rather than free-roaming

Resources:

Follow Sherwin on X: https://x.com/sherwinwu

Follow Martin on X: https://x.com/martin_casado

Stay Updated:

If you enjoyed this episode, be sure to like, subscribe, and share with your friends!

Find a16z on X: https://x.com/a16z

Find a16z on LinkedIn: https://www.linkedin.com/company/a16z

Listen to the a16z Podcast on Spotify: https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX

Listen to the a16z Podcast on Apple Podcasts: https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711

Follow our host: https://x.com/eriktorenberg

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see http://a16z.com/disclosures
