Building the Next Generation of Conversational AI
AI + a16z14 Mar 2025

Building the Next Generation of Conversational AI

In this episode of AI + a16z, Sesame Cofounder and CTO Ankit Kumar joins a16z general partner Anjney Midha for a deep dive into the research and engineering behind their voice technology. They discuss the technical challenges of real-time speech generation, the trade-offs in balancing personality with efficiency, and why the team is open-sourcing key components of their model. Ankit breaks down the complexities of multimodal AI, full-duplex conversation modeling, and the computational optimizations that enable low-latency interactions.

They also explore the evolution of natural language as a user interface and its potential to redefine human-computer interaction.
Plus, we take audience questions on everything from scaling laws in speech synthesis to the role of in-context learning in making AI voices more expressive.

Key Takeaways:
How Sesame AI achieves natural voice interactions through real-time speech generation.

  • The impact of open-sourcing their speech model and what it means for AI research.
  • The role of full-duplex modeling in improving AI responsiveness.
  • How computational efficiency and system latency shape AI conversation quality.
  • The growing role of natural language as a user interface in AI-driven experiences.

For anyone interested in AI and voice technology, this episode offers an in-depth look at the latest advancements pushing the boundaries of human-computer interaction.

Learn more:

The Maya + Miles demo

Crossing the uncanny valley of conversational voice

Sesame CSM 1B model

Follow everybody on X:

Ankit Kumar

Anjney Midha

Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Episoder(89)

Data Management for Enterprise LLMs

Data Management for Enterprise LLMs

In this episode of AI + a16z, Fivetran cofounder and CEO George Fraser and a16z partner Guido Appenzeller discuss how LLMs fit into the data management picture within large enterprises. In order to ta...

7 Feb 202538min

From NLP to LLMs: The Quest for a Reliable Chatbot

From NLP to LLMs: The Quest for a Reliable Chatbot

In this episode of AI + a16z, a16z General Partner Martin Casado and Rasa cofounder and CEO Alan Nichol discuss the past, present, and future of AI agents and chatbots. Alan shares his history working...

10 Jan 202538min

Best of the Year: Building AI Companies

Best of the Year: Building AI Companies

A 2024 highlight reel, featuring founders sharing their insights, advice, and experiences building AI companies — from foundation-model labs to vertical applications. Topics include:Building AI tools ...

27 Des 202446min

Can AI Agents Finally Fix Customer Support?

Can AI Agents Finally Fix Customer Support?

In this episode of the AI + a16z podcast, Decagon cofounder/CEO Jesse Zhang and a16z partner Kimberly Tan discuss how LLMs are reshaping customer support, the strong market demand for AI agents, and h...

18 Des 202444min

REPLAY: Scoping the Enterprise LLM Market

REPLAY: Scoping the Enterprise LLM Market

This is a replay of our first episode from April 12, featuring Databricks VP of AI Naveen Rao and a16z partner Matt Bornstein discussing enterprise LLM adoption, hardware platforms, and what it means ...

30 Nov 202443min

Building Developers Tools, From Docker to Diffusion Models

Building Developers Tools, From Docker to Diffusion Models

In this episode of AI + a16z, Replicate cofounder and CEO Ben Firshman, and a16z partner Matt Bornstein, discuss the art of building products and companies that appeal to software developers. Ben was ...

15 Nov 202441min

The Best Way to Achieve AGI Is to Invent It

The Best Way to Achieve AGI Is to Invent It

Longtime machine-learning researcher, and University of Washington Professor Emeritus, Pedro Domingos joins a16z General Partner Martin Casado to discuss the state of artificial intelligence, whether ...

4 Nov 202438min

Neural Nets and Nobel Prizes: AI's 40-Year Journey from the Lab to Ubiquity

Neural Nets and Nobel Prizes: AI's 40-Year Journey from the Lab to Ubiquity

In this episode of AI + a16z, General Partner Anjney Midha shares his perspective on the recent collection of Nobel Prizes awarded to AI researchers in both Physics and Chemistry. He talks through how...

25 Okt 202440min

Populært innen Business og økonomi

lydartikler-fra-aftenposten
stopp-verden
dine-penger-pengeradet
e24-podden
rss-penger-polser-og-politikk
rss-borsmorgen-okonominyhetene
livet-pa-veien-med-jan-erik-larssen
finansredaksjonen
pengesnakk
pengepodden-2
tid-er-penger-en-podcast-med-peter-warren
utbytte
rss-sunn-okonomi
morgenkaffen-med-finansavisen
liberal-halvtime
stormkast-med-valebrokk-stordalen
lederpodden
rss-markedspuls-2
okonomiamatorene
rss-politisk-preik