Building the Next Generation of Conversational AI
AI + a16z
14 March 2025

In this episode of AI + a16z, Sesame Cofounder and CTO Ankit Kumar joins a16z general partner Anjney Midha for a deep dive into the research and engineering behind Sesame's voice technology. They discuss the technical challenges of real-time speech generation, the trade-offs in balancing personality with efficiency, and why the team is open-sourcing key components of its model. Ankit breaks down the complexities of multimodal AI, full-duplex conversation modeling, and the computational optimizations that enable low-latency interactions.

They also explore the evolution of natural language as a user interface and its potential to redefine human-computer interaction.
Plus, we take audience questions on everything from scaling laws in speech synthesis to the role of in-context learning in making AI voices more expressive.

Key Takeaways:
  • How Sesame AI achieves natural voice interactions through real-time speech generation.
  • The impact of open-sourcing their speech model and what it means for AI research.
  • The role of full-duplex modeling in improving AI responsiveness.
  • How computational efficiency and system latency shape AI conversation quality.
  • The growing role of natural language as a user interface in AI-driven experiences.

For anyone interested in AI and voice technology, this episode offers an in-depth look at the latest advancements pushing the boundaries of human-computer interaction.

Learn more:

The Maya + Miles demo

Crossing the uncanny valley of conversational voice

Sesame CSM 1B model

Follow everybody on X:

Ankit Kumar

Anjney Midha

Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.



Episodes (93)

Building Production Workflows for AI Applications

In this episode, Inngest cofounder and CEO Tony Holdstock-Brown joins a16z partner Yoko Li, as well as Derrick Harris, to discuss the reality and complexity of running AI agents and other multistep AI...

14 June 2024, 43 min

The Future of Image Models Is Multimodal

In this episode, Ideogram CEO Mohammad Norouzi joins a16z General Partner Jennifer Li, as well as Derrick Harris, to share his story of growing up in Iran, helping build influential text-to-image mode...

7 June 2024, 37 min

ARCHIVE: Open Models (with Arthur Mensch) and Video Models (with Stefano Ermon)

For this holiday weekend (in the United States) episode, we've stitched together two archived episodes from the a16z Podcast, both featuring General Partner Anjney Midha. In the first half, from Decem...

24 May 2024, 1 h 5 min

Open Models and Maturation: Assessing the Generative AI Market

a16z partners Guido Appenzeller and Matt Bornstein join Derrick Harris to discuss the state of the generative AI market, about 18 months after it really kicked into high gear with the release of ChatG...

17 May 2024, 40 min

Security Founders Talk Shop About Generative AI

In this bonus episode, recorded live at our San Francisco office, security-startup founders Dean De Beer (Command Zero), Kevin Tian (Doppel), and Travis McPeak (Resourcely) share their thoughts on gen...

15 May 2024, 22 min

How to Think About Foundation Models for Cybersecurity

In this episode of the AI + a16z podcast, a16z General Partner Zane Lackey and a16z Partner Joel de la Garza sit down with Derrick Harris to discuss how generative AI — LLMs, in particular — and found...

10 May 2024, 37 min

Securing the Software Supply Chain with LLMs

Socket Founder and CEO Feross Aboukhadijeh joins a16z's Joel de la Garza and Derrick Harris to discuss the open-source software supply chain. Feross and Joel share their thoughts and insights on topic...

3 May 2024, 38 min

ARCHIVE: GPT-3 Hype

In this episode, though, we’re traveling back in time to the distant (in AI years, at least) past of 2020. Because amid all the news over the past 18 or so months, it’s easy to forget that generative AI...

1 May 2024, 33 min
