Building the Next Generation of Conversational AI
AI + a16z, 14 March 2025

In this episode of AI + a16z, Sesame Cofounder and CTO Ankit Kumar joins a16z general partner Anjney Midha for a deep dive into the research and engineering behind Sesame's voice technology. They discuss the technical challenges of real-time speech generation, the trade-offs in balancing personality with efficiency, and why the team is open-sourcing key components of its model. Ankit breaks down the complexities of multimodal AI, full-duplex conversation modeling, and the computational optimizations that enable low-latency interactions.

They also explore the evolution of natural language as a user interface and its potential to redefine human-computer interaction.
Plus, we take audience questions on everything from scaling laws in speech synthesis to the role of in-context learning in making AI voices more expressive.

Key Takeaways:
  • How Sesame AI achieves natural voice interactions through real-time speech generation.
  • The impact of open-sourcing their speech model and what it means for AI research.
  • The role of full-duplex modeling in improving AI responsiveness.
  • How computational efficiency and system latency shape AI conversation quality.
  • The growing role of natural language as a user interface in AI-driven experiences.

For anyone interested in AI and voice technology, this episode offers an in-depth look at the latest advancements pushing the boundaries of human-computer interaction.

Learn more:

The Maya + Miles demo

Crossing the uncanny valley of conversational voice

Sesame CSM 1B model

Follow everybody on X:

Ankit Kumar

Anjney Midha

Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.



Episodes (83)

The Best Way to Achieve AGI Is to Invent It

Longtime machine-learning researcher and University of Washington Professor Emeritus Pedro Domingos joins a16z General Partner Martin Casado to discuss the state of artificial intelligence, whether ...

4 November 2024, 38 min

Neural Nets and Nobel Prizes: AI's 40-Year Journey from the Lab to Ubiquity

In this episode of AI + a16z, General Partner Anjney Midha shares his perspective on the recent collection of Nobel Prizes awarded to AI researchers in both Physics and Chemistry. He talks through how...

25 October 2024, 40 min

How GPU Access Helps AI Startups Be Agile

In this episode of AI + a16z, General Partner Anjney Midha explains the forces that lead to GPU shortages and price spikes, and how the firm mitigates these concerns for portfolio companies by supplyi...

23 October 2024, 39 min

DisTrO and the Quest for Community-Trained AI Models

In this episode of AI + a16z, Bowen Peng and Jeffrey Quesnelle of Nous Research join a16z General Partner Anjney Midha to discuss their mission to keep open source AI research alive and activate the c...

27 September 2024, 1 h 12 min

Balancing AI Expertise and Industry Acumen in Vertical Applications

In this episode of AI + a16z, Ambience cofounder and chief scientist Nikhil Buduma joins Derrick Harris to discuss the nuances of using AI models to build vertical applications (including in his space...

13 September 2024, 42 min

AI, SQL, and the End of Big Data

In this episode of AI + a16z, a16z General Partner Jennifer Li joins MotherDuck Cofounder and CEO Jordan Tigani to discuss DuckDB's spiking popularity as the era of big data wanes, as well as the appl...

30 August 2024, 33 min

The Researcher to Founder Journey, and the Power of Open Models

In this episode of the AI + a16z podcast, Black Forest Labs founders Robin Rombach, Andreas Blattmann, and Patrick Esser sit down with a16z general partner Anjney Midha to discuss their journey from P...

16 August 2024, 37 min

Why Computer Science Subsumed Biotech

In this episode, a16z General Partner Vijay Pande walks us through the past two decades of applying software engineering to the life sciences — from the Folding@Home project that he launched, through ...

9 August 2024, 47 min
