Building the Next Generation of Conversational AI
AI + a16z14 Mars 2025

Building the Next Generation of Conversational AI

In this episode of AI + a16z, Sesame Cofounder and CTO Ankit Kumar joins a16z general partner Anjney Midha for a deep dive into the research and engineering behind their voice technology. They discuss the technical challenges of real-time speech generation, the trade-offs in balancing personality with efficiency, and why the team is open-sourcing key components of their model. Ankit breaks down the complexities of multimodal AI, full-duplex conversation modeling, and the computational optimizations that enable low-latency interactions.

They also explore the evolution of natural language as a user interface and its potential to redefine human-computer interaction.
Plus, we take audience questions on everything from scaling laws in speech synthesis to the role of in-context learning in making AI voices more expressive.

Key Takeaways:
How Sesame AI achieves natural voice interactions through real-time speech generation.

  • The impact of open-sourcing their speech model and what it means for AI research.
  • The role of full-duplex modeling in improving AI responsiveness.
  • How computational efficiency and system latency shape AI conversation quality.
  • The growing role of natural language as a user interface in AI-driven experiences.

For anyone interested in AI and voice technology, this episode offers an in-depth look at the latest advancements pushing the boundaries of human-computer interaction.

Learn more:

The Maya + Miles demo

Crossing the uncanny valley of conversational voice

Sesame CSM 1B model

Follow everybody on X:

Ankit Kumar

Anjney Midha

Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(100)

Ideogram’s Open-Weights Image Model and the Future of AI Design

Ideogram’s Open-Weights Image Model and the Future of AI Design

Yoko Li and Justine Moore speak with Ideogram founder and CEO Mohammad Norouzi about image generation models, design workflows, and the evolving relationship between AI and creative work. The conversa...

15 Juni 42min

Building Search for AI Agents with Exa CEO Will Bryk

Building Search for AI Agents with Exa CEO Will Bryk

Sarah Wang speaks with Exa cofounder and CEO Will Bryk about building search infrastructure for the AI era. The conversation covers Exa’s origins, why traditional search engines were not designed for ...

4 Juni 49min

AI Agents and the Fight for Customer Data

AI Agents and the Fight for Customer Data

Martin Casado speaks with George Fraser, cofounder and CEO of Fivetran, about the future of data infrastructure in the age of AI. The conversation covers Fivetran’s merger with dbt, the changing role ...

2 Juni 50min

Ben Horowitz on AI Infrastructure, Economics and The New Laws of Software

Ben Horowitz on AI Infrastructure, Economics and The New Laws of Software

Recorded live at the a16z Fintech Connect conference in Deer Valley, Alex Rampell speaks with Ben Horowitz, cofounder and general partner at a16z, about how AI has rewritten the fundamental rules of s...

19 Maj 29min

AI Infrastructure, Distribution, and the Next Wave of Software

AI Infrastructure, Distribution, and the Next Wave of Software

Sophie Buonassisi speaks with Jennifer Li, general partner at a16z, about why infrastructure is becoming one of the most important areas in AI. They discuss how the shift to AI-native systems is resha...

12 Maj 38min

From Vector Databases to Knowledge Engines: The Next Layer of AI

From Vector Databases to Knowledge Engines: The Next Layer of AI

Peter Levine speaks with Ash Ashutosh, CEO of Pinecone, about the launch of Nexus and the shift from vector databases to knowledge engines. As agents become the primary users of software, they discuss...

5 Maj 46min

Why We Need Continual Learning

Why We Need Continual Learning

Elena Burger speaks with Malika Aubakirova, partner on the AI infrastructure team at a16z, about why today’s AI systems struggle to learn over time. They discuss the limits of in-context learning, the...

28 Apr 18min

The Agent Era: Building Software Beyond Chat with Box CEO Aaron Levie

The Agent Era: Building Software Beyond Chat with Box CEO Aaron Levie

Erik Torenberg, Steve Sinofsky, and Martin Casado speak to Aaron Levie, CEO at Box, about what happens to enterprise software when agents become the primary users. They discuss why coding agents succe...

21 Apr 59min

Populärt inom Business & ekonomi

badfluence
framgangspodden
varvet
uppgang-och-fall
rss-borsens-finest
avanzapodden
24fragor
dynastin
svd-tech-brief
lastbilspodden
bathina-en-podcast
rss-dagen-med-di
rss-inga-dumma-fragor-om-pengar
fill-or-kill
tabberaset
kapitalet-en-podd-om-ekonomi
borsmorgon
rss-kort-lang-analyspodden-fran-di
rikatillsammans-om-privatekonomi-rikedom-i-livet
bilar-med-sladd