Building the Next Generation of Conversational AI
AI + a16z14 Maalis 2025

Building the Next Generation of Conversational AI

In this episode of AI + a16z, Sesame Cofounder and CTO Ankit Kumar joins a16z general partner Anjney Midha for a deep dive into the research and engineering behind their voice technology. They discuss the technical challenges of real-time speech generation, the trade-offs in balancing personality with efficiency, and why the team is open-sourcing key components of their model. Ankit breaks down the complexities of multimodal AI, full-duplex conversation modeling, and the computational optimizations that enable low-latency interactions.

They also explore the evolution of natural language as a user interface and its potential to redefine human-computer interaction.
Plus, we take audience questions on everything from scaling laws in speech synthesis to the role of in-context learning in making AI voices more expressive.

Key Takeaways:
How Sesame AI achieves natural voice interactions through real-time speech generation.

  • The impact of open-sourcing their speech model and what it means for AI research.
  • The role of full-duplex modeling in improving AI responsiveness.
  • How computational efficiency and system latency shape AI conversation quality.
  • The growing role of natural language as a user interface in AI-driven experiences.

For anyone interested in AI and voice technology, this episode offers an in-depth look at the latest advancements pushing the boundaries of human-computer interaction.

Learn more:

The Maya + Miles demo

Crossing the uncanny valley of conversational voice

Sesame CSM 1B model

Follow everybody on X:

Ankit Kumar

Anjney Midha

Check out everything a16z is doing with artificial intelligence here, including articles, projects, and more podcasts.


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(100)

Ideogram’s Open-Weights Image Model and the Future of AI Design

Ideogram’s Open-Weights Image Model and the Future of AI Design

Yoko Li and Justine Moore speak with Ideogram founder and CEO Mohammad Norouzi about image generation models, design workflows, and the evolving relationship between AI and creative work. The conversa...

15 Kesä 42min

Building Search for AI Agents with Exa CEO Will Bryk

Building Search for AI Agents with Exa CEO Will Bryk

Sarah Wang speaks with Exa cofounder and CEO Will Bryk about building search infrastructure for the AI era. The conversation covers Exa’s origins, why traditional search engines were not designed for ...

4 Kesä 49min

AI Agents and the Fight for Customer Data

AI Agents and the Fight for Customer Data

Martin Casado speaks with George Fraser, cofounder and CEO of Fivetran, about the future of data infrastructure in the age of AI. The conversation covers Fivetran’s merger with dbt, the changing role ...

2 Kesä 50min

Ben Horowitz on AI Infrastructure, Economics and The New Laws of Software

Ben Horowitz on AI Infrastructure, Economics and The New Laws of Software

Recorded live at the a16z Fintech Connect conference in Deer Valley, Alex Rampell speaks with Ben Horowitz, cofounder and general partner at a16z, about how AI has rewritten the fundamental rules of s...

19 Touko 29min

AI Infrastructure, Distribution, and the Next Wave of Software

AI Infrastructure, Distribution, and the Next Wave of Software

Sophie Buonassisi speaks with Jennifer Li, general partner at a16z, about why infrastructure is becoming one of the most important areas in AI. They discuss how the shift to AI-native systems is resha...

12 Touko 38min

From Vector Databases to Knowledge Engines: The Next Layer of AI

From Vector Databases to Knowledge Engines: The Next Layer of AI

Peter Levine speaks with Ash Ashutosh, CEO of Pinecone, about the launch of Nexus and the shift from vector databases to knowledge engines. As agents become the primary users of software, they discuss...

5 Touko 46min

Why We Need Continual Learning

Why We Need Continual Learning

Elena Burger speaks with Malika Aubakirova, partner on the AI infrastructure team at a16z, about why today’s AI systems struggle to learn over time. They discuss the limits of in-context learning, the...

28 Huhti 18min

The Agent Era: Building Software Beyond Chat with Box CEO Aaron Levie

The Agent Era: Building Software Beyond Chat with Box CEO Aaron Levie

Erik Torenberg, Steve Sinofsky, and Martin Casado speak to Aaron Levie, CEO at Box, about what happens to enterprise software when agents become the primary users. They discuss why coding agents succe...

21 Huhti 59min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
mimmit-sijoittaa
psykopodiaa-podcast
rss-rahapodi
rss-oivalluksia-rahasta-elamasta
leadcast
herrasmieshakkerit
vapauta-supervoimasi-podcast
rss-rahamania
asuntoasiaa-paivakirjat
rss-viisas-raha-podi
ostan-asuntoja-podcast
hyva-paha-johtaminen
rss-porssipuhetta
rss-kaupan-tila
rss-inderes-femme
rss-set-for-life-sijoita-ja-vaurastu
pomojen-suusta
rss-perho-rajoilla
rss-savessa