Beyond Transcripts:  Language Nuances and Audio Signals with Carter Huffman of Modulate
How Many CTOs17 Maalis

Beyond Transcripts: Language Nuances and Audio Signals with Carter Huffman of Modulate

In this episode of "How Many CTOs Does It Take?" podcast, hosts Scott Porad and Brad Hefta-Gaub talk with Carter Huffman, CTO and co-founder of Modulate AI, about his path from machine learning work at NASA's Jet Propulsion Lab to building voice AI that understands conversations. Carter explains why moderation in gaming is hard because you don't want to ban players unfairly, and contrasts big foundation models with orchestrated ensembles of many tiny models that require high-quality, globally vetted data labeling. They discuss the nuance of classifying hate speech, expansion into detecting fraud and manipulation in delivery and call-center contexts, and monitoring misbehaving AI voice agents. The conversation covers why conversation is more than transcripts, possible therapeutic/telehealth uses of Modulate, analyzing data at a massive scale, and ambitions for audio generation using hierarchical edge-and-cloud approaches. The episode ends with a humorous anecdote about two factor authenticaiton failure. 00:00 Podcast Cold Open 00:48 Meet Carter Huffman 02:06 JPL Spacecraft Autonomy 04:18 From JPL to Audio AI 06:18 Why Audio Is Hard 07:44 Voice AI Use Cases 12:49 Tiny Models Orchestration 15:56 Data Labeling at Scale 17:17 Defining Toxic Behavior 18:58 Nuanced Language Moderation 20:04 Scaling Ensemble Models 21:39 GPU Crunch During Launch 22:29 Beyond Gaming Use Cases 26:03 AI Agents Gone Wrong 28:45 Telehealth and Diagnostics 30:26 Ambient Audio and Privacy 32:26 Edge Ensembles Everywhere 33:25 Audio Synthesis Ambitions 35:24 Latency Hierarchies Explained 38:10 Two Factor Key Fob Fiasco 39:14 Wrap Up and Credits

Resources:

#TechPodcast #EngineeringPodcast #DevTalks #PodcastForDevs #HowManyCTOs #Podcast #CTOs #CTOPodcast #ChiefTechnologyOfficer #Technology #Engineering #SoftwareDevelopment #SoftwareEngineering #TechLeadership #EngineeringLeadership #EngineeringCulture #TechDebates #AI #VoiceTech #MachineLearning #MachineLearningModels #GamingIndustry #AIinnovation #Entrepreneurship #AIConversation #VoiceAssistant #LanguageModeration #GPU #LLMs #LargeLanguageModels

Jaksot(63)

Engaging Employees in Security Appreciation with Robert Siciliano

Engaging Employees in Security Appreciation with Robert Siciliano

In this episode of "How Many CTOs Does It Take?" podcast, host Brad Hefta-Gaub welcomes Boston-raised security speaker Robert Siciliano, who traces his path into security from early experiences with c...

31 Maalis 57min

Building Trust with AI: David Espindola on the Path Forward

Building Trust with AI: David Espindola on the Path Forward

In this episode of "How Many CTOs Does It Take?" podcast, Scott Porad hosts solo and interviews technologist David Espindola about AI. Espindola explains his path from engineer at fast-growing Silicon...

24 Maalis 40min

Introducing the ADLC: The Agent Development Life Cycle

Introducing the ADLC: The Agent Development Life Cycle

In this episode of "How Many CTOs Does It Take?" podcast, hosts Scott Porad and Brad Hefta-Gaub" open with Super Bowl reactions and a meme about non-fans describing plays, then pivot to the ai.com hal...

10 Maalis 44min

The Evolving Role of Tech Leadership with Philip Rosedale

The Evolving Role of Tech Leadership with Philip Rosedale

In this episode of "How Many CTOs Does It Take?" podcast, Brad Hefta-Gaub is joined by guest co-host Philip Rosedale to explore the multifaceted role of a CTO, comparing it with the CEO position. They...

3 Maalis 53min

Predictions and Reflections: One Year Anniversary of the How Many CTOs Does It Take? Podcast

Predictions and Reflections: One Year Anniversary of the How Many CTOs Does It Take? Podcast

In this episode of "How Many CTOs Does It Take?" podcast, hosts Scott Porad and Brad Hefta-Gaub" reflect on the first year of the podcast's publication, discussing Scott's ongoing questions about tech...

24 Helmi 43min

Adapt or Fade: Interviewing for Developers in the Age of AI Assisted Coding

Adapt or Fade: Interviewing for Developers in the Age of AI Assisted Coding

In this episode of "How Many CTOs Does It Take?" podcast, hosts Scott Porad and Brad Hefta-Gaub explore the evolving landscape of interviewing for programmer positions in the age of AI-assisted coding...

17 Helmi 30min

From Rave Promoter to SaaS Innovator: Revolutionizing Event Management with Ritesh Patel

From Rave Promoter to SaaS Innovator: Revolutionizing Event Management with Ritesh Patel

In this episode of "How Many CTOs Does It Take?" podcast, hosts Scott Porad and Brad Hefta-Gaub sit down with Ritesh Patel, co-founder of Ticket Fairy. Ritesh shares his journey from coding and organi...

10 Helmi 49min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
mimmit-sijoittaa
psykopodiaa-podcast
rss-rahapodi
rss-rahamania
ostan-asuntoja-podcast
rahapuhetta
rss-laakispodi
rss-sisalto-kuntoon
herrasmieshakkerit
sijoituspodi
rss-draivi
inderespodi
rss-sami-miettinen-neuvottelija
rss-lahtijat
rss-bisnesta-bebeja
rss-karon-grilli
rss-seuraava-potilas
rss-paasipodi
vapauta-supervoimasi-podcast