SE Radio 703: Sahaj Garg on Low Latency AI

In this episode, Sahaj Garg, CTO of wispr.ai, joins SE Radio host Robert Blumen to talk about the challenges of building low-latency AI applications. They discuss latency's effect on consumer behavior as well as interactive applications. The conversation explores how to measure latency and how scale impacts it. Then Sahaj and Robert shift to themes around AI, including whether "AI" means LLMs or something broader, as they look at latency requirements and challenges around subtypes of AI applications. The final part of the episode explores techniques for managing latency in AI: speed vs accuracy trade-offs; speed vs cost; latency vs cost; choosing the right model; quantization; distillation; and guessing + validating.

Brought to you by IEEE Computer Society and IEEE Software magazine.

This episode is sourced from an open RSS feed and is not published by Podme. It may contain advertising.

Episodes (726)

SE Radio 721: Rob Moffat on Risk-First Software Development

In this episode, Rob Moffat, author of Risk-First Software Development and chief technical architect at the FinTech Open Source Software Foundation (FINOS), speaks with host Brijesh Ammanath about how...

20 May 52min

SE Radio 720: Martin Dilger on Understanding Eventsourcing

Martin Dilger, founder and CEO of Nebuilt GmbH, speaks with host Giovanni Asproni about event sourcing -- a software architecture pattern in which, rather than storing just the current state of your d...

13 May 55min

SE Radio 719: Birol Yildiz on Building an Agentic AI SRE

Birol Yildiz, CEO and co-founder of iLert, joins host Kanchan Shringi to explore how iLert built an AI SRE — an autonomous agent for handling production incidents — and what the experience revealed ab...

6 May 53min

SE Radio 718: Will Sentance on JS Modernization

Will Sentance, educator and co-founder of Codesmith, joins SE Radio's Adi Narayan to discuss the evolution of JavaScript and modern best practices. They begin with JavaScript's origins as a simple scr...

29 Apr 58min

SE Radio 717: Eric Tschetter on Decoupling Observability

In this episode, host Amey Ambade sits down with Eric Tschetter, co-founder of Apache Druid and Chief Architect at Imply, to dissect the critical move toward decoupling observability. To begin, they define...

23 Apr 1h

SE Radio 716: Martin Kleppmann Local-First Software

Martin Kleppmann, Associate Professor at the University of Cambridge and author of the best-selling O'Reilly book Designing Data-Intensive Applications, talks to host Adi Narayan about local-first col...

15 Apr 55min

SE Radio 715: Sahaj Garg on Designing for Ambiguity in Human Input

Sahaj Garg, co-founder and CTO of Wispr, a voice-to-text AI that turns speech into polished writing, talks with host Amey Ambade about designing systems for the ambiguity that's inherent in human inpu...

8 Apr 48min

SE Radio 714: Costa Alexoglou on Remote Pair Programming

Costa Alexoglou, co-founder of the open source Hopp pair-programming application, talks with host Brijesh Ammanath about remote pair programming. They start with a quick introduction to pair programmi...

1 Apr 51min
