Building real-time voice applications with Live API

Building real-time voice applications with Live API

Shrestha Basu Mallick, one of the product leads for the Gemini API, joins host Logan Kilpatrick for a deep dive of Gemini Live API, Google’s real-time, multimodal interface for developers. Learn about how native audio alongside new capabilities like proactive audio and async function calling unlocks the unique power of audio as an interface.

Watch on YouTube: https://www.youtube.com/watch?v=4xlwlU6h-wM

0:00 - Intro
1:18 - Live API Overview
3:36 - Why audio is a special modality
5:07 - Speed vs. precision in audio
6:17 - Controllable and promptable TTS
8:31 - What developers are building with the Live API
11:14 - URL context and async calling features
15:02 - Proactive audio and affective dialog
16:55 - Addressing developer feedback
21:54 - Live API roadmap
23:49 - The role of long context
24:57 - What’s next for the Live API
26:41 - State of the AI audio market
30:10 - Advice for developers getting started with the Live API
31:16 - Live API demo
38:10 - Demo wrap up and closing

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(27)

Google Maps Leaders Talk About Its Biggest Update in 10 Years

Google Maps Leaders Talk About Its Biggest Update in 10 Years

Miriam Daniel and David Cronin join host Logan Kilpatrick to unveil the biggest update in Google Maps history. This episode explores the launch of Ask Maps, a conversational experience powered by Gemi...

12 Mars 36min

Gemini in Workspace: New Ways to Create Faster

Gemini in Workspace: New Ways to Create Faster

Chapters: 1:15 - Gemini in Docs 3:17 - Which models power Workspace 3:45 - AI Overviews in Drive 5:22 - Rollout and availability 6:33 - Reimagining every Workspace canvas 8:58 - Gemini in Calend...

12 Mars 40min

Gemini in Chrome: Your agentic browsing assistant

Gemini in Chrome: Your agentic browsing assistant

Chapters: 0:00 - Introduction 2:49 - Evolution from web apps to integrated assistants 4:37 - Chrome as a platform for personal context 6:38 - Navigating the context overload problem 7:52 - Transf...

12 Mars 48min

Inside Lyria 3, Google's music generation model

Inside Lyria 3, Google's music generation model

1:00 - Defining music generation models1:40 - Lyria as a new instrument3:05 - Connecting language and creative intent5:08 - Guest backgrounds and musical journeys7:57 - Demo: Instrumental funk jam8:29...

18 Feb 36min

Project Genie: Create and explore worlds

Project Genie: Create and explore worlds

Chapters:00:00 - Intro and defining world models and RL roots01:51 - Demo: Goldfish and shark in underwater world04:59 - Project Genie gallery06:31 - Physics, remixing, and UI prompts11:00 - Demo: Nan...

30 Jan 42min

Gemini 3 and Gen UI in Google Search

Gemini 3 and Gen UI in Google Search

Rhiannon Bell and Robby Stein, Product and Design leads for Google Search, join host Logan Kilpatrick for a deep dive into the integration of Gemini 3 into Search. Their conversation explores the evol...

18 Dec 202521min

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Logan Kilpatrick from Google DeepMind sits down with Sundar Pichai, CEO of Google and Alphabet to discuss the launch of Gemini 3, Nano Banana Pro and Google's overall AI momentum. They talk about Goog...

26 Nov 202527min

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Introducing Nano Banana Pro, a powerful model built on Gemini 3 Pro, designed to enhance text rendering, infographics, and structured content generation. Tune in to learn about Nano Banana Pro’s advan...

26 Nov 202536min

Populärt inom Vetenskap

dumma-manniskor
allt-du-velat-veta
p3-dystopia
kapitalet-en-podd-om-ekonomi
rss-vetenskapsradion
rss-vetenskapsradion-2
rss-ufobortom-rimligt-tvivel
bildningspodden
medicinvetarna
paranormalt-med-caroline-giertz
svd-nyhetsartiklar
det-morka-psyket
rss-spraket
sexet
vetenskapsradion
rss-ronden
pojkmottagningen
ufo-sverige
hacka-livet
rss-broccolipodden-en-podcast-som-inte-handlar-om-broccoli