Gemini's Multimodality

Gemini's Multimodality

Ani Baddepudi, Gemini Model Behavior Product Lead, joins host Logan Kilpatrick for a deep dive into Gemini's multimodal capabilities. Their conversation explores why Gemini was built as a natively multimodal model from day one, the future of proactive AI assistants, and how we are moving towards a world where "everything is vision." Learn about the differences between video and image understanding and token representations, higher FPS video sampling, and more.

Chapters:

0:00 - Intro
1:12 - Why Gemini is natively multimodal
2:23 - The technology behind multimodal models
5:15 - Video understanding with Gemini 2.5
9:25 - Deciding what to build next
13:23 - Building new product experiences with multimodal AI
17:15 - The vision for proactive assistants
24:13 - Improving video usability with variable FPS and frame tokenization
27:35 - What’s next for Gemini’s multimodal development
31:47 - Deep dive on Gemini’s document understanding capabilities
37:56 - The teamwork and collaboration behind Gemini
40:56 - What’s next with model behavior


Watch on YouTube: https://www.youtube.com/watch?v=K4vXvaRV0dw

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(27)

Google Maps Leaders Talk About Its Biggest Update in 10 Years

Google Maps Leaders Talk About Its Biggest Update in 10 Years

Miriam Daniel and David Cronin join host Logan Kilpatrick to unveil the biggest update in Google Maps history. This episode explores the launch of Ask Maps, a conversational experience powered by Gemi...

12 Mars 36min

Gemini in Workspace: New Ways to Create Faster

Gemini in Workspace: New Ways to Create Faster

Chapters: 1:15 - Gemini in Docs 3:17 - Which models power Workspace 3:45 - AI Overviews in Drive 5:22 - Rollout and availability 6:33 - Reimagining every Workspace canvas 8:58 - Gemini in Calend...

12 Mars 40min

Gemini in Chrome: Your agentic browsing assistant

Gemini in Chrome: Your agentic browsing assistant

Chapters: 0:00 - Introduction 2:49 - Evolution from web apps to integrated assistants 4:37 - Chrome as a platform for personal context 6:38 - Navigating the context overload problem 7:52 - Transf...

12 Mars 48min

Inside Lyria 3, Google's music generation model

Inside Lyria 3, Google's music generation model

1:00 - Defining music generation models1:40 - Lyria as a new instrument3:05 - Connecting language and creative intent5:08 - Guest backgrounds and musical journeys7:57 - Demo: Instrumental funk jam8:29...

18 Feb 36min

Project Genie: Create and explore worlds

Project Genie: Create and explore worlds

Chapters:00:00 - Intro and defining world models and RL roots01:51 - Demo: Goldfish and shark in underwater world04:59 - Project Genie gallery06:31 - Physics, remixing, and UI prompts11:00 - Demo: Nan...

30 Jan 42min

Gemini 3 and Gen UI in Google Search

Gemini 3 and Gen UI in Google Search

Rhiannon Bell and Robby Stein, Product and Design leads for Google Search, join host Logan Kilpatrick for a deep dive into the integration of Gemini 3 into Search. Their conversation explores the evol...

18 Dec 202521min

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Logan Kilpatrick from Google DeepMind sits down with Sundar Pichai, CEO of Google and Alphabet to discuss the launch of Gemini 3, Nano Banana Pro and Google's overall AI momentum. They talk about Goog...

26 Nov 202527min

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Introducing Nano Banana Pro, a powerful model built on Gemini 3 Pro, designed to enhance text rendering, infographics, and structured content generation. Tune in to learn about Nano Banana Pro’s advan...

26 Nov 202536min

Populärt inom Vetenskap

dumma-manniskor
allt-du-velat-veta
p3-dystopia
kapitalet-en-podd-om-ekonomi
rss-vetenskapsradion
rss-vetenskapsradion-2
rss-ufobortom-rimligt-tvivel
bildningspodden
medicinvetarna
paranormalt-med-caroline-giertz
svd-nyhetsartiklar
det-morka-psyket
rss-spraket
sexet
vetenskapsradion
rss-ronden
pojkmottagningen
ufo-sverige
hacka-livet
rss-broccolipodden-en-podcast-som-inte-handlar-om-broccoli