How a Moonshot Led to Google DeepMind's Veo 3

Dumi Erhan, co-lead of the Veo project at Google DeepMind, joins host Logan Kilpatrick for a deep dive into the evolution of generative video models. They discuss the journey from early research in 2018 to the launch of the state-of-the-art Veo 3 model with native audio generation. Learn about the technical hurdles in evaluating and scaling video models, the challenges of long-duration video coherence, and how user feedback is shaping the future of AI-powered video creation.

Chapters:
0:00 - Intro
0:47 - Veo project's beginnings
3:02 - Veo's origins in Google Brain
5:07 - Video prediction and robotics applications
7:45 - Early progress and evaluation challenges
10:30 - Physics-based evaluations and their limitations
12:18 - The launch of the original Veo model
14:06 - Scaling challenges for video models
16:02 - The leap from Veo 1 to Veo 2
19:40 - Veo 3’s viral audio moment
21:17 - User trends shaping Veo's roadmap
23:49 - Image-to-video vs. text-to-video complexity
26:00 - New prompting methods and user control
27:55 - Coherence in long video generation
31:03 - Genie 3 and world models
35:54 - The steerability challenge
41:59 - Capability transfer and image data's role
47:25 - Closing

This episode is sourced from an open RSS feed and is not published by Podme. It may therefore contain ads.

Episodes (27)

Google Maps Leaders Talk About Its Biggest Update in 10 Years

Miriam Daniel and David Cronin join host Logan Kilpatrick to unveil the biggest update in Google Maps history. This episode explores the launch of Ask Maps, a conversational experience powered by Gemi...

12 Mar 36min

Gemini in Workspace: New Ways to Create Faster

Chapters: 1:15 - Gemini in Docs 3:17 - Which models power Workspace 3:45 - AI Overviews in Drive 5:22 - Rollout and availability 6:33 - Reimagining every Workspace canvas 8:58 - Gemini in Calend...

12 Mar 40min

Gemini in Chrome: Your agentic browsing assistant

Chapters: 0:00 - Introduction 2:49 - Evolution from web apps to integrated assistants 4:37 - Chrome as a platform for personal context 6:38 - Navigating the context overload problem 7:52 - Transf...

12 Mar 48min

Inside Lyria 3, Google's music generation model

1:00 - Defining music generation models 1:40 - Lyria as a new instrument 3:05 - Connecting language and creative intent 5:08 - Guest backgrounds and musical journeys 7:57 - Demo: Instrumental funk jam 8:29...

18 Feb 36min

Project Genie: Create and explore worlds

Chapters: 00:00 - Intro and defining world models and RL roots 01:51 - Demo: Goldfish and shark in underwater world 04:59 - Project Genie gallery 06:31 - Physics, remixing, and UI prompts 11:00 - Demo: Nan...

30 Jan 42min

Gemini 3 and Gen UI in Google Search

Rhiannon Bell and Robby Stein, Product and Design leads for Google Search, join host Logan Kilpatrick for a deep dive into the integration of Gemini 3 into Search. Their conversation explores the evol...

18 Dec 2025 21min

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Logan Kilpatrick from Google DeepMind sits down with Sundar Pichai, CEO of Google and Alphabet to discuss the launch of Gemini 3, Nano Banana Pro and Google's overall AI momentum. They talk about Goog...

26 Nov 2025 27min

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Introducing Nano Banana Pro, a powerful model built on Gemini 3 Pro, designed to enhance text rendering, infographics, and structured content generation. Tune in to learn about Nano Banana Pro’s advan...

26 Nov 2025 36min
