How a Moonshot Led to Google DeepMind's Veo 3

How a Moonshot Led to Google DeepMind's Veo 3

Dumi Erhan, co-lead of the Veo project at Google DeepMind, joins host Logan Kilpatrick for a deep dive into the evolution of generative video models. They discuss the journey from early research in 2018 to the launch of state-of-the-art Veo 3 model with native audio generation. Learn about the technical hurdles in evaluating and scaling video models, the challenges of long-duration video coherence and how user feedback is shaping the future of AI-powered video creation.

Chapter:
0:00 - Intro
0:47 - Veo project's beginnings
3:02 - Veo's origins in Google Brain
5:07 - Video prediction and robotics applications
7:45 - Early progress and evaluation challenges
10:30 - Physics-based evaluations and their limitations
12:18 - The launch of the original Veo model
14:06 - Scaling challenges for video models
16:02 - The leap from Veo1 to Veo2
19:40 - Veo 3’s viral audio moment
21:17 - User trends shaping Veo's roadmap
23:49 - Image-to-video vs. text-to-video complexity
26:00 - New prompting methods and user control
27:55 - Coherence in long video generation
31:03 - Genie 3 and world models
35:54 - The steerability challenge
41:59 - Capability transfer and image data's role
47:25 - Closing

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(27)

Google Maps Leaders Talk About Its Biggest Update in 10 Years

Google Maps Leaders Talk About Its Biggest Update in 10 Years

Miriam Daniel and David Cronin join host Logan Kilpatrick to unveil the biggest update in Google Maps history. This episode explores the launch of Ask Maps, a conversational experience powered by Gemi...

12 Maalis 36min

Gemini in Workspace: New Ways to Create Faster

Gemini in Workspace: New Ways to Create Faster

Chapters: 1:15 - Gemini in Docs 3:17 - Which models power Workspace 3:45 - AI Overviews in Drive 5:22 - Rollout and availability 6:33 - Reimagining every Workspace canvas 8:58 - Gemini in Calend...

12 Maalis 40min

Gemini in Chrome: Your agentic browsing assistant

Gemini in Chrome: Your agentic browsing assistant

Chapters: 0:00 - Introduction 2:49 - Evolution from web apps to integrated assistants 4:37 - Chrome as a platform for personal context 6:38 - Navigating the context overload problem 7:52 - Transf...

12 Maalis 48min

Inside Lyria 3, Google's music generation model

Inside Lyria 3, Google's music generation model

1:00 - Defining music generation models1:40 - Lyria as a new instrument3:05 - Connecting language and creative intent5:08 - Guest backgrounds and musical journeys7:57 - Demo: Instrumental funk jam8:29...

18 Helmi 36min

Project Genie: Create and explore worlds

Project Genie: Create and explore worlds

Chapters:00:00 - Intro and defining world models and RL roots01:51 - Demo: Goldfish and shark in underwater world04:59 - Project Genie gallery06:31 - Physics, remixing, and UI prompts11:00 - Demo: Nan...

30 Tammi 42min

Gemini 3 and Gen UI in Google Search

Gemini 3 and Gen UI in Google Search

Rhiannon Bell and Robby Stein, Product and Design leads for Google Search, join host Logan Kilpatrick for a deep dive into the integration of Gemini 3 into Search. Their conversation explores the evol...

18 Joulu 202521min

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Logan Kilpatrick from Google DeepMind sits down with Sundar Pichai, CEO of Google and Alphabet to discuss the launch of Gemini 3, Nano Banana Pro and Google's overall AI momentum. They talk about Goog...

26 Marras 202527min

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Introducing Nano Banana Pro, a powerful model built on Gemini 3 Pro, designed to enhance text rendering, infographics, and structured content generation. Tune in to learn about Nano Banana Pro’s advan...

26 Marras 202536min

Suosittua kategoriassa Tiede

rss-mita-tulisi-tietaa
tiedekulma-podcast
rss-poliisin-mieli
utelias-mieli
rss-tiedetta-vai-tarinaa
university-of-eastern-finland
docemilia
sotataidon-ytimessa
filocast-filosofian-perusteet
menologeja-tutkimusmatka-vaihdevuosiin
rss-bios-podcast
rss-ranskaa-raakana
rss-duodecim-lehti
rss-duokkari-ekstra
rss-astetta-parempi-elama-podcast
rss-ilmasto-kriisissa
rss-ylistys-elaimille
rss-sosiopodi
rss-totuuden-liepeilla