Behind the scenes of Google's state-of-the-art "nano-banana" image model

Join host Logan Kilpatrick in discussion with some of the minds behind Google's new state-of-the-art image model, Gemini 2.5 Flash Image. Product and research leads from the Gemini team break down the technology behind its key capabilities, including interleaved generation for complex edits and new approaches to achieving character consistency and pixel-perfect control. With Nicole Brichtova, Kaushik Shivakumar, Mostafa Dehghani and Robert Riachi.

Chapters:
0:37 - New model introduction
1:21 - Demo - Image Editing
3:44 - Text rendering capabilities
4:44 - Beyond human preference evals
6:44 - Text rendering as a proxy for quality
8:38 - Positive transfer between modalities
11:25 - Demo - Multi-turn, context aware image generation
13:54 - Pixel-perfect editing and character consistency
15:51 - Interleaved image generation
17:59 - Specialized vs. native models
19:52 - Understanding nuanced prompts
20:59 - User feedback shaping model development
22:37 - Improvements in character consistency
24:17 - More natural looking images from team collaboration
26:41 - What’s next for image generation models

Episodes (27)

Google Maps Leaders Talk About Its Biggest Update in 10 Years

Miriam Daniel and David Cronin join host Logan Kilpatrick to unveil the biggest update in Google Maps history. This episode explores the launch of Ask Maps, a conversational experience powered by Gemi...

12 Mar 36min

Gemini in Workspace: New Ways to Create Faster

Chapters: 1:15 - Gemini in Docs 3:17 - Which models power Workspace 3:45 - AI Overviews in Drive 5:22 - Rollout and availability 6:33 - Reimagining every Workspace canvas 8:58 - Gemini in Calend...

12 Mar 40min

Gemini in Chrome: Your agentic browsing assistant

Chapters: 0:00 - Introduction 2:49 - Evolution from web apps to integrated assistants 4:37 - Chrome as a platform for personal context 6:38 - Navigating the context overload problem 7:52 - Transf...

12 Mar 48min

Inside Lyria 3, Google's music generation model

1:00 - Defining music generation models 1:40 - Lyria as a new instrument 3:05 - Connecting language and creative intent 5:08 - Guest backgrounds and musical journeys 7:57 - Demo: Instrumental funk jam 8:29...

18 Feb 36min

Project Genie: Create and explore worlds

Chapters: 00:00 - Intro and defining world models and RL roots 01:51 - Demo: Goldfish and shark in underwater world 04:59 - Project Genie gallery 06:31 - Physics, remixing, and UI prompts 11:00 - Demo: Nan...

30 Jan 42min

Gemini 3 and Gen UI in Google Search

Rhiannon Bell and Robby Stein, Product and Design leads for Google Search, join host Logan Kilpatrick for a deep dive into the integration of Gemini 3 into Search. Their conversation explores the evol...

18 Dec 2025 21min

Sundar Pichai: Gemini 3, Vibe Coding and Google's Full Stack Strategy

Logan Kilpatrick from Google DeepMind sits down with Sundar Pichai, CEO of Google and Alphabet to discuss the launch of Gemini 3, Nano Banana Pro and Google's overall AI momentum. They talk about Goog...

26 Nov 2025 27min

Nano Banana Pro: Hands-on with the World’s Most Powerful Image Model

Introducing Nano Banana Pro, a powerful model built on Gemini 3 Pro, designed to enhance text rendering, infographics, and structured content generation. Tune in to learn about Nano Banana Pro’s advan...

26 Nov 2025 36min
