Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?
How I AI · 3 Dec 2025

I put three cutting-edge AI models to the test in a head-to-head design competition. Using the exact same prompt, I challenged Google’s Gemini 3, Anthropic’s Claude Opus 4.5, and OpenAI’s GPT-5.1 Codex to redesign my blog page, then evaluated each on visual design quality, user experience improvements, and SEO capabilities. One model produced a beautiful, polished, production-ready redesign. One was fine. And one completely whiffed. If you’re trying to figure out where each model fits in your workflow (design, planning, back-end, or something else), this episode will save you a lot of trial and error.


What you’ll learn:

  1. How each AI model approaches the same design challenge differently
  2. Why planning capabilities dramatically impact design quality
  3. The specific visual and functional improvements each model made
  4. Which model excels at front-end design versus back-end functionality
  5. How to strategically choose the right AI model for different parts of your workflow
  6. The importance of model-switching based on specific use cases

Blog design: https://www.chatprd.ai/blog

Brought to you by:

Lovable—Build apps by simply chatting with AI

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

In this episode, we cover:

(00:00) Introduction to the AI design challenge

(01:25) The question: Which model is the better designer?

(03:08) The prompt used for all three models

(04:10) Gemini 3 Pro’s approach and results

(06:00) Opus 4.5’s approach and results

(10:54) Codex 5.1’s approach and disappointing results

(14:51) Comparing the three designs side by side

(16:03) Analyzing the change logs and SEO improvements from each model

(22:43) Final verdict

(23:00) Conclusion and next steps

Tools referenced:

• Gemini 3 Pro: https://deepmind.google/models/gemini/pro/

• Anthropic Opus 4.5: https://www.anthropic.com/news/claude-opus-4-5

• OpenAI Codex 5.1: https://platform.openai.com/docs/models/gpt-5.1-codex

• Cursor: https://cursor.com/

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.
