Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?
How I AI3 Dec 2025

Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?

I put three cutting-edge AI models to the test in a head-to-head design competition. Using the exact same prompt, I challenged Google’s Gemini 3, Anthropic’s Opus 4.5, and OpenAI’s Codex 5.1 to redesign my blog page, evaluating them on visual design quality, user experience improvements, and SEO optimization capabilities. One model produced a beautiful, polished, production-ready redesign. One was fine. And one completely whiffed. If you’re trying to figure out where each model fits in your workflow—design, planning, back-end, or something else—this episode will save you a lot of trial and error.


What you’ll learn:

  1. How each AI model approaches the same design challenge differently
  2. Why planning capabilities dramatically impact design quality
  3. The specific visual and functional improvements each model made
  4. Which model excels at front-end design versus back-end functionality
  5. How to strategically choose the right AI model for different parts of your workflow
  6. The importance of model-switching based on specific use cases

Blog design: https://www.chatprd.ai/blog

Brought to you by:

Lovable—Build apps by simply chatting with AI

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

In this episode, we cover:

(00:00) Introduction to the AI design challenge

(01:25) The question: Which model is the better designer?

(03:08) The prompt used for all three models

(04:10) Gemini 3 Pro’s approach and results

(06:00) Opus 4.5’s approach and results

(10:54) Codex 5.1’s approach and disappointing results

(14:51) Comparing the three designs side by side

(16:03) Analyzing the change logs and SEO improvements from each model

(22:43) Final verdict

(23:00) Conclusion and next steps

Tools referenced:

• Gemini 3 Pro: https://deepmind.google/models/gemini/pro/

• Anthropic Opus 4.5: https://www.anthropic.com/news/claude-opus-4-5

• OpenAI Codex 5.1: https://platform.openai.com/docs/models/gpt-5.1-codex

• Cursor: https://cursor.com/

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Avsnitt(68)

How Webflow’s CPO built an AI chief of staff to manage her calendar, prep for meetings, and drive AI adoption | Rachel Wolan

How Webflow’s CPO built an AI chief of staff to manage her calendar, prep for meetings, and drive AI adoption | Rachel Wolan

Rachel Wolan, the chief product officer at Webflow, has embraced AI not just as a product leader but as a hands-on builder. A coder since age 16, Rachel has returned to her technical roots by creating...

29 Dec 202543min

How to get your whole team excited about AI (and actually using it) | Brian Greenbaum (product designer at Pendo)

How to get your whole team excited about AI (and actually using it) | Brian Greenbaum (product designer at Pendo)

Brian Greenbaum is a Senior Staff Product Designer at Pendo who led a company-wide AI transformation after a personal epiphany while on paternity leave. After experiencing the power of AI coding tools...

22 Dec 202547min

How Zapier’s EA built an army of AI interns to automate meeting prep, strengthen team culture, and scale internal alignment | Cortney Hickey

How Zapier’s EA built an army of AI interns to automate meeting prep, strengthen team culture, and scale internal alignment | Cortney Hickey

Cortney Hickey is the executive assistant to the CEO at Zapier, where she’s leveraging AI to transform traditional EA responsibilities into scalable, organization-wide systems. In this episode, she de...

15 Dec 202544min

ChatGPT agent mode: The “little helper” that transformed recruiting, crafted user personas, and solved parking nightmares | Michal Peled (Honeybook)

ChatGPT agent mode: The “little helper” that transformed recruiting, crafted user personas, and solved parking nightmares | Michal Peled (Honeybook)

Michal Peled is a Technical Operations Engineer at HoneyBook who specializes in building internal tools and automations that eliminate friction for teams. In this episode, Michal demonstrates three pr...

8 Dec 202558min

“PMs who use AI will replace those who don’t”: Google’s AI product lead on the new PM toolkit | Marily Nika

“PMs who use AI will replace those who don’t”: Google’s AI product lead on the new PM toolkit | Marily Nika

Marily Nika, AI Product Lead at Google and founder of the AI Product Academy, demonstrates how product managers can leverage AI tools to dramatically accelerate their workflow. Using a smart-fridge co...

1 Dec 202540min

How to create your own AI performance coach: Optimizing your unique nutrition, recovery, and injury management needs | Lucas Werthein (Cactus)

How to create your own AI performance coach: Optimizing your unique nutrition, recovery, and injury management needs | Lucas Werthein (Cactus)

Lucas Werthein, the COO and co-founder of Cactus, shares how he built a personalized AI wellness coach using ChatGPT to optimize his athletic performance while managing past injuries. After multiple s...

24 Nov 202551min

“Farm-to-table software”: How I built a Thanksgiving party hub using Lovable for managing invites, dishes, shared recipes, and photos

“Farm-to-table software”: How I built a Thanksgiving party hub using Lovable for managing invites, dishes, shared recipes, and photos

In today’s pre-Thanksgiving episode, I walk you through how I vibe coded my very own “Thanksgiving party hub” using Lovable—and how I transformed it from AI-generated slop into something warm, persona...

19 Nov 202534min

Populärt inom Teknik

uppgang-och-fall
elbilsveckan
market-makers
skogsforum-podcast
rss-elektrikerpodden
rss-powerboat-sverige-podcast
bilar-med-sladd
rss-veckans-ai
developers-mer-an-bara-kod
rss-uppgang-och-fall
rss-laddstationen-med-elbilen-i-sverige
rss-technokratin
gubbar-som-tjotar-om-bilar
rss-fabriken-2
bli-saker-podden
hej-bruksbil
har-vi-akt-till-mars-an
natets-morka-sida
teknikveckan
rss-snacka-om-ai