Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?
How I AI3 Dec 2025

Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?

I put three cutting-edge AI models to the test in a head-to-head design competition. Using the exact same prompt, I challenged Google’s Gemini 3, Anthropic’s Opus 4.5, and OpenAI’s Codex 5.1 to redesign my blog page, evaluating them on visual design quality, user experience improvements, and SEO optimization capabilities. One model produced a beautiful, polished, production-ready redesign. One was fine. And one completely whiffed. If you’re trying to figure out where each model fits in your workflow—design, planning, back-end, or something else—this episode will save you a lot of trial and error.


What you’ll learn:

  1. How each AI model approaches the same design challenge differently
  2. Why planning capabilities dramatically impact design quality
  3. The specific visual and functional improvements each model made
  4. Which model excels at front-end design versus back-end functionality
  5. How to strategically choose the right AI model for different parts of your workflow
  6. The importance of model-switching based on specific use cases

Blog design: https://www.chatprd.ai/blog

Brought to you by:

Lovable—Build apps by simply chatting with AI

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

In this episode, we cover:

(00:00) Introduction to the AI design challenge

(01:25) The question: Which model is the better designer?

(03:08) The prompt used for all three models

(04:10) Gemini 3 Pro’s approach and results

(06:00) Opus 4.5’s approach and results

(10:54) Codex 5.1’s approach and disappointing results

(14:51) Comparing the three designs side by side

(16:03) Analyzing the change logs and SEO improvements from each model

(22:43) Final verdict

(23:00) Conclusion and next steps

Tools referenced:

• Gemini 3 Pro: https://deepmind.google/models/gemini/pro/

• Anthropic Opus 4.5: https://www.anthropic.com/news/claude-opus-4-5

• OpenAI Codex 5.1: https://platform.openai.com/docs/models/gpt-5.1-codex

• Cursor: https://cursor.com/

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Avsnitt(68)

How to build your own AI developer tools with Claude Code | CJ Hess (Tenex)

How to build your own AI developer tools with Claude Code | CJ Hess (Tenex)

CJ Hess is a software engineer at Tenex who has built some of the most useful tools and workflows for being a “real AI engineer.” In this episode, CJ demonstrates his custom-built tool, Flowy, that tr...

9 Feb 53min

Guillermo Rauch: Vercel CEO on how v0 hit 3,200 PRs merged per day (and lets anyone ship)

Guillermo Rauch: Vercel CEO on how v0 hit 3,200 PRs merged per day (and lets anyone ship)

Guillermo Rauch, the CEO of Vercel, demonstrates how v0 has evolved from a simple prototyping tool to a complete development environment that supports the entire Git workflow. Guillermo shows how Verc...

4 Feb 43min

How this PM uses MCPs to automate his meeting prep, CRM updates, and customer feedback synthesis | Reid Robinson (Zapier)

How this PM uses MCPs to automate his meeting prep, CRM updates, and customer feedback synthesis | Reid Robinson (Zapier)

Reid Robinson, Principal AI Product Strategist at Zapier, shares how he uses Model Context Protocols (MCPs) to automate tedious tasks and create powerful workflows. He demonstrates practical workflows...

2 Feb 40min

I gave Clawdbot (aka Moltbot) access to my computer, calendar, and emails: Here’s what happened

I gave Clawdbot (aka Moltbot) access to my computer, calendar, and emails: Here’s what happened

In this episode, I take you through my unfiltered experience with Clawdbot, the viral open-source AI agent that’s been taking over tech Twitter. (In the time since this was recorded, the tool was rena...

28 Jan 55min

Advanced Claude Code techniques: context loading, mermaid diagrams, stop hooks, and more | John Lindquist

Advanced Claude Code techniques: context loading, mermaid diagrams, stop hooks, and more | John Lindquist

John Lindquist is the co-founder of egghead.io and an expert in leveraging AI tools for professional software development. In this episode, John shares advanced techniques for using AI coding tools li...

26 Jan 56min

Claude Code for product managers: research, writing, context libraries, custom to-do system, and more | Teresa Torres

Claude Code for product managers: research, writing, context libraries, custom to-do system, and more | Teresa Torres

Teresa Torres is the author of Continuous Discovery Habits and an internationally acclaimed speaker and coach. In this episode, Teresa demonstrates how she’s built a personalized productivity system u...

19 Jan 43min

The power user’s guide to Codex: parallelizing workflows, planning techniques, advanced context engineering tips, automating code reviews, and more | Alexander Embiricos

The power user’s guide to Codex: parallelizing workflows, planning techniques, advanced context engineering tips, automating code reviews, and more | Alexander Embiricos

Alexander Embiricos, the product lead for Codex at OpenAI, shares practical workflows for getting the most out of this AI coding agent. In this episode, he demonstrates how both non-technical users an...

12 Jan 53min

Zapier’s CEO shares his personal AI stack | Wade Foster

Zapier’s CEO shares his personal AI stack | Wade Foster

Wade Foster is the co-founder and CEO of Zapier. In this episode, Wade shows how he uses meeting transcripts, Zapier agents, and even Grok to analyze company culture, evaluate interview candidates, an...

5 Jan 41min

Populärt inom Teknik

uppgang-och-fall
elbilsveckan
market-makers
skogsforum-podcast
rss-elektrikerpodden
rss-powerboat-sverige-podcast
bilar-med-sladd
rss-veckans-ai
developers-mer-an-bara-kod
rss-uppgang-och-fall
rss-laddstationen-med-elbilen-i-sverige
rss-technokratin
gubbar-som-tjotar-om-bilar
rss-fabriken-2
bli-saker-podden
hej-bruksbil
har-vi-akt-till-mars-an
natets-morka-sida
teknikveckan
rss-snacka-om-ai