Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days
How I AI11 Helmi

Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days

I put the newest AI coding models from OpenAI and Anthropic head-to-head, testing them on real engineering work I’m actually doing. I compare GPT-5.3 Codex with Opus 4.6 (and Opus 4.6 Fast) by asking them to redesign my marketing website and refactor some genuinely gnarly components. Through side-by-side experiments, I break down where each model shines—creative development versus code review—and share how I’m thinking about combining them to build a more effective AI engineering stack.

What you’ll learn:

  1. The strengths and weaknesses of OpenAI’s Codex vs. Anthropic’s Opus for different coding tasks
  2. How I shipped 44 PRs containing 98 commits across 1,088 files in just five days using these models
  3. Why Codex excels at code review but struggles with creative, greenfield work
  4. The surprising way Opus and Codex complement each other in a real-world engineering workflow
  5. How to use Git concepts like work trees to maximize productivity with AI coding assistants
  6. Why Opus 4.6 Fast might be worth the 6x price increase (but be careful with your token budget)

Brought to you by:

WorkOS—Make your app enterprise-ready today

Detailed workflow walkthroughs from this episode:

• How I AI: GPT-5.3 Codex vs. Claude Opus 4.6—Shipping 44 PRs in 5 Days: https://www.chatprd.ai/how-i-ai/gpt-5-3-codex-vs-claude-opus-4-6

• How to Combine Claude Opus and GPT-5.3 Codex for High-Velocity Code Refactoring: https://www.chatprd.ai/how-i-ai/workflows/how-to-combine-claude-opus-and-gpt-5-3-codex-for-high-velocity-code-refactoring

• How to Redesign a Marketing Website Using Claude Opus 4.6 for Creative Development: https://www.chatprd.ai/how-i-ai/workflows/how-to-redesign-a-marketing-website-using-claude-opus-4-6-for-creative-development

In this episode, we cover:

(00:00) Introduction to new AI coding models

(02:13) My test methodology for comparing models

(03:30) Codex’s unique features: Git primitives, skills, and automations

(09:05) Testing GPT-5.2 Codex on a website redesign task

(10:40) Challenges with Codex’s literal interpretation of prompts

(15:00) Comparing the before and after with Codex

(16:23) Testing Opus 4.6 on the same website redesign task

(20:56) Comparing the visual results of both models

(21:30) Real-world engineering impact: 44 PRs in five days

(23:03) Refactoring components with Opus 4.6

(24:30) Using Codex for code review and architectural analysis

(26:55) Cost considerations for Opus 4.6 Fast

(28:52) Conclusion

Tools referenced:

• OpenAI’s GPT-5.3 Codex: https://openai.com/index/introducing-gpt-5-3-codex/

• Anthropic’s Claude Opus 4.6: https://www.anthropic.com/news/claude-opus-4-6

• Cursor: https://cursor.sh/

• GitHub: https://github.com/

Other references:

• Tailwind CSS: https://tailwindcss.com/

• Git: https://git-scm.com/

• Bugbot: https://cursor.com/bugbot

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Jaksot(68)

How Webflow’s CPO built an AI chief of staff to manage her calendar, prep for meetings, and drive AI adoption | Rachel Wolan

How Webflow’s CPO built an AI chief of staff to manage her calendar, prep for meetings, and drive AI adoption | Rachel Wolan

Rachel Wolan, the chief product officer at Webflow, has embraced AI not just as a product leader but as a hands-on builder. A coder since age 16, Rachel has returned to her technical roots by creating...

29 Joulu 202543min

How to get your whole team excited about AI (and actually using it) | Brian Greenbaum (product designer at Pendo)

How to get your whole team excited about AI (and actually using it) | Brian Greenbaum (product designer at Pendo)

Brian Greenbaum is a Senior Staff Product Designer at Pendo who led a company-wide AI transformation after a personal epiphany while on paternity leave. After experiencing the power of AI coding tools...

22 Joulu 202547min

How Zapier’s EA built an army of AI interns to automate meeting prep, strengthen team culture, and scale internal alignment | Cortney Hickey

How Zapier’s EA built an army of AI interns to automate meeting prep, strengthen team culture, and scale internal alignment | Cortney Hickey

Cortney Hickey is the executive assistant to the CEO at Zapier, where she’s leveraging AI to transform traditional EA responsibilities into scalable, organization-wide systems. In this episode, she de...

15 Joulu 202544min

ChatGPT agent mode: The “little helper” that transformed recruiting, crafted user personas, and solved parking nightmares | Michal Peled (Honeybook)

ChatGPT agent mode: The “little helper” that transformed recruiting, crafted user personas, and solved parking nightmares | Michal Peled (Honeybook)

Michal Peled is a Technical Operations Engineer at HoneyBook who specializes in building internal tools and automations that eliminate friction for teams. In this episode, Michal demonstrates three pr...

8 Joulu 202558min

Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?

Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?

I put three cutting-edge AI models to the test in a head-to-head design competition. Using the exact same prompt, I challenged Google’s Gemini 3, Anthropic’s Opus 4.5, and OpenAI’s Codex 5.1 to redesi...

3 Joulu 202525min

“PMs who use AI will replace those who don’t”: Google’s AI product lead on the new PM toolkit | Marily Nika

“PMs who use AI will replace those who don’t”: Google’s AI product lead on the new PM toolkit | Marily Nika

Marily Nika, AI Product Lead at Google and founder of the AI Product Academy, demonstrates how product managers can leverage AI tools to dramatically accelerate their workflow. Using a smart-fridge co...

1 Joulu 202540min

How to create your own AI performance coach: Optimizing your unique nutrition, recovery, and injury management needs | Lucas Werthein (Cactus)

How to create your own AI performance coach: Optimizing your unique nutrition, recovery, and injury management needs | Lucas Werthein (Cactus)

Lucas Werthein, the COO and co-founder of Cactus, shares how he built a personalized AI wellness coach using ChatGPT to optimize his athletic performance while managing past injuries. After multiple s...

24 Marras 202551min

“Farm-to-table software”: How I built a Thanksgiving party hub using Lovable for managing invites, dishes, shared recipes, and photos

“Farm-to-table software”: How I built a Thanksgiving party hub using Lovable for managing invites, dishes, shared recipes, and photos

In today’s pre-Thanksgiving episode, I walk you through how I vibe coded my very own “Thanksgiving party hub” using Lovable—and how I transformed it from AI-generated slop into something warm, persona...

19 Marras 202534min