Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days
How I AI11 Helmi

Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days

I put the newest AI coding models from OpenAI and Anthropic head-to-head, testing them on real engineering work I’m actually doing. I compare GPT-5.3 Codex with Opus 4.6 (and Opus 4.6 Fast) by asking them to redesign my marketing website and refactor some genuinely gnarly components. Through side-by-side experiments, I break down where each model shines—creative development versus code review—and share how I’m thinking about combining them to build a more effective AI engineering stack.

What you’ll learn:

  1. The strengths and weaknesses of OpenAI’s Codex vs. Anthropic’s Opus for different coding tasks
  2. How I shipped 44 PRs containing 98 commits across 1,088 files in just five days using these models
  3. Why Codex excels at code review but struggles with creative, greenfield work
  4. The surprising way Opus and Codex complement each other in a real-world engineering workflow
  5. How to use Git concepts like work trees to maximize productivity with AI coding assistants
  6. Why Opus 4.6 Fast might be worth the 6x price increase (but be careful with your token budget)

Brought to you by:

WorkOS—Make your app enterprise-ready today

Detailed workflow walkthroughs from this episode:

• How I AI: GPT-5.3 Codex vs. Claude Opus 4.6—Shipping 44 PRs in 5 Days: https://www.chatprd.ai/how-i-ai/gpt-5-3-codex-vs-claude-opus-4-6

• How to Combine Claude Opus and GPT-5.3 Codex for High-Velocity Code Refactoring: https://www.chatprd.ai/how-i-ai/workflows/how-to-combine-claude-opus-and-gpt-5-3-codex-for-high-velocity-code-refactoring

• How to Redesign a Marketing Website Using Claude Opus 4.6 for Creative Development: https://www.chatprd.ai/how-i-ai/workflows/how-to-redesign-a-marketing-website-using-claude-opus-4-6-for-creative-development

In this episode, we cover:

(00:00) Introduction to new AI coding models

(02:13) My test methodology for comparing models

(03:30) Codex’s unique features: Git primitives, skills, and automations

(09:05) Testing GPT-5.2 Codex on a website redesign task

(10:40) Challenges with Codex’s literal interpretation of prompts

(15:00) Comparing the before and after with Codex

(16:23) Testing Opus 4.6 on the same website redesign task

(20:56) Comparing the visual results of both models

(21:30) Real-world engineering impact: 44 PRs in five days

(23:03) Refactoring components with Opus 4.6

(24:30) Using Codex for code review and architectural analysis

(26:55) Cost considerations for Opus 4.6 Fast

(28:52) Conclusion

Tools referenced:

• OpenAI’s GPT-5.3 Codex: https://openai.com/index/introducing-gpt-5-3-codex/

• Anthropic’s Claude Opus 4.6: https://www.anthropic.com/news/claude-opus-4-6

• Cursor: https://cursor.sh/

• GitHub: https://github.com/

Other references:

• Tailwind CSS: https://tailwindcss.com/

• Git: https://git-scm.com/

• Bugbot: https://cursor.com/bugbot

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Jaksot(65)

I built a custom Slack inbox. It was easier than you’d think. | Yash Tekriwal (Clay)

I built a custom Slack inbox. It was easier than you’d think. | Yash Tekriwal (Clay)

Yash Tekriwal is the head of education at Clay. A self-described hyper-optimizer, Yash has built multiple custom productivity applications using Perplexity Computer and OpenClaw to manage his overwhel...

8 Huhti 44min

I gave Claude Code our entire codebase. Our customers noticed. | Al Chen (Galileo)

I gave Claude Code our entire codebase. Our customers noticed. | Al Chen (Galileo)

Al Chen is a field engineer at Galileo, an observability platform for AI applications, where he works on the front lines with enterprise customers asking highly technical questions. Despite never havi...

6 Huhti 45min

How to turn Claude Code into your personal life operating system | Hilary Gridley

How to turn Claude Code into your personal life operating system | Hilary Gridley

Hilary Gridley is an entrepreneur, former product leader, and new mom who previously appeared on the podcast discussing AI for managers. She returns to share how she's transformed her approach to pers...

30 Maalis 51min

How Stripe built “minions”—AI coding agents that ship 1,300 PRs weekly from Slack reactions | Steve Kaliski (Stripe engineer)

How Stripe built “minions”—AI coding agents that ship 1,300 PRs weekly from Slack reactions | Steve Kaliski (Stripe engineer)

Steve Kaliski is a software engineer at Stripe who has spent the past six and a half years building developer tools and payment infrastructure. He’s part of the team that created “minions”—Stripe’s in...

25 Maalis 41min

How Microsoft's AI VP automates everything with Warp | Marco Casalaina

How Microsoft's AI VP automates everything with Warp | Marco Casalaina

Marco Casalaina, VP of Core AI Products and AI Futurist at Microsoft, demonstrates how he uses AI tools to automate administrative tasks that typically consume valuable time. Rather than using Warp as...

23 Maalis 34min

From journalist to iOS developer: How LinkedIn’s editor builds with Claude Code | Daniel Roth

From journalist to iOS developer: How LinkedIn’s editor builds with Claude Code | Daniel Roth

Daniel Roth, editor in chief at LinkedIn, went from business writer to iOS app developer, without ever learning how to code. Using Claude Code, Daniel built and shipped multiple production-ready iOS a...

16 Maalis 38min

From Figma to Claude Code and back | Gui Seiz & Alex Kern (Figma)

From Figma to Claude Code and back | Gui Seiz & Alex Kern (Figma)

Most teams are still passing static design files back and forth, and most Figma files are already out of date by the time they reach engineering. Gui Seiz (designer) and Alex Kern (engineer) from Figm...

11 Maalis 40min

Mastering Midjourney: How to create consistent, beautiful brand imagery without complex prompts | Jamey Gannon

Mastering Midjourney: How to create consistent, beautiful brand imagery without complex prompts | Jamey Gannon

Jamey Gannon is an AI creative director who specializes in creating consistent, beautiful brand imagery using AI tools. In this episode, Jamey demonstrates her streamlined workflow for generating cohe...

9 Maalis 49min