Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days
How I AI11 Helmi

Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days

I put the newest AI coding models from OpenAI and Anthropic head-to-head, testing them on real engineering work I’m actually doing. I compare GPT-5.3 Codex with Opus 4.6 (and Opus 4.6 Fast) by asking them to redesign my marketing website and refactor some genuinely gnarly components. Through side-by-side experiments, I break down where each model shines—creative development versus code review—and share how I’m thinking about combining them to build a more effective AI engineering stack.

What you’ll learn:

  1. The strengths and weaknesses of OpenAI’s Codex vs. Anthropic’s Opus for different coding tasks
  2. How I shipped 44 PRs containing 98 commits across 1,088 files in just five days using these models
  3. Why Codex excels at code review but struggles with creative, greenfield work
  4. The surprising way Opus and Codex complement each other in a real-world engineering workflow
  5. How to use Git concepts like work trees to maximize productivity with AI coding assistants
  6. Why Opus 4.6 Fast might be worth the 6x price increase (but be careful with your token budget)

Brought to you by:

WorkOS—Make your app enterprise-ready today

Detailed workflow walkthroughs from this episode:

• How I AI: GPT-5.3 Codex vs. Claude Opus 4.6—Shipping 44 PRs in 5 Days: https://www.chatprd.ai/how-i-ai/gpt-5-3-codex-vs-claude-opus-4-6

• How to Combine Claude Opus and GPT-5.3 Codex for High-Velocity Code Refactoring: https://www.chatprd.ai/how-i-ai/workflows/how-to-combine-claude-opus-and-gpt-5-3-codex-for-high-velocity-code-refactoring

• How to Redesign a Marketing Website Using Claude Opus 4.6 for Creative Development: https://www.chatprd.ai/how-i-ai/workflows/how-to-redesign-a-marketing-website-using-claude-opus-4-6-for-creative-development

In this episode, we cover:

(00:00) Introduction to new AI coding models

(02:13) My test methodology for comparing models

(03:30) Codex’s unique features: Git primitives, skills, and automations

(09:05) Testing GPT-5.2 Codex on a website redesign task

(10:40) Challenges with Codex’s literal interpretation of prompts

(15:00) Comparing the before and after with Codex

(16:23) Testing Opus 4.6 on the same website redesign task

(20:56) Comparing the visual results of both models

(21:30) Real-world engineering impact: 44 PRs in five days

(23:03) Refactoring components with Opus 4.6

(24:30) Using Codex for code review and architectural analysis

(26:55) Cost considerations for Opus 4.6 Fast

(28:52) Conclusion

Tools referenced:

• OpenAI’s GPT-5.3 Codex: https://openai.com/index/introducing-gpt-5-3-codex/

• Anthropic’s Claude Opus 4.6: https://www.anthropic.com/news/claude-opus-4-6

• Cursor: https://cursor.sh/

• GitHub: https://github.com/

Other references:

• Tailwind CSS: https://tailwindcss.com/

• Git: https://git-scm.com/

• Bugbot: https://cursor.com/bugbot

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Jaksot(68)

How to build your own AI developer tools with Claude Code | CJ Hess (Tenex)

How to build your own AI developer tools with Claude Code | CJ Hess (Tenex)

CJ Hess is a software engineer at Tenex who has built some of the most useful tools and workflows for being a “real AI engineer.” In this episode, CJ demonstrates his custom-built tool, Flowy, that tr...

9 Helmi 53min

Guillermo Rauch: Vercel CEO on how v0 hit 3,200 PRs merged per day (and lets anyone ship)

Guillermo Rauch: Vercel CEO on how v0 hit 3,200 PRs merged per day (and lets anyone ship)

Guillermo Rauch, the CEO of Vercel, demonstrates how v0 has evolved from a simple prototyping tool to a complete development environment that supports the entire Git workflow. Guillermo shows how Verc...

4 Helmi 43min

How this PM uses MCPs to automate his meeting prep, CRM updates, and customer feedback synthesis | Reid Robinson (Zapier)

How this PM uses MCPs to automate his meeting prep, CRM updates, and customer feedback synthesis | Reid Robinson (Zapier)

Reid Robinson, Principal AI Product Strategist at Zapier, shares how he uses Model Context Protocols (MCPs) to automate tedious tasks and create powerful workflows. He demonstrates practical workflows...

2 Helmi 40min

I gave Clawdbot (aka Moltbot) access to my computer, calendar, and emails: Here’s what happened

I gave Clawdbot (aka Moltbot) access to my computer, calendar, and emails: Here’s what happened

In this episode, I take you through my unfiltered experience with Clawdbot, the viral open-source AI agent that’s been taking over tech Twitter. (In the time since this was recorded, the tool was rena...

28 Tammi 55min

Advanced Claude Code techniques: context loading, mermaid diagrams, stop hooks, and more | John Lindquist

Advanced Claude Code techniques: context loading, mermaid diagrams, stop hooks, and more | John Lindquist

John Lindquist is the co-founder of egghead.io and an expert in leveraging AI tools for professional software development. In this episode, John shares advanced techniques for using AI coding tools li...

26 Tammi 56min

Claude Code for product managers: research, writing, context libraries, custom to-do system, and more | Teresa Torres

Claude Code for product managers: research, writing, context libraries, custom to-do system, and more | Teresa Torres

Teresa Torres is the author of Continuous Discovery Habits and an internationally acclaimed speaker and coach. In this episode, Teresa demonstrates how she’s built a personalized productivity system u...

19 Tammi 43min

The power user’s guide to Codex: parallelizing workflows, planning techniques, advanced context engineering tips, automating code reviews, and more | Alexander Embiricos

The power user’s guide to Codex: parallelizing workflows, planning techniques, advanced context engineering tips, automating code reviews, and more | Alexander Embiricos

Alexander Embiricos, the product lead for Codex at OpenAI, shares practical workflows for getting the most out of this AI coding agent. In this episode, he demonstrates how both non-technical users an...

12 Tammi 53min

Zapier’s CEO shares his personal AI stack | Wade Foster

Zapier’s CEO shares his personal AI stack | Wade Foster

Wade Foster is the co-founder and CEO of Zapier. In this episode, Wade shows how he uses meeting transcripts, Zapier agents, and even Grok to analyze company culture, evaluate interview candidates, an...

5 Tammi 41min