Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days
How I AI11 Helmi

Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days

I put the newest AI coding models from OpenAI and Anthropic head-to-head, testing them on real engineering work I’m actually doing. I compare GPT-5.3 Codex with Opus 4.6 (and Opus 4.6 Fast) by asking them to redesign my marketing website and refactor some genuinely gnarly components. Through side-by-side experiments, I break down where each model shines—creative development versus code review—and share how I’m thinking about combining them to build a more effective AI engineering stack.

What you’ll learn:

  1. The strengths and weaknesses of OpenAI’s Codex vs. Anthropic’s Opus for different coding tasks
  2. How I shipped 44 PRs containing 98 commits across 1,088 files in just five days using these models
  3. Why Codex excels at code review but struggles with creative, greenfield work
  4. The surprising way Opus and Codex complement each other in a real-world engineering workflow
  5. How to use Git concepts like work trees to maximize productivity with AI coding assistants
  6. Why Opus 4.6 Fast might be worth the 6x price increase (but be careful with your token budget)

Brought to you by:

WorkOS—Make your app enterprise-ready today

Detailed workflow walkthroughs from this episode:

• How I AI: GPT-5.3 Codex vs. Claude Opus 4.6—Shipping 44 PRs in 5 Days: https://www.chatprd.ai/how-i-ai/gpt-5-3-codex-vs-claude-opus-4-6

• How to Combine Claude Opus and GPT-5.3 Codex for High-Velocity Code Refactoring: https://www.chatprd.ai/how-i-ai/workflows/how-to-combine-claude-opus-and-gpt-5-3-codex-for-high-velocity-code-refactoring

• How to Redesign a Marketing Website Using Claude Opus 4.6 for Creative Development: https://www.chatprd.ai/how-i-ai/workflows/how-to-redesign-a-marketing-website-using-claude-opus-4-6-for-creative-development

In this episode, we cover:

(00:00) Introduction to new AI coding models

(02:13) My test methodology for comparing models

(03:30) Codex’s unique features: Git primitives, skills, and automations

(09:05) Testing GPT-5.2 Codex on a website redesign task

(10:40) Challenges with Codex’s literal interpretation of prompts

(15:00) Comparing the before and after with Codex

(16:23) Testing Opus 4.6 on the same website redesign task

(20:56) Comparing the visual results of both models

(21:30) Real-world engineering impact: 44 PRs in five days

(23:03) Refactoring components with Opus 4.6

(24:30) Using Codex for code review and architectural analysis

(26:55) Cost considerations for Opus 4.6 Fast

(28:52) Conclusion

Tools referenced:

• OpenAI’s GPT-5.3 Codex: https://openai.com/index/introducing-gpt-5-3-codex/

• Anthropic’s Claude Opus 4.6: https://www.anthropic.com/news/claude-opus-4-6

• Cursor: https://cursor.sh/

• GitHub: https://github.com/

Other references:

• Tailwind CSS: https://tailwindcss.com/

• Git: https://git-scm.com/

• Bugbot: https://cursor.com/bugbot

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Jaksot(68)

“Nobody wanted to do this work”: How Emmy Award–winning filmmakers use AI to automate the tedious parts of documentaries

“Nobody wanted to do this work”: How Emmy Award–winning filmmakers use AI to automate the tedious parts of documentaries

Tim McAleer is a producer at Ken Burns’s Florentine Films who is responsible for the technology and processes that power their documentary production. Rather than using AI to generate creative content...

17 Marras 202547min

How this CEO turned 25,000 hours of sales calls into a self-learning go-to-market engine | Matt Britton (Suzy)

How this CEO turned 25,000 hours of sales calls into a self-learning go-to-market engine | Matt Britton (Suzy)

Matt Britton is the founder and CEO of Suzy, a consumer insights platform that has raised over $100 million in venture capital and works with top brands like Coca-Cola, Google, Procter & Gamble, and N...

10 Marras 202542min

The complete beginner’s guide to coding with AI: from PRD to generating your very first lines of code

The complete beginner’s guide to coding with AI: from PRD to generating your very first lines of code

This episode is for complete beginners. I walk you through how to build your very first coding project using AI tools—even if you’ve never written a line of code. Together, we’ll create a personal pro...

5 Marras 202545min

“Vibe analysis”: How Faire’s data team uses AI to investigate conversion drops, analyze experiment results, and convert raw data into executive-ready insights

“Vibe analysis”: How Faire’s data team uses AI to investigate conversion drops, analyze experiment results, and convert raw data into executive-ready insights

Tim Trueman and Alexa Cerf from Faire’s data team demonstrate how AI tools are revolutionizing data analysis workflows. They show how data teams, product managers, and engineers can use tools like Cur...

3 Marras 20251h 3min

Vibe-coding a kid-friendly AI fortune teller for your Halloween festivities | Marco Casalaina (Microsoft VP)

Vibe-coding a kid-friendly AI fortune teller for your Halloween festivities | Marco Casalaina (Microsoft VP)

In this impromptu Halloween special, Marco Casalaina (VP of Products for Core AI at Microsoft) demonstrates how he uses GitHub Spark to quickly build a mobile app that generates kid-friendly fortunes ...

31 Loka 202511min

“Cursor is a much better product manager than I ever was”: How this PM uses AI for PRDs, Jira tickets, and replying to coworkers | Dennis Yang (Chime)

“Cursor is a much better product manager than I ever was”: How this PM uses AI for PRDs, Jira tickets, and replying to coworkers | Dennis Yang (Chime)

Dennis Yang is the Principal Product Manager for Generative AI at Chime, where he’s pioneered AI workflows that meaningfully increase productivity. While most people use Cursor as a coding tool, Denni...

27 Loka 202550min

Claude Skills explained: How to create reusable AI workflows

Claude Skills explained: How to create reusable AI workflows

Today I dive into Anthropic’s latest feature that lets anyone create reusable workflows for Claude—no coding required. I break down exactly what Claude Skills are, how to build them from scratch, and ...

22 Loka 202527min

How this Yelp AI PM works backward from “golden conversations” to create high-quality prototypes using Claude Artifacts and Magic Patterns | Priya Badger

How this Yelp AI PM works backward from “golden conversations” to create high-quality prototypes using Claude Artifacts and Magic Patterns | Priya Badger

Priya Badger, a product manager at Yelp, shares her innovative approach to designing AI-powered products by starting with example conversations rather than traditional wireframes or PRDs. In this epis...

20 Loka 202541min