An exclusive inside look at GPT-5
How I AI7 Elo 2025

An exclusive inside look at GPT-5

In this episode, I share my hands-on experience with OpenAI’s GPT-5, the company’s new frontier model. As one of the first users outside of OpenAI to test the model, I put GPT-5 head-to-head with GPT-4.1 across real-world product use cases—from writing PRDs to generating code to assisting with visual design work. This is my unfiltered look at what GPT-5 can (and can’t) do—and how it changes the game for builders.


What you’ll learn:

1. How GPT-5 differs from previous models with its engineering-focused approach to problem-solving and tendency to prioritize technical details over business context

2. A comparative analysis of how GPT-5 and GPT-4.1 generate different types of product requirement documents and prototypes for the same prompt

3. Why GPT-5 excels at technical writing, functional requirements, and code generation while potentially skipping important business discovery questions

4. The model’s impressive spatial awareness capabilities when generating images for interior design and other visual tasks

5. Practical considerations for choosing the right model based on your specific use case and audience

6. How GPT-5’s extensive tool-calling behavior and bullet-point communication style reflect its engineering-oriented design

Brought to you by ChatPRD—an AI copilot for PMs and their teams: https://www.chatprd.ai/howiai

25k giveaway:

 To celebrate 25,000 YouTube followers, we’re doing a giveaway. Win a free year of my favorite AI products, including v0, Replit, Lovable, Bolt, Cursor, and, of course, ChatPRD, by leaving a rating and review on your favorite podcast app and subscribing to the podcast on YouTube. To enter: https://www.howiaipod.com/giveaway

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

In this episode, we cover:

(00:00) Introduction to GPT-5

(04:34) Testing GPT-5 in ChatPRD for document generation

(07:10) Comparing GPT-5 and GPT-4.1 on business vs. technical orientation

(11:22) Side-by-side comparison of PRDs generated by both models

(15:23) Where GPT-5 excels: Technical considerations and documentation quality

(17:35) Comparing prototypes generated from different model outputs

(19:57) Testing homepage critique capabilities between models

(23:14) OpenAI’s strengths in API design and developer support

(25:37) GPT-5’s performance as a coding assistant

(27:26) Examining GPT-5 in ChatGPT’s interface

(28:50) Testing GPT-5’s front-end design capabilities

(31:17) Personal use case: bathroom remodel planning

(33:45) Comparing GPT-5 vs. GPT-4 for interior design visualization

(38:10) Summary of key findings and recommendations

Tools referenced:

• OpenAI: https://openai.com/

• ChatGPT: https://chat.openai.com/

• Claude: https://claude.ai/

• Gemini: https://gemini.google.com/

• Cursor: https://cursor.sh/

• v0: https://v0.dev/

• Lovable: https://lovable.dev/

• Bolt: https://bolt.com/

• LaunchDarkly AI Configs: https://launchdarkly.com/docs/home/ai-configs

Other reference:

• Benjamin Moore paints: https://www.benjaminmoore.com/

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Jaksot(65)

How Coinbase scaled AI to 1,000+ engineers | Chintan Turakhia

How Coinbase scaled AI to 1,000+ engineers | Chintan Turakhia

Chintan Turakhia is Senior Director of Engineering at Coinbase, where he’s led the transformation of a 1,000-plus-engineer organization to embrace AI tools at scale. When tasked with rewriting Coinbas...

2 Maalis 58min

5 OpenClaw agents run my home, finances, and code | Jesse Genet

5 OpenClaw agents run my home, finances, and code | Jesse Genet

Jesse Genet is a homeschooling parent and entrepreneur who runs her household with five specialized OpenClaw agents. She layers them on top of her Obsidian “second brain,” deploys each on its own Mac ...

25 Helmi 49min

“I haven’t written a single line of front-end code in 3 months”: How Notion’s design team uses Claude Code to prototype

“I haven’t written a single line of front-end code in 3 months”: How Notion’s design team uses Claude Code to prototype

Brian Lovin is a designer at Notion AI who has transformed how the design team builds prototypes, by creating a shared code environment powered by Claude Code. Instead of designers working in isolated...

23 Helmi 51min

How this visually impaired engineer uses Claude Code to make his life more accessible | Joe McCormick

How this visually impaired engineer uses Claude Code to make his life more accessible | Joe McCormick

Joe McCormick is a principal software engineer at Babylist who lost most of his central vision due to a rare genetic disorder right before starting college. He pivoted from mechanical engineering to c...

16 Helmi 49min

Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days

Claude Opus 4.6 vs. GPT-5.3 Codex: How I shipped 93,000 lines of code in 5 days

I put the newest AI coding models from OpenAI and Anthropic head-to-head, testing them on real engineering work I’m actually doing. I compare GPT-5.3 Codex with Opus 4.6 (and Opus 4.6 Fast) by asking ...

11 Helmi 30min

How to build your own AI developer tools with Claude Code | CJ Hess (Tenex)

How to build your own AI developer tools with Claude Code | CJ Hess (Tenex)

CJ Hess is a software engineer at Tenex who has built some of the most useful tools and workflows for being a “real AI engineer.” In this episode, CJ demonstrates his custom-built tool, Flowy, that tr...

9 Helmi 53min

Guillermo Rauch: Vercel CEO on how v0 hit 3,200 PRs merged per day (and lets anyone ship)

Guillermo Rauch: Vercel CEO on how v0 hit 3,200 PRs merged per day (and lets anyone ship)

Guillermo Rauch, the CEO of Vercel, demonstrates how v0 has evolved from a simple prototyping tool to a complete development environment that supports the entire Git workflow. Guillermo shows how Verc...

4 Helmi 43min

How this PM uses MCPs to automate his meeting prep, CRM updates, and customer feedback synthesis | Reid Robinson (Zapier)

How this PM uses MCPs to automate his meeting prep, CRM updates, and customer feedback synthesis | Reid Robinson (Zapier)

Reid Robinson, Principal AI Product Strategist at Zapier, shares how he uses Model Context Protocols (MCPs) to automate tedious tasks and create powerful workflows. He demonstrates practical workflows...

2 Helmi 40min