Claude Opus 4.8 is here. Is it as good as they say?
How I AI28 Touko

Claude Opus 4.8 is here. Is it as good as they say?

I got a few hours of early-access testing with Anthropic’s newly released model Opus 4.8. I walk through real coding, design, and strategy tasks across Claude Code and Claude Cowork, and give you my unfiltered view on what impressed me and what didn’t.

What you’ll learn:

  1. Where Opus 4.8 excels: greenfield prototypes, one-shot features, and fast execution
  2. Where it struggles: the last 10%, edge cases in existing codebases, and hallucinations
  3. How Opus 4.8 compares to Opus 4.7 on business strategy work
  4. Why I’m still reaching for Opus 4.7 on data-heavy strategy and roadmap work
  5. The new features shipping alongside the model: dynamic workflows with parallel subagents and effort control in Claude.ai and Cowork
  6. The prompting and harness strategy I’d use to get the most out of it

In this episode, we cover:

(00:00) Introduction to Opus 4.8

(00:44) Benchmark performance and pricing

(01:53) First coding test: Building a prototyping tool

(03:00) Where it failed: The last 10% problem

(03:27) The hallucination problem

(04:23) Testing Opus 4.8 on existing codebases

(05:24) The ambition test: Building games for a 9-year-old

(07:03) Business strategy test: 4.7 vs 4.8

(08:23) The roadmap test

(09:17) Final verdict

References:

• System Card: Claude Opus 4.8: https://cdn.sanity.io/files/4zrzovbb/website/c886650a2e96fc0925c805a1a7ca77314ccbf4a6.pdf

• Introducing Claude Opus 4.8 on X: https://x.com/claudeai/status/2060042702150930686?s=20

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(79)

The Codex feature that works while you sleep

The Codex feature that works while you sleep

In this 30-minute episode, I walk through my favorite feature in Codex: the /goal command. I show how Goals transform AI from a turn-based assistant that needs constant ‘what’s next?’ prompting into a...

27 Touko 30min

How the engineer behind Claude Cowork actually uses Claude | Felix Rieseberg (Anthropic)

How the engineer behind Claude Cowork actually uses Claude | Felix Rieseberg (Anthropic)

Felix Rieseberg is the engineering lead for Claude Cowork and Claude Code Desktop at Anthropic. He previously spent five years at Slack building developer tools. In this episode, Felix demonstrates ho...

25 Touko 59min

What launched at Google I/O 2026 (30-minute day 1 recap)

What launched at Google I/O 2026 (30-minute day 1 recap)

Today is day one of Google I/O 2026, and I walk through every major announcement live—from the new Gemini 3.5 model family to Anti-Gravity 2.0, Google AI Studio, Gemini’s consumer redesign, the Omni v...

20 Touko 33min

HTML is the new Markdown: How Anthropic engineers are building with Claude Code | Thariq Shihipar

HTML is the new Markdown: How Anthropic engineers are building with Claude Code | Thariq Shihipar

Thariq Shihipar is an engineer at Anthropic working on the Claude Code team. He’s spent the past several months experimenting with HTML as a replacement for Markdown in planning and implementation wor...

18 Touko 35min

Spec-driven development: The AI engineering workflow at Notion | Ryan Nystrom

Spec-driven development: The AI engineering workflow at Notion | Ryan Nystrom

Ryan Nystrom is a software engineer at Notion. He joined in December 2024 after Notion acquired Campsite, the team communication platform he co-founded with Brian Lovin. At Notion, he’s been a core bu...

11 Touko 47min

Code with Claude: The 5 biggest updates explained

Code with Claude: The 5 biggest updates explained

Claire breaks down the biggest announcements from Anthropic’s “Code with Claude” event and what they actually mean for builders shipping AI products today. From scheduled AI routines to outcome-based ...

7 Touko 11min

Quests, token leaderboards, and a skills marketplace: The elite AI adoption playbook | John Kim (Sendbird)

Quests, token leaderboards, and a skills marketplace: The elite AI adoption playbook | John Kim (Sendbird)

John Kim is the co-founder and CEO of Delight.ai, a customer experience platform that’s transforming how companies deploy AI. But what makes John’s story fascinating isn’t just his product; it’s how h...

6 Touko 42min