Claude Opus 4.8 is here. Is it as good as they say?
How I AI28 Maj

Claude Opus 4.8 is here. Is it as good as they say?

I got a few hours of early-access testing with Anthropic’s newly released model Opus 4.8. I walk through real coding, design, and strategy tasks across Claude Code and Claude Cowork, and give you my unfiltered view on what impressed me and what didn’t.

What you’ll learn:

  1. Where Opus 4.8 excels: greenfield prototypes, one-shot features, and fast execution
  2. Where it struggles: the last 10%, edge cases in existing codebases, and hallucinations
  3. How Opus 4.8 compares to Opus 4.7 on business strategy work
  4. Why I’m still reaching for Opus 4.7 on data-heavy strategy and roadmap work
  5. The new features shipping alongside the model: dynamic workflows with parallel subagents and effort control in Claude.ai and Cowork
  6. The prompting and harness strategy I’d use to get the most out of it

In this episode, we cover:

(00:00) Introduction to Opus 4.8

(00:44) Benchmark performance and pricing

(01:53) First coding test: Building a prototyping tool

(03:00) Where it failed: The last 10% problem

(03:27) The hallucination problem

(04:23) Testing Opus 4.8 on existing codebases

(05:24) The ambition test: Building games for a 9-year-old

(07:03) Business strategy test: 4.7 vs 4.8

(08:23) The roadmap test

(09:17) Final verdict

References:

• System Card: Claude Opus 4.8: https://cdn.sanity.io/files/4zrzovbb/website/c886650a2e96fc0925c805a1a7ca77314ccbf4a6.pdf

• Introducing Claude Opus 4.8 on X: https://x.com/claudeai/status/2060042702150930686?s=20

Where to find Claire Vo:

ChatPRD: https://www.chatprd.ai/

Website: https://clairevo.com/

LinkedIn: https://www.linkedin.com/in/clairevo/

X: https://x.com/clairevo

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(79)

The Codex feature that works while you sleep

The Codex feature that works while you sleep

In this 30-minute episode, I walk through my favorite feature in Codex: the /goal command. I show how Goals transform AI from a turn-based assistant that needs constant ‘what’s next?’ prompting into a...

27 Maj 30min

How the engineer behind Claude Cowork actually uses Claude | Felix Rieseberg (Anthropic)

How the engineer behind Claude Cowork actually uses Claude | Felix Rieseberg (Anthropic)

Felix Rieseberg is the engineering lead for Claude Cowork and Claude Code Desktop at Anthropic. He previously spent five years at Slack building developer tools. In this episode, Felix demonstrates ho...

25 Maj 59min

What launched at Google I/O 2026 (30-minute day 1 recap)

What launched at Google I/O 2026 (30-minute day 1 recap)

Today is day one of Google I/O 2026, and I walk through every major announcement live—from the new Gemini 3.5 model family to Anti-Gravity 2.0, Google AI Studio, Gemini’s consumer redesign, the Omni v...

20 Maj 33min

HTML is the new Markdown: How Anthropic engineers are building with Claude Code | Thariq Shihipar

HTML is the new Markdown: How Anthropic engineers are building with Claude Code | Thariq Shihipar

Thariq Shihipar is an engineer at Anthropic working on the Claude Code team. He’s spent the past several months experimenting with HTML as a replacement for Markdown in planning and implementation wor...

18 Maj 35min

Spec-driven development: The AI engineering workflow at Notion | Ryan Nystrom

Spec-driven development: The AI engineering workflow at Notion | Ryan Nystrom

Ryan Nystrom is a software engineer at Notion. He joined in December 2024 after Notion acquired Campsite, the team communication platform he co-founded with Brian Lovin. At Notion, he’s been a core bu...

11 Maj 47min

Code with Claude: The 5 biggest updates explained

Code with Claude: The 5 biggest updates explained

Claire breaks down the biggest announcements from Anthropic’s “Code with Claude” event and what they actually mean for builders shipping AI products today. From scheduled AI routines to outcome-based ...

7 Maj 11min

Quests, token leaderboards, and a skills marketplace: The elite AI adoption playbook | John Kim (Sendbird)

Quests, token leaderboards, and a skills marketplace: The elite AI adoption playbook | John Kim (Sendbird)

John Kim is the co-founder and CEO of Delight.ai, a customer experience platform that’s transforming how companies deploy AI. But what makes John’s story fascinating isn’t just his product; it’s how h...

6 Maj 42min

Populärt inom Teknik

uppgang-och-fall
elbilsveckan
market-makers
bilar-med-sladd
rss-elektrikerpodden
rss-laddstationen-med-elbilen-i-sverige
developers-mer-an-bara-kod
rss-veckans-ai
natets-morka-sida
rss-technokratin
bli-saker-podden
skogsforum-podcast
bosse-bildoktorn-och-hasse-p
under-femton
har-vi-akt-till-mars-an
rss-uppgang-och-fall
rss-upplyst-entreprenordirektor
rss-powerboat-sverige-podcast
rss-snacka-om-ai
rss-hit-med-dina-lunchpengar