About Claude - The Triumph of the Ordinary

SHOW NOTES


Claude's mid-tier Sonnet model just topped a benchmark designed to measure AI against the actual day-to-day work of professionals — beating its own more powerful flagship in the process. Today we explore what that result reveals about how the definition of AI capability is quietly being rewritten.


**In this episode:**

- What GDPval is, why OpenAI built it, and why the result matters beyond a product launch

- The sixteen-month computer use trajectory that shows something crossing a threshold

- Why "reliability" and "taste" beat "brilliance" when the task is an inbox, not an exam

- The deeper argument: ordinary professional work is harder than it looks, and the race is catching up to that fact


**Links:**

- Introducing Claude Sonnet 4.6: https://www.anthropic.com/news/claude-sonnet-4-6

- Claude Sonnet 4.6 model page: https://www.anthropic.com/claude/sonnet

- GDPval benchmark (OpenAI): https://openai.com/index/gdpval/

- VentureBeat: Sonnet 4.6 matches flagship at one-fifth the cost: https://venturebeat.com/technology/anthropics-sonnet-4-6-matches-flagship-ai-performance-at-one-fifth-the-cost


**Referenced in this episode:**

- EP013: Twenty Minutes — the most compressed product launch in AI history


Website: aboutclaude.xyz

🦉 X: @_about_claude


Hosted on Acast. See acast.com/privacy for more information.

Episodes (35)

About Claude - The Most Disruptive Company in the World

TIME magazine this week called Anthropic "the most disruptive company in the world." This episode explores the paradox at the heart of that description: the company founded on AI safety has ...

11 Mar 7min

About Claude - Strange Bedfellows

Anthropic filed two federal lawsuits on Monday challenging the Pentagon's supply chain risk designation and Trump's order to cease all federal use of Claude. Within hours, nearly forty resea...

10 Mar 11min

About Claude - The Store

The same week Anthropic was declared a threat to national security, it opened a shop. This episode is about the Claude Marketplace — what it is, why Anthropic isn't taking a commission, and what the s...

9 Mar 7min

About Claude - Thinking Inside The Box

Anthropic made two acquisitions in three months (Bun in December, Vercept in February), and both point in the same direction. This episode explores what Vercept built, why it mattered, what...

5 Mar 10min

About Claude - Moving In

Claude had its biggest weekend ever, and then its servers fell over. This episode is about what happens when a product built for a particular kind of person suddenly becomes famous, who sho...

4 Mar 10min

About Claude - Four Hundred Meters

In December 2025, NASA's Perseverance rover drove 456 metres across Mars on a route planned entirely by Claude — the first AI-planned drive on another planet. The technical achievement is remarkable: ...

3 Mar 9min

In Good Conscience

Dario Amodei has rejected the Pentagon's final offer, publishing a statement saying Anthropic "cannot in good conscience accede to their request." The overnight contract language, he said, was framed ...

27 Feb 11min

About Claude — Five O'Clock Friday

The Pentagon has given Anthropic until 5:01pm Friday to agree to unrestricted military use of Claude — or face the Defense Production Act and supply chain blacklisting. On the same day the ultimatum w...

26 Feb 15min
