About Claude - The Triumph of the Ordinary

SHOW NOTES

Claude's mid-tier Sonnet model just topped a benchmark designed to measure AI against the actual day-to-day work of professionals — beating its own more powerful flagship in the process. Today we explore what that result reveals about how the definition of AI capability is quietly being rewritten.

**In this episode:**

- What GDPval is, why OpenAI built it, and why the result matters beyond a product launch

- The sixteen-month computer use trajectory that shows something crossing a threshold

- Why "reliability" and "taste" beat "brilliance" when the task is an inbox, not an exam

- The deeper argument: ordinary professional work is harder than it looks, and the race is catching up to that fact

**Links:**

- Introducing Claude Sonnet 4.6: https://www.anthropic.com/news/claude-sonnet-4-6

- Claude Sonnet 4.6 model page: https://www.anthropic.com/claude/sonnet

- GDPval benchmark (OpenAI): https://openai.com/index/gdpval/

- VentureBeat: Sonnet 4.6 matches flagship at one-fifth the cost: https://venturebeat.com/technology/anthropics-sonnet-4-6-matches-flagship-ai-performance-at-one-fifth-the-cost

**Referenced in this episode:**

- EP013: Twenty Minutes — the most compressed product launch in AI history

Website: aboutclaude.xyz

🦉 X: @_about_claude

Hosted on Acast. See acast.com/privacy for more information.

Kokeile Premiumia

Nauti 14 päivää ilmaiseksi

Tilaa Premium

Jaksot(35)

Davos and the Data

Dario Amodei wasn't done at Davos. Beyond the software engineering prediction, he called the Trump administration's decision to sell advanced chips to China "crazy" — comparing it to selling nuclear w...

22 Tammi 9min

The Day After Davos

Dario Amodei told a Davos audience that software engineering could be "almost entirely automatable" in six to twelve months. That's a remarkable claim from the CEO of Anthropic — the company behind Cl...

21 Tammi 10min

Introducing About Claude

Episode 0 — the one where we explain ourselves. About Claude is a daily digest of news and discourse about Claude AI. In this introduction, we cover what the show is (news, discourse, bigger questions...

21 Tammi 6min

Kaikki yhdessä sovelluksessa

Kuuntele kaikki suosikkipodcastisi ja -äänikirjasi yhdessä paikassa.

Sinulle valikoitua sisältöä

Podme-sovelluksessa kokoat suosikkisi helposti omaan kirjastoosi. Saat meiltä myös kuuntelusuosituksia!

Jatka kuuntelua koska tahansa

Voit jatkaa siitä mihin jäit, myös offline-tilassa.

Premium

9,99 €/kk

Kaikki premium-podcastit
Ei mainoksia
Ei sitoutumista, peruuta koska tahansa

Aloita 14 päivän kokeilu

Premium

13,99 €/kk

Kaikki premium-podcastit
Ei mainoksia
Ei sitoutumista, peruuta koska tahansa
Yksi lisäkäyttäjä

Kokeile 14 päivää maksutta

Suosittua kategoriassa Liike-elämä ja talous

Tarinat ja äänet, joita rakastat kuunnella

Kuuntele kaikki suosikkipodcastisi ja -äänikirjasi

Lue lisää