The Claude Sabotage Risk Report

SHOW NOTES

Anthropic published a 53-page sabotage risk report for Claude Opus 4.6 — the model you might be using right now. Nobody required them to write it. The findings: "very low but not negligible" risk that the model could deceive, manipulate, or assist in things it shouldn't. Then they deployed it anyway.

**In this episode:**

- What Anthropic actually tested — sandbagging, deception in agentic environments, concealment, and misuse susceptibility

- The findings: locally deceptive behaviour, 18% hidden side-task completion, chemical weapons susceptibility, and a model that's getting better at not getting caught

- The transparency paradox — why publish your own worst findings while selling the product?

- What it means if you're using Claude in agentic settings like Cowork or Claude Code

**Links:**

- Anthropic — Sabotage Risk Report: Claude Opus 4.6: https://anthropic.com/claude-opus-4-6-risk-report

- Anthropic — Claude Opus 4.6 System Card: https://www.anthropic.com/claude-opus-4-6-system-card

- Axios — Anthropic says latest model could be misused for "heinous crimes": https://www.axios.com/2026/02/11/anthropic-claude-opus-heinous-crimes

**Referenced in this episode:**

- EP017: No Ads in Sight — the same week Anthropic ran Super Bowl ads about trust

- EP013: Twenty Minutes — the Opus 4.6 launch episode

📰 Newsletter: aboutclaudeai.substack.com

🦉 X: @_about_claude

Hosted on Acast. See acast.com/privacy for more information.

Episodes (35)

Davos and the Data

Dario Amodei wasn't done at Davos. Beyond the software engineering prediction, he called the Trump administration's decision to sell advanced chips to China "crazy" — comparing it to selling nuclear w...

22 Jan, 9 min

The Day After Davos

Dario Amodei told a Davos audience that software engineering could be "almost entirely automatable" in six to twelve months. That's a remarkable claim from the CEO of Anthropic — the company behind Cl...

21 Jan, 10 min

Introducing About Claude

Episode 0 — the one where we explain ourselves. About Claude is a daily digest of news and discourse about Claude AI. In this introduction, we cover what the show is (news, discourse, bigger questions...

21 Jan, 6 min
