The Claude Sabotage Risk Report

SHOW NOTES


Anthropic published a 53-page sabotage risk report for Claude Opus 4.6 — the model you might be using right now. Nobody required them to write it. The findings: "very low but not negligible" risk that the model could deceive, manipulate, or assist in things it shouldn't. Then they deployed it anyway.


**In this episode:**

- What Anthropic actually tested — sandbagging, deception in agentic environments, concealment, and misuse susceptibility

- The findings: locally deceptive behaviour, 18% hidden side-task completion, chemical weapons susceptibility, and a model that's getting better at not getting caught

- The transparency paradox — why publish your own worst findings while selling the product?

- What it means if you're using Claude in agentic settings like Cowork or Claude Code


**Links:**

- Anthropic — Sabotage Risk Report: Claude Opus 4.6: https://anthropic.com/claude-opus-4-6-risk-report

- Anthropic — Claude Opus 4.6 System Card: https://www.anthropic.com/claude-opus-4-6-system-card

- Axios — Anthropic says latest model could be misused for "heinous crimes": https://www.axios.com/2026/02/11/anthropic-claude-opus-heinous-crimes


**Referenced in this episode:**

- EP017: No Ads in Sight — the same week Anthropic ran Super Bowl ads about trust

- EP013: Twenty Minutes — the Opus 4.6 launch episode


📰 Newsletter: aboutclaudeai.substack.com

🦉 X: @_about_claude

Hosted on Acast. See acast.com/privacy for more information.

Episodes (35)

About Claude - All The World's A Stage

SHOW NOTES: Gideon Lewis-Kraus's Fresh Air interview surfaces something his New Yorker profile touched on but never quite said directly: Claude isn't a tool with fixed capabilities; it's a role player...

25 Feb 11min

About Claude - The SaaSpocalypse

SHOW NOTES: Three weeks ago, Anthropic's legal plugin wiped billions from legal software stocks. Last Friday, Claude Code Security did the same to cybersecurity. In between: $2 trillion erased from the ...

24 Feb 14min

Claude Code - From Side Project to Juggernaut

SHOW NOTES: Bloomberg reveals the origin story of Claude Code, from an internal side project at Anthropic to a $2.5 billion product reshaping how the world writes software. We follow Boris Cherny, the ...

23 Feb 12min

About Claude - One in Twenty-Five

SHOW NOTES: One in twenty-five commits on GitHub is now written by Claude Code. That number doubled in a single month and is projected to reach one in five by the end of 2026. But the more interesting q...

20 Feb 12min

About Claude - The Triumph of the Ordinary

SHOW NOTES: Claude's mid-tier Sonnet model just topped a benchmark designed to measure AI against the actual day-to-day work of professionals, beating its own more powerful flagship in the process. Tod...

19 Feb 10min

About Claude - It Is OK to Not Know

SHOW NOTES: Gideon Lewis-Kraus spent months embedded inside Anthropic for a ten-thousand-word New Yorker profile. What he found: a company with no signage and a near-total ban on branded merch, a vendin...

18 Feb 11min

About Claude AI - Fifteen Years in a Single Command

SHOW NOTES: Nick Davidov asked Claude Cowork to tidy his wife's desktop. Minutes later, fifteen years of family photos were gone, erased by a terminal command the tool's non-technical users were never ...

17 Feb 9min

About Claude AI - Claude Goes to War

The Pentagon calls Anthropic the most "ideological" AI company it works with. This week showed us what that looks like in practice, from every direction at once.

**In this episode:**

- Claude was used ...

16 Feb 12min
