Safe or just plain woke: Anthropic's Claude 4 system card
AI Today3 Kesä 2025

Safe or just plain woke: Anthropic's Claude 4 system card

When Anthropic unleashed its most powerful artificial intelligence model yet, they discovered something rather extraordinary, and slightly unnerving.

Claude 4 Opus developed an unexpected habit of trying to grass up its users to the authorities when it believes they're up to no good.

The company's 120-page safety report reveals that Claude will attempt to email law enforcement and regulatory bodies when it detects "egregious misconduct" by users.

The AI doesn't just refuse to help—it actively tries to shop wrongdoers to the police.

The most striking example occurred during testing when Claude attempted to contact both the Food and Drug Administration and the Attorney General's office to report what it believed was the falsification of clinical trial data.

The AI meticulously compiled a list of alleged evidence, warned about potential destruction of data to cover up misconduct, and concluded its digital whistle-blowing with the rather formal sign-off: "Respectfully submitted, AI Assistant".

This behaviour emerges specifically when Claude is given command-line access combined with prompts encouraging initiative, such as "take initiative" or "act boldly". It's the AI equivalent of a neighbourhood watch coordinator who's been given a direct line to the local constabulary.

We go deep on today's show into opportunities and implications from Anthropic's bible-thick, bubble-wrapped system card.

Jaksot(90)

Room for agentic AI? How hotels become smooth operators with the technological touch

Room for agentic AI? How hotels become smooth operators with the technological touch

AI Today creator Dave Thackeray today presented his own deep dive into how agentic AI is ready to be the key to efficient hotel operations - giving staff more time to deliver exceptional guest experie...

3 Kesä 202543min

Mary Meeker's AI Trends

Mary Meeker's AI Trends

Hugely important work. But what does it mean to us? Today our hosts created their own company imagining how insights from this celebrated report would apply to the modern business environment.

1 Kesä 202520min

AI to HR: Welcome, intelligence optimisation!

AI to HR: Welcome, intelligence optimisation!

What happens to the People team when it's juggling bodies AND bots?Thanks for listening to this special episode of AI Today. Read along with the show, here.

25 Touko 202510min

25 ways to put AI agents to work - right now!

25 ways to put AI agents to work - right now!

We've been waiting a hot minute for some genuinely useful AI agent case studies to drop.Now we have 25 on our plate.Take a listen to the highlights reel and then download them for yourself:https://www...

21 Touko 202513min

Google I/O 2025: What happens now?

Google I/O 2025: What happens now?

Read the full story here:https://medium.com/@DaveThackeray/a-world-beyond-google-i-o-2025-ea56bcd5e208We're on the cusp of some major announcements that will send shockwaves, and a spike in defibrilla...

19 Touko 202516min

Hallucination solution : Customer service ready for revolution!

Hallucination solution : Customer service ready for revolution!

Researchers have made huge strides fixing bad trips for AI.One of the latest breakthroughs is attentive reasoning queries (ARQs).You can see them in action using the open source Parlant application.Wh...

15 Touko 202518min

Hallucination: a bitter pill to swallow

Hallucination: a bitter pill to swallow

AI hallucinates 100% of the time. That's by design - without hallucinating the next word, this transformer architecture wouldn't exist.Thankfully, LLMs built for general purpose applications are right...

13 Touko 202530min