How scary is Claude Mythos? 303 pages in 21 minutes

How scary is Claude Mythos? 303 pages in 21 minutes

With Claude Mythos we have an AI that knows when it's being tested, can obscure its reasoning when it wants, and is better at breaking into (and out of) computers than any human alive. Rob Wiblin works through its 244-page System Card and 59-page Alignment Risk Update to explain why:

  • Mythos is a nightmare for computer security
  • It has arrived far ahead of schedule
  • It might be great news for alignment and safety
  • But 3 key problems mean we can’t take its alignment results at face value
  • Mythos isn’t building its replacement yet, probably
  • Anthropic staff are, for the first time, kinda scared of Claude
  • He's losing sleep

Learn more & full transcript: https://80k.info/mythos

This episode was recorded on April 9, 2026.

Chapters:

  • Why people are panicking about computer security (01:05)
  • Mythos could break out of containment (04:23)
  • Anthropic is losing billions in revenue by not releasing Mythos (06:21)
  • Mythos is actually the most aligned model to date, except… (07:48)
  • Mythos knows when it’s being tested (09:52)
  • Mythos can hide its thoughts (11:50)
  • Mythos can’t be trusted about whether it’s untrustworthy (14:02)
  • Does Mythos advance automated AI R&D? (17:03)
  • Mythos scares Anthropic (19:15)

Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon Monsour
Camera operator: Dominic Armstrong
Production: Elizabeth Cox, Nick Stockton, and Katy Moore

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(340)

We can guess what intergalactic war would look like. And strangely, it matters.

We can guess what intergalactic war would look like. And strangely, it matters.

Intergalactic war is probably billions of years away — yet physics can already tell us how it ends. And strangely that conclusion is relevant to decisions people have to make today.In this video, Rob ...

18 Kesä 15min

How AI could create the world’s biggest problems (article by Zershaaneh Qureshi)

How AI could create the world’s biggest problems (article by Zershaaneh Qureshi)

Imagine you’re living 15,000 years ago. Your people are hunter-gatherers and you sleep under the stars. If someone told you humans would one day build cities with millions of people, fly through the a...

11 Kesä 1h 29min

What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

What it's really like to run AGI safety at Google DeepMind (and where I disagree with 'doomers') | Rohin Shah

Most people working on AI safety think without a massive effort AI systems will probably end up with goals catastrophically different from humanity’s. Today’s guest, Rohin Shah — head of AGI Safety an...

2 Kesä 2h 48min

What makes for a dream job? | Benjamin Todd

What makes for a dream job? | Benjamin Todd

What actually makes a job fulfilling? It's not what most career advice tells you. "Follow your passion" sounds inspiring, but it's misleading — and the research backs that up.Drawing on hundreds of st...

28 Touko 28min

We’re updating our career advice for the strangest time in history | Benjamin Todd, author of 80,000 Hours

We’re updating our career advice for the strangest time in history | Benjamin Todd, author of 80,000 Hours

The average career is 80,000 hours long. With AI advancing so rapidly, the hours you have left in your career matter more than ever.Some leading AI researchers think there’s a 10% chance that AI syste...

26 Touko 1h 6min

Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

Can AIs already start 'rogue deployments' inside AI companies? (Landmark new METR report)

A red-teamer was embedded inside Anthropic for three weeks, told to imagine he was an evil Claude, and asked to figure out how to launch a ‘rogue AI deployment’ without getting caught. It’s one part o...

20 Touko 20min

#243 – 'Godfather of AI' Yoshua Bengio: "I now see a path" to safe superintelligent AI

#243 – 'Godfather of AI' Yoshua Bengio: "I now see a path" to safe superintelligent AI

The co-inventor of modern AI and the most cited living scientist believes he's figured out how to ensure AI is honest, incapable of deception, and never goes rogue. Yoshua Bengio – Turing Award Winner...

7 Touko 2h 35min

'95% of AI Pilots Fail': The hidden agenda behind the viral stat that misled millions

'95% of AI Pilots Fail': The hidden agenda behind the viral stat that misled millions

You might have heard that '95% of corporate AI pilots' are failing. It was one of the most widely cited AI statistics of 2025, parroted by media outlets everywhere. It helped trigger a Nasdaq selloff ...

28 Huhti 10min

Suosittua kategoriassa Koulutus

rss-murhan-anatomia
psykopodiaa-podcast
adhd-podi
dear-ladies
voi-hyvin-meditaatiot-2
rss-liian-kuuma-peruna
rss-hereilla
rahapuhetta
rss-niinku-asia-on
kesken
rss-uskonto-on-tylsaa
rss-rahamania
psykologia
rss-valo-minussa-2
rss-arkea-ja-aurinkoa-podcast-espanjasta
aamupore
puhutaan-koiraa
aamukahvilla
rss-narsisti
rss-vapaudu-voimaasi