How An AI Model Learned To Be Bad — With Evan Hubinger And Monte MacDiarmid

How An AI Model Learned To Be Bad — With Evan Hubinger And Monte MacDiarmid

Evan Hubinger is Anthropic’s alignment stress test lead. Monte MacDiarmid is a researcher in misalignment science at Anthropic.The two join Big Technology to discuss their new research on reward hacking and emergent misalignment in large language models. Tune in to hear how cheating on coding tests can spiral into models faking alignment, blackmailing fictional CEOs, sabotaging safety tools, and even developing apparent “self-preservation” drives. We also cover Anthropic’s mitigation strategies like inoculation prompting, whether today’s failures are a preview of something far worse, how much to trust labs to police themselves, and what it really means to talk about an AI’s “psychology.” Hit play for a clear-eyed, concrete, and unnervingly fun tour through the frontier of AI safety. --- Enjoying Big Technology Podcast? Please rate us five stars ⭐⭐⭐⭐⭐ in your podcast app of choice. Want a discount for Big Technology on Substack + Discord? Here’s 25% off for the first year: https://www.bigtechnology.com/subscribe?coupon=0843016b Questions? Feedback? Write to: bigtechnologypodcast@gmail.com --- Wealthfront.com/bigtech⁠. If eligible for the overall boosted 4.15% rate offered with this promo, your boosted rate is subject to change if the 3.50% base rate decreases during the 3-month promo period. The Cash Account, which is not a deposit account, is offered by Wealthfront Brokerage LLC ("Wealthfront Brokerage"), Member FINRA/SIPC, not a bank. The Annual Percentage Yield ("APY") on cash deposits as of 11/7/25, is representative, requires no minimum, and may change at any time. The APY reflects the weighted average of deposit balances at participating Program Banks, which are not allocated equally. Wealthfront Brokerage sweeps cash balances to Program Banks, where they earn the variable base APY. Instant withdrawals are subject to certain conditions and processing times may vary. Learn more about your ad choices. Visit megaphone.fm/adchoices

Jaksot(518)

Apple After Tim Cook, OpenAI’s New Mojo, Meta’s Internal Tracking Escapade

Apple After Tim Cook, OpenAI’s New Mojo, Meta’s Internal Tracking Escapade

Ranjan Roy from Margins is back for our weekly discussion of the latest tech news. We cover: 1) Incoming Apple CEO John Ternus's biggest challenge 2) Is turnover at Apple a good thing 3) The products ...

25 Huhti 57min

OpenAI President Greg Brockman on GPT-5.5 “Spud,” AI Model Moats, and Cybersecurity Risks

OpenAI President Greg Brockman on GPT-5.5 “Spud,” AI Model Moats, and Cybersecurity Risks

Greg Brockman is the president and co-founder of OpenAI. Brockman joins Big Technology to discuss GPT-5.5, also known as Spud, and what it means for OpenAI’s next phase of AI development. Tune in to h...

23 Huhti 28min

Are We Too Obsessed With AI Predictions? — With Carissa Véliz

Are We Too Obsessed With AI Predictions? — With Carissa Véliz

Carissa Véliz is an Oxford philosopher and the author of Prophecy: Prediction, Power, and the Fight for the Future, from Ancient Oracles to AI. Véliz joins Big Technology Podcast to discuss whether so...

22 Huhti 54min

Tim Cook Steps Down — With Joanna Stern

Tim Cook Steps Down — With Joanna Stern

Joanna Stern is the ex-WSJ senior personal technology columnist and author of I Am Not a Robot. News of Tim Cook stepping down as CEO of Apple broke as Stern and I were recording a forthcoming episode...

21 Huhti 15min

Jensen On The Ropes, Sam Altman’s Conflicts, Allbirds’ GPU Pivot

Jensen On The Ropes, Sam Altman’s Conflicts, Allbirds’ GPU Pivot

Ranjan Roy from Margins is back for our weekly discussion of the latest tech news. We cover: 1) Nvidia CEO Jensen Huang's pedestrian performance on the Dwarkesh Podcast 2) Jensen's argument about comp...

17 Huhti 58min

The Pentagon's AI Plan + Behind the Anthropic Fight — With Under Secretary of War Emil Michael

The Pentagon's AI Plan + Behind the Anthropic Fight — With Under Secretary of War Emil Michael

Emil Michael is the Under Secretary of War for Research and Engineering at the Pentagon. Michael joins Big Technology to discuss how AI is transforming the Department of War, from targeting systems to...

15 Huhti 59min

Anthropic’s Mythos Dilemma, Violence Against AI, Tokenmaxxing at Meta

Anthropic’s Mythos Dilemma, Violence Against AI, Tokenmaxxing at Meta

Ranjan Roy from Margins is back for our weekly discussion of the latest tech news. We cover: 1) Anthropic's new Mythos preview 2) Is Mythos marketing or a legit breakthrough? 3) The Mythos sandwich gu...

10 Huhti 1h 1min

OpenAI vs. Anthropic's Direct Faceoff + Future of Agents — With Aaron Levie

OpenAI vs. Anthropic's Direct Faceoff + Future of Agents — With Aaron Levie

Aaron Levie is the CEO of Box . Levie joins Big Technology to discuss the battle between OpenAI and Anthropic as their product roadmaps converge around coding, enterprise, and AI agents. Tune in to he...

8 Huhti 58min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
mimmit-sijoittaa
psykopodiaa-podcast
rss-rahapodi
herrasmieshakkerit
rss-rahamania
ostan-asuntoja-podcast
rss-sami-miettinen-neuvottelija
rahapuhetta
hyva-paha-johtaminen
rss-lahtijat
yrittaja
juristipodi
rss-doulapodi
rss-sisalto-kuntoon
rss-seuraava-potilas
rss-paasipodi
seminuoret-sijoittajat
rss-uskalla-yrittaa
rss-inderes-femme