How An AI Model Learned To Be Bad — With Evan Hubinger And Monte MacDiarmid

How An AI Model Learned To Be Bad — With Evan Hubinger And Monte MacDiarmid

Evan Hubinger is Anthropic’s alignment stress test lead. Monte MacDiarmid is a researcher in misalignment science at Anthropic.The two join Big Technology to discuss their new research on reward hacking and emergent misalignment in large language models. Tune in to hear how cheating on coding tests can spiral into models faking alignment, blackmailing fictional CEOs, sabotaging safety tools, and even developing apparent “self-preservation” drives. We also cover Anthropic’s mitigation strategies like inoculation prompting, whether today’s failures are a preview of something far worse, how much to trust labs to police themselves, and what it really means to talk about an AI’s “psychology.” Hit play for a clear-eyed, concrete, and unnervingly fun tour through the frontier of AI safety. --- Enjoying Big Technology Podcast? Please rate us five stars ⭐⭐⭐⭐⭐ in your podcast app of choice. Want a discount for Big Technology on Substack + Discord? Here’s 25% off for the first year: https://www.bigtechnology.com/subscribe?coupon=0843016b Questions? Feedback? Write to: bigtechnologypodcast@gmail.com --- Wealthfront.com/bigtech⁠. If eligible for the overall boosted 4.15% rate offered with this promo, your boosted rate is subject to change if the 3.50% base rate decreases during the 3-month promo period. The Cash Account, which is not a deposit account, is offered by Wealthfront Brokerage LLC ("Wealthfront Brokerage"), Member FINRA/SIPC, not a bank. The Annual Percentage Yield ("APY") on cash deposits as of 11/7/25, is representative, requires no minimum, and may change at any time. The APY reflects the weighted average of deposit balances at participating Program Banks, which are not allocated equally. Wealthfront Brokerage sweeps cash balances to Program Banks, where they earn the variable base APY. Instant withdrawals are subject to certain conditions and processing times may vary. Learn more about your ad choices. Visit megaphone.fm/adchoices

Jaksot(515)

Who Wins if AI Models Commoditize? — With Mistral CEO Arthur Mensch

Who Wins if AI Models Commoditize? — With Mistral CEO Arthur Mensch

Arthur Mensch is the CEO and co-founder of Mistral. Arthur Mensch joins the Big Technology Podcast to discuss what the AI business looks like if all leading models perform the same. Tune in to hear ho...

14 Tammi 56min

AI’s Steve Jobs?, Big Tech AI Chaos Ladder, 2026 Crystal Ball

AI’s Steve Jobs?, Big Tech AI Chaos Ladder, 2026 Crystal Ball

M.G. Siegler of Spyglass is back for our monthly tech news discussion. Today we discuss whether AI needs a Steve Jobs, whether the technology lends itself to that type of leader, and who it might be o...

12 Tammi 54min

Claude Code’s Shining Moment, ChatGPT for Healthcare, End Of Busywork?

Claude Code’s Shining Moment, ChatGPT for Healthcare, End Of Busywork?

Ranjan Roy from Margins is back for our weekly discussion of the latest tech news. This week, we do our 2026 predictions in an abbreviated holiday-time episode. Here's what we cover: 1) Claude Code's ...

9 Tammi 56min

Coreweave: AI Bubble Poster Child Or The Next Tech Giant? — With Michael Intrator and Brian Venturo

Coreweave: AI Bubble Poster Child Or The Next Tech Giant? — With Michael Intrator and Brian Venturo

Michael Intrator is the CEO of Coreweave. Brian Venturo is the chief strategy officer at Coreweave. The two join Big Technology Podcast to discuss the company's rapid rise amid the AI boom and the cri...

7 Tammi 1h 1min

Meta's AI Agent Plan, Grok's Perversion, Prison Of Financial Mediocrity

Meta's AI Agent Plan, Grok's Perversion, Prison Of Financial Mediocrity

Ranjan Roy from Margins is back for our weekly discussion of the latest tech news. This week, we do our 2026 predictions in an abbreviated holiday-time episode. Here's what we cover: 1) Meta buys Manu...

2 Tammi 49min

Best of Big Technology: Demis Hassabis On AGI, Deceptive AIs, Building a Virtual Cell

Best of Big Technology: Demis Hassabis On AGI, Deceptive AIs, Building a Virtual Cell

Demis Hassabis is the CEO of Google DeepMind. He joined Big Technology Podcast in early 2025 discuss the cutting edge of AI and where the research is heading. In this conversation, we cover the path t...

31 Joulu 202557min

Alex And Ranjan's 2026 Outlook: ChatGPT 1 Billion, AI Shopping, Apple's Big Year, AI Love Boom

Alex And Ranjan's 2026 Outlook: ChatGPT 1 Billion, AI Shopping, Apple's Big Year, AI Love Boom

Ranjan Roy from Margins is back for our weekly discussion of the latest tech news. This week, we do our 2026 predictions in an abbreviated holiday-time episode. Here's what we cover: 1) AI agents star...

26 Joulu 202524min

2025 In Review, 2026 Predictions — With Reed Albergotti

2025 In Review, 2026 Predictions — With Reed Albergotti

Reed Albergotti is the technology editor at Semafor. Albergotti joins Big Technology Podcast to break down which companies are best positioned in the coming year. We cover Meta’s superintelligence gam...

24 Joulu 202543min

Suosittua kategoriassa Liike-elämä ja talous

sijotuskasti
mimmit-sijoittaa
rss-rahapodi
psykopodiaa-podcast
herrasmieshakkerit
rss-rahamania
ostan-asuntoja-podcast
hyva-paha-johtaminen
rss-sami-miettinen-neuvottelija
rahapuhetta
rss-lahtijat
rss-doulapodi
rss-paasipodi
juristipodi
rss-sisalto-kuntoon
rss-muutoksenanatomiaa-podcast
rss-startup-ministerio
rss-uppoava-vn-laiva
rss-bisnesta-bebeja
rss-seuraava-potilas