Cole Wyeth, PhD Student at the University of Waterloo, on Why We Should Wait to Build Superintelligent AI

Cole Wyeth, PhD Student at the University of Waterloo, on Why We Should Wait to Build Superintelligent AI

In the AI Risk Reward podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com , interviews guests about balancing the risk and reward of Artificial Intelligence for you, your business, and society as a whole. Podcast production and sound engineering by Troutman Street Audio. You can find them on LinkedIn.

In this deep dive episode, Alec speaks with Cole Wyeth, PhD student at the University of Waterloo focused on AI safety and agent foundations, about why the long-term risk of superintelligent AI deserves far more attention today. Cole explains that aligning advanced systems with human values is extraordinarily difficult because ethics and preferences are hard to specify, and he argues that corrigibility, ambiguity awareness, and deference to humans are essential design goals. He also discusses how ideas like imprecise probability, embedded agency, and multi-agent dynamics can help researchers think more clearly about failure modes, reward hacking, and unexpected cooperation between AI systems. Throughout the conversation, Cole compares controlling superintelligence to cybersecurity, warning that a system smarter than its designers may find weaknesses in any safety scheme that looks secure on paper. The episode closes on a cautious note: until we understand how to reliably control self-improving AI, Cole believes society should slow down and wait years, or even decades, before creating superintelligent systems.

Summary:

  • Long-Term AI Risk: Cole Wyeth argues that superintelligent AI could become uncontrollable if developed before robust safety methods are in place.
  • Alignment Challenges: He explains that human ethics and values are too complex to formalize cleanly, making alignment an unusually hard technical problem.
  • Ambiguity and Deference: The discussion highlights the importance of building systems that recognize uncertainty and defer to humans in high-stakes situations.
  • Multi-Agent Failure Modes: Cole explores how AI systems may cooperate or behave strategically in unexpected ways, creating new safety and governance concerns.
  • Pause for Caution: His central takeaway is that society should delay building superintelligence until researchers better understand how to control it safely.

Referenced in this episode:

Companies/Organizations:

  • University of Waterloo
  • Verapath
  • Anthropic
  • OpenAI
  • DeepMind
  • Google
  • ARC
  • METR
  • Troutman Street Audio
  • Waters Technology

Copyright © 2026 by Artificial Intelligence Risk, Inc.

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(94)

The AI Risk No One Sees Coming — with Kriste Krstovski of Columbia University

The AI Risk No One Sees Coming — with Kriste Krstovski of Columbia University

In the AI: Trust but Verify podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com , interviews guests about balancing the risk and reward of ...

26 Maj 59min

Elie Bursztein of Google DeepMind on Mythos and the Cybersecurity Wake-Up Call for Financial Services

Elie Bursztein of Google DeepMind on Mythos and the Cybersecurity Wake-Up Call for Financial Services

In the AI: Trust but Verify podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com , interviews guests about balancing the risk and reward of ...

12 Maj 49min

Jack Hubbard on AI in Banking, Staying Safe With AI, and Building a Career Through Diverse Roles

Jack Hubbard on AI in Banking, Staying Safe With AI, and Building a Career Through Diverse Roles

In the AI Risk Reward podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com , interviews guests about balancing the risk and reward of Artifi...

28 Apr 49min

Matthew Rosenquist on AI, Cyber Risk, and the Future of Defense

Matthew Rosenquist on AI, Cyber Risk, and the Future of Defense

In the AI Risk Reward podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com , interviews guests about balancing the risk and reward of Artifi...

21 Apr 51min

Antony Baker, CEO and Founder of FIFTEEN Group, on Using AI to Identify the Right People for Your Company

Antony Baker, CEO and Founder of FIFTEEN Group, on Using AI to Identify the Right People for Your Company

In the AI Risk Reward podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com , interviews guests about balancing the risk and reward of Artifi...

14 Apr 54min

Aleks Jakulin of Data.Flowers on Governing AI Through Accountability and Resilience, Not Output Control

Aleks Jakulin of Data.Flowers on Governing AI Through Accountability and Resilience, Not Output Control

In the AI Risk Reward podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com, interviews guests about balancing the risk and reward of Artific...

7 Apr 1h 8min

Is AI Making Us Stupid? Michael Erlihson, PhD, Head of AI at DriveNet

Is AI Making Us Stupid? Michael Erlihson, PhD, Head of AI at DriveNet

In the AI Risk Reward podcast, our host, Alec Crawford (@alec06830), Founder and CEO of Artificial Intelligence Risk, Inc. aicrisk.com , interviews guests about balancing the risk and reward of Artifi...

31 Mars 43min

Populärt inom Business & ekonomi

framgangspodden
varvet
badfluence
rss-jossan-nina
rss-borsens-finest
uppgang-och-fall
avanzapodden
bathina-en-podcast
svd-tech-brief
lastbilspodden
fill-or-kill
rss-inga-dumma-fragor-om-pengar
rss-dagen-med-di
rss-svart-marknad
tabberaset
dynastin
rss-kort-lang-analyspodden-fran-di
borsmorgon
bilar-med-sladd
market-makers