AI Red Teaming & Securing Enterprise AI
AI Security Podcast16 Touko 2025

AI Red Teaming & Securing Enterprise AI

As AI systems become more integrated into enterprise operations, understanding how to test their security effectively is paramount.

In this episode, we're joined by Leonard Tang, Co-founder and CEO of Haize Labs, to explore how AI red teaming is changing.

Leonard discusses the fundamental shifts in red teaming methodologies brought about by AI, common vulnerabilities he's observing in enterprise AI applications, and the emerging risks associated with multimodal AI (like voice and image processing systems). We delve into the intricacies of achieving precise output control for crafting sophisticated AI exploits, the challenges enterprises face in ensuring AI safety and reliability, and practical mitigation strategies they can implement.

Leonard shares his perspective on the future of AI red teaming, including the critical skills cybersecurity professionals will need to develop, the potential for fingerprinting AI models, and the ongoing discussion around protocols like MCP.


Questions asked:

  • 00:00 Intro: AI Red Teaming's Evolution
  • 01:50 Leonard Tang: Haize Labs & AI Expertise
  • 05:06 AI vs. Traditional Red Teaming (Enterprise View)
  • 06:18 AI Quality Assurance: The Haize Labs Perspective
  • 08:50 AI Red Teaming: Real-World Application Examples
  • 10:43 Major AI Risk: Multimodal Vulnerabilities Explained
  • 11:50 AI Exploit Example: Voice Injections via Background Noise
  • 15:41 AI Vulnerabilities & Early XSS: A Cybersecurity Analogy
  • 20:10 Expert AI Hacking: Precisely Controlling AI Output for Exploits
  • 21:45 The AI Fingerprinting Challenge: Identifying Chained Models
  • 25:48 Fingerprinting LLMs: The Reality & Detection Difficulty
  • 29:50 Top Enterprise AI Security Concerns: Reputation & Policy
  • 34:08 Enterprise AI: Model Choices (Frontier Labs vs. Open Source)
  • 34:55 Future of LLMs: Specialized Models & "Hot Swap" AI
  • 37:43 MCP for AI: Enterprise Ready or Still Too Early?
  • 44:50 AI Security: Mitigation with Precise Input/Output Classifiers
  • 49:50 Future Skills for AI Red Teamers: Discrete Optimization


Resources discussed during the episode:

Baselines for Watermarking Large Language Models

Haize Labs

Tämä jakso on lisätty Podme-palveluun avoimen RSS-syötteen kautta eikä se ole Podmen omaa tuotantoa. Siksi jakso saattaa sisältää mainontaa.

Jaksot(55)

Why Asset Intelligence is Replacing the CMDB & Static Dashboards

Why Asset Intelligence is Replacing the CMDB & Static Dashboards

Why do CISOs still struggle with asset intelligence in 2026? Despite decades of security tooling, most organizations still have a massive 40% "dark matter" blind spot in their environment and the expl...

11 Kesä 42min

The AI AuthZ Problem: Why Human Least Privilege Fails for Autonomous Agents

The AI AuthZ Problem: Why Human Least Privilege Fails for Autonomous Agents

Why are security leaders terrified of connecting AI agents to production data? Because unlike humans, AI agents don't apply judgment, and they operate at machine speed, meaning they can relentlessly h...

4 Kesä 47min

Securing AI at the Speed of Engineering | DoorDash | Forward Deployed Security | GRC Engineering

Securing AI at the Speed of Engineering | DoorDash | Forward Deployed Security | GRC Engineering

Is your security team moving at the speed of your engineering team? In this special live recording of the AI Security Podcast from San Francisco, Ashish is joined by Nick Reva (Global Director, Engine...

21 Touko 1h 3min

Verification vs. Validation: How Autonomous AI is Changing Cybersecurity

Verification vs. Validation: How Autonomous AI is Changing Cybersecurity

Are autonomous AI agents operating unchecked in your enterprise? With the release of open source frameworks like OpenClaw, deploying an AI agent is now as simple as texting, but it comes with massive,...

13 Touko 1h 10min

The Zero-Click AI Hack: How to Contain the Blast Radius of Autonomous Agents

The Zero-Click AI Hack: How to Contain the Blast Radius of Autonomous Agents

Is an AI agent's identity a workload or an action? Ashish spoke to Elie Bursztein, Distinguished Research Scientist and co-author of Google SAIF (Secure AI Framework) about how it is neither and that ...

29 Huhti 47min

Buy vs. Build AI Security: Why [Box.com](http://Box.com) CISO is Creating their Own Agentic SOC

Buy vs. Build AI Security: Why [Box.com](http://Box.com) CISO is Creating their Own Agentic SOC

If your AI solution is just helping humans process the same amount of alerts a little faster, you haven't transformed anything, you've just created a faster hamster wheel.In this episode, Ashish and C...

22 Huhti 46min

Anthropic's Project Mythos: Why the "Zero-Day Machine" is Terrifying the Security Industry

Anthropic's Project Mythos: Why the "Zero-Day Machine" is Terrifying the Security Industry

In this episode, Ashish and Caleb discuss the internet-breaking preview of Project Mythos, an unreleased AI model from Anthropic that has shown an unprecedented, terrifying ability to reason through c...

18 Huhti 1h 3min

Are AI Security Startups Faking It? How to Separate Signal from Noise

Are AI Security Startups Faking It? How to Separate Signal from Noise

With over 70 startups claiming to have built the perfect "AI SOC Analyst" or "AI Threat Hunter," how do you separate the real products from the vaporware? Recorded live at Decibel RSAC Founder Festiva...

15 Huhti 47min