Building EDR for AI: Controlling Autonomous Agents Before They Go Rogue with Ron Eddings

Building EDR for AI: Controlling Autonomous Agents Before They Go Rogue with Ron Eddings

AI agents aren't just reacting anymore, they're thinking, learning, and sometimes deleting your entire production database without asking. The real question isn't if your AI agent will be hacked, it's when, and whether you'll have the right hooks in place to stop it before it happens.

In this episode, Ron breaks down the ChatGPT Atlas vulnerability that shocked researchers, revealing how malicious prompts can turn AI assistants against their own users by bypassing safeguards and accessing file systems. He presents his new talk "Hooking Before Hacking," introducing a framework for applying EDR principles, prevention, detection, and response, to AI agents before they execute unauthorized commands. From pre-tool use hooks that catch malicious intent to one-time passwords that put humans back in the loop, this episode shares practical security controls you can implement today to prevent your AI agents from going rogue.

Impactful Moments:

00:00 - Introduction 02:00 - ChatGPT Atlas vulnerability exposed 04:00 - AI technology outpacing security guardrails 05:00 - Guardrail jailbreaks and prompt injection 06:00 - AI agents deleting production databases 07:00 - EDR principles for AI agents 09:00 - Pre-tool use hooks catch intention 11:00 - User prompt sanitization prevents leaks 14:00 - One-time passwords for agent workflows 16:00 - Automation mistakes across 10 years

Links:

Connect with Ron on LinkedIn: https://www.linkedin.com/in/ronaldeddings/

Check out the entire article here: https://www.yahoo.com/news/articles/cybersecurity-experts-warn-openai-chatgpt-101658986.html

GitHub Repository: https://hackervalley.com/hooking-before-hacking

See Ron's "Hooking Before Hacking" presentation slides here: http://hackervalley.com/hooking-before-hacking-presentation

Check out our website: https://hackervalley.com/

Upcoming events: https://www.hackervalley.com/livestreams

Love Hacker Valley Studio? Pick up some swag: https://store.hackervalley.com

Continue the conversation by joining our Discord: https://hackervalley.com/discord

Become a sponsor of the show to amplify your brand: https://hackervalley.com/work-with-us/

Join our creative mastermind and stand out as a cybersecurity professional: https://www.patreon.com/hackervalleystudio

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(424)

Episode 128 - A Masterclass on Being Yourself with Ashish Rajan

Episode 128 - A Masterclass on Being Yourself with Ashish Rajan

Ashish Rajan is a Cloud Security leader, podcaster, investor and fashion expert. He’s the Melbourne chapter leader of the OWASP Foundation and the Head of Security & Compliance. Ashish makes full use ...

24 Mar 202131min

Episode 127 - Finding Comfort in Being Uncomfortable with Paul Rivera

Episode 127 - Finding Comfort in Being Uncomfortable with Paul Rivera

Paul Rivera is the president and CEO of Def Logix, he built his own company from the ground up. Paul was born in the Bronx, NYC and grew up in both Queens, New York and Dallas/Fort Worth area of Texas...

17 Mar 202120min

Episode 126 - The Grit of Being World Champion Part 2 with Lee Kemp

Episode 126 - The Grit of Being World Champion Part 2 with Lee Kemp

In this episode we continue our conversation with Lee Kemp, a three time World Champion in Wrestling (1978, 1979 and 1982 all in the 74 kg weight class) and held the record for being the youngest Worl...

12 Mar 202123min

Episode 125 - The Grit of Being World Champion with Lee Kemp

Episode 125 - The Grit of Being World Champion with Lee Kemp

Our special guest this episode is Lee Kemp, a three time World Champion in Wrestling (1978, 1979 and 1982 all in the 74 kg weight class) and held the record for being the youngest World Champion. In a...

9 Mar 202137min

Episode 124 - The Learning Leader with Allan Alford

Episode 124 - The Learning Leader with Allan Alford

Introducing the Cyber Ranch Podcast and Allan Alford! Allan Alford is currently the Chief Technology Officer/Chief Information Security Officer at TrustMAPP. Allan Alford is a member of the Hacker Val...

2 Mar 202132min

Episode 123 - Adventures in Venture Capital with Lindsay Lee

Episode 123 - Adventures in Venture Capital with Lindsay Lee

Lindsay Lee is the founder and managing member of Authentic Ventures. Authentic Ventures is an early stage VC firm based in Oakland CA. Lindsay has worked many years in the investment industries as we...

25 Feb 202141min

We Are Here Finale: Rep. Yvette Clarke

We Are Here Finale: Rep. Yvette Clarke

Hacker Valley Studio presents: We Are Here - an audio journey and series exploring black excellence in technology and cybersecurity. In part three of this series, Ron and Chris interview Congresswoman...

23 Feb 202136min

Episode 121 - What Is Your IP Address with Chris Parker

Episode 121 - What Is Your IP Address with Chris Parker

In this episode of Hacker Valley Studio podcast, Ron and Chris are joined by Chris Parker, creator of WhatIsMyIPAddress. His website now reaches six million monthly visitors and began as a necessity t...

20 Feb 202127min

Populært innen Fakta

fastlegen
dine-penger-pengeradet
relasjonspodden-med-dora-thorhallsdottir-kjersti-idem
rss-bisarr-historie
foreldreradet
treningspodden
jakt-og-fiskepodden
rss-strid-de-norske-borgerkrigene
mikkels-paskenotter
rss-sunn-okonomi
sinnsyn
rss-kunsten-a-leve
dopet
hverdagspsyken
rss-kull
lederskap-nhhs-podkast-om-ledelse
fryktlos
hagespiren-podcast
gravid-uke-for-uke
rss-impressions-2