Building EDR for AI: Controlling Autonomous Agents Before They Go Rogue with Ron Eddings

Building EDR for AI: Controlling Autonomous Agents Before They Go Rogue with Ron Eddings

AI agents aren't just reacting anymore, they're thinking, learning, and sometimes deleting your entire production database without asking. The real question isn't if your AI agent will be hacked, it's when, and whether you'll have the right hooks in place to stop it before it happens.

In this episode, Ron breaks down the ChatGPT Atlas vulnerability that shocked researchers, revealing how malicious prompts can turn AI assistants against their own users by bypassing safeguards and accessing file systems. He presents his new talk "Hooking Before Hacking," introducing a framework for applying EDR principles, prevention, detection, and response, to AI agents before they execute unauthorized commands. From pre-tool use hooks that catch malicious intent to one-time passwords that put humans back in the loop, this episode shares practical security controls you can implement today to prevent your AI agents from going rogue.

Impactful Moments:

00:00 - Introduction 02:00 - ChatGPT Atlas vulnerability exposed 04:00 - AI technology outpacing security guardrails 05:00 - Guardrail jailbreaks and prompt injection 06:00 - AI agents deleting production databases 07:00 - EDR principles for AI agents 09:00 - Pre-tool use hooks catch intention 11:00 - User prompt sanitization prevents leaks 14:00 - One-time passwords for agent workflows 16:00 - Automation mistakes across 10 years

Links:

Connect with Ron on LinkedIn: https://www.linkedin.com/in/ronaldeddings/

Check out the entire article here: https://www.yahoo.com/news/articles/cybersecurity-experts-warn-openai-chatgpt-101658986.html

GitHub Repository: https://hackervalley.com/hooking-before-hacking

See Ron's "Hooking Before Hacking" presentation slides here: http://hackervalley.com/hooking-before-hacking-presentation

Check out our website: https://hackervalley.com/

Upcoming events: https://www.hackervalley.com/livestreams

Love Hacker Valley Studio? Pick up some swag: https://store.hackervalley.com

Continue the conversation by joining our Discord: https://hackervalley.com/discord

Become a sponsor of the show to amplify your brand: https://hackervalley.com/work-with-us/

Join our creative mastermind and stand out as a cybersecurity professional: https://www.patreon.com/hackervalleystudio

Episoder(410)

People-Focused Leadership in Cybersecurity with Cody Wass

People-Focused Leadership in Cybersecurity with Cody Wass

Cody Wass, VP of Services at NetSPI, brings his near-decade of experience to the pod to talk about longevity, development, and leadership. It’s no secret that cybersecurity is in need of people. Cody’...

15 Des 202225min

Improv-ing Your Way to Better Vendor Meetings With Brad Liggett

Improv-ing Your Way to Better Vendor Meetings With Brad Liggett

Brad Liggett, CTI Intel Engineer Manager at Cybersixgill, puts on his improv hat and joins the pod ready for anything. After COVID pressed pause on daily life, Brad kept himself sane and gained some n...

13 Des 202227min

Prioritizing & Proactive Cybersecurity with Richard Rushing

Prioritizing & Proactive Cybersecurity with Richard Rushing

Richard Rushing, CISO at Motorola Mobility, brings his decades of experience to the show this week to talk about leadership, communication, and perhaps most importantly of all: prioritization. After j...

6 Des 202241min

Keeping Cyber Course Prices Equitable with Kenneth Ellington

Keeping Cyber Course Prices Equitable with Kenneth Ellington

Kenneth Ellington, the Senior Cybersecurity Consultant at EY and Founder of the Ellington Cyber Academy, achieves his goal of being on the Hacker Valley Studio this week. From working at Publix in col...

29 Nov 202222min

Sharing Cyber Outside of the Security Bubble with Lesley Carhart

Sharing Cyber Outside of the Security Bubble with Lesley Carhart

Lesley Carhart, Director of Incident Response at Dragos, takes some time off mentoring cybersecurity practitioners, responding to OT incidents, and training in martial arts to hop on the mics this wee...

22 Nov 202229min

Challenges & Opportunities in Cyber Threat Intelligence with Brian Kime

Challenges & Opportunities in Cyber Threat Intelligence with Brian Kime

Brian Kime, VP of Intelligence Strategy and Advisory at ZeroFox, talks about all things threat intelligence this week. Brian explains why he chose threat intelligence as his focus, where he’s seen opp...

15 Nov 202231min

Hiring the Next Fractional CISO with Michael Piacente

Hiring the Next Fractional CISO with Michael Piacente

Michael Piacente, Managing Partner & Cofounder at Hitch Partners, answers the essential question on many cybersecurity professionals’ minds: Where do CISOs find CISO jobs? As it turns out, Michael hel...

11 Nov 202228min

Cultivating Client Trust at Cybercon with NTT’s Dirk Hodgson & Adam Green

Cultivating Client Trust at Cybercon with NTT’s Dirk Hodgson & Adam Green

Hacker Valley: On the Road is a curated collection of conversations that Chris and Ron have had during conferences and events around the globe. In this episode, NTT’s Dirk Hodgson, Director of Cyberse...

9 Nov 202240min

Populært innen Fakta

fastlegen
dine-penger-pengeradet
relasjonspodden-med-dora-thorhallsdottir-kjersti-idem
treningspodden
foreldreradet
jakt-og-fiskepodden
dopet
merry-quizmas
podme-bio-3
rss-strid-de-norske-borgerkrigene
sinnsyn
rss-kull
sovnlos
gravid-uke-for-uke
rss-var-forste-kaffe
hverdagspsyken
fryktlos
rss-kunsten-a-leve
tomprat-med-gunnar-tjomlid
dypdykk