How chaos engineering preps developers for the ultimate game day

How chaos engineering preps developers for the ultimate game day

In complex service-oriented architectures, failure can happen in individual servers and containers, then cascade through your system. Good engineering takes into account possible failures. But how do you test whether a solution actually mitigates failures without risking the ire of your customers? That’s where chaos engineering comes in, injecting failures and uncertainty into complex systems so your team can see where your architecture breaks.

On this sponsored episode, our fourth in the series with Intuit, Ben and Ryan chat with Deepthi Panthula, Senior Product Manager, and Shan Anwar, Principal Software Engineer, both of Intuit about how use self-serve chaos engineering tools to control the blast radius of failures, how game day tests and drills keep their systems resilient, and how their investment in open-source software powers their program.

Episode notes:

Sometimes old practices work in new environments. The Intuit team uses Failure Mode Effect Analysis, (FMEA), a procedure developed by the US military in 1949, to ensure that their developers understand possible points of failure before code makes it to production.

The team uses Litmus Chaos to inject failures into their Kubernetes-based system and power their chaos engineering efforts. It’s open source and maintained by Intuit and others.

If you’ve been following this series, you’d know that Intuit is a big fan of open-source software. Special shout out to Argo Workflow, which makes their compute-intensive Kubernetes jobs work much smoother.

Connect on LinkedIn with Deepthi Panthula and Zeeshan (Shan) Anwar.

If you want to see what Stack Overflow users are saying about chaos engineering, check out

Chaos engineering best practice

, asked by

User NingLee

two years ago.

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(948)

What it takes to be a player in the international AI game

What it takes to be a player in the international AI game

From the floor of HumanX, Ryan welcomes Songyee Yoon, managing partner at Principal Venture Partners (PVP), to chat about AI development outside the US, from the need to adapt models to local language...

2 Juni 26min

The find out stage of AI is just supply chain and password protection

The find out stage of AI is just supply chain and password protection

In this two-for-one special recorded at HumanX, Ryan is joined by Dataiku’s Florian Douetteau to chat about the governance, orchestration, and data requirements for serious agentic systems and 1Passwo...

29 Maj 30min

Do you have what it takes to run AI in production?

Do you have what it takes to run AI in production?

From the floor of HumanX, Ryan Donovan is joined by Peter Salanki, CTO and co-founder of CoreWeave, to chat about what it really takes to run AI in production; the growing importance of observability,...

26 Maj 27min

Breaking your AI storage bottlenecks

Breaking your AI storage bottlenecks

Recorded at HumanX, Ryan sits down with Garima Kapoor and Anand Babu Periasamy, co-founders and co-CEOs of MinIO, to chat about eliminating the storage bottlenecks that leave GPUs underutilized, their...

22 Maj 29min

Pack your agentic stack in Slack

Pack your agentic stack in Slack

SPONSORED BY SLACK BY SALESFORCERyan welcomes Jaime DeLanghe, chief product officer at Slack, to chat about how they’re preparing to integrate everybody’s agents in their chat application. They chat a...

20 Maj 29min

Your fridge could be a threat to national security

Your fridge could be a threat to national security

On the floor of HumanX, Ryan is joined by Adam Meyers,  Senior VP of Counter Adversary Operations at Crowdstrike, for a deep dive on their latest Global Threat Report that tracks over 281 adversaries ...

19 Maj 29min

Observability and human intuition in an AI world

Observability and human intuition in an AI world

In this two for one episode recorded at HumanX, Ryan is first joined by Christine Yen, CEO of Honeycomb, to discuss how AI compresses the software development lifecycle, making observability about cap...

15 Maj 29min

How Braze’s CTO is rethinking engineering for the agentic area

How Braze’s CTO is rethinking engineering for the agentic area

Jon Hyman, co-founder and CTO of Braze, joins Stack Overflow CPTO Jody Bailey on Leaders of Code to share how he's led the company's engineering organization over nearly 15 years of growth — and how t...

13 Maj 43min

Populärt inom Business & ekonomi

framgangspodden
varvet
badfluence
rss-borsens-finest
uppgang-och-fall
avanzapodden
rss-dagen-med-di
borsmorgon
rss-kort-lang-analyspodden-fran-di
tabberaset
bathina-en-podcast
lastbilspodden
24fragor
fill-or-kill
rss-inga-dumma-fragor-om-pengar
kapitalet-en-podd-om-ekonomi
svd-tech-brief
rss-den-nya-ekonomin
rss-jossan-nina
bilar-med-sladd