The One With Shannon Brady and Operating Systems

In this episode of the Prodcast, guest Shannon Brady speaks with hosts Jordan Greenberg and Florian Rathgeber about managing Google's vast fleet of internal devices. Shannon explains how Google's Linux platform applies core SRE principles (testing, canarying, and monitoring) to weekly staged rollouts of its Debian-based distribution. Configuration is managed with Puppet so that a diverse user base gets the right setup. The conversation then turns to "the year of Linux everything" and how widely Linux has been adopted. On AI, Shannon sees its greatest utility for SREs in rapidly analyzing signals and generating complex queries to resolve outages. The episode reinforces that practicing SRE fundamentals matters most: you can be an SRE at heart regardless of your official title.

Episodes (51)

Life of An SRE Episode 1: Tom Cranitch and Megan Yin

How does one become an SRE? And what's the career like? In this episode, Tom and Megan discuss their path to SRE.

12 Sep 2023, 27 min

Creating the SRE Prodcast with John Reese (JTR)

Host MP English and former Google SRE John Reese (JTR) chat about the creation of the Prodcast. Visit https://sre.google/prodcast for transcripts and links to further reading.

7 Jun 2022, 10 min

Postmortems with Ayelet Sachto

Ayelet Sachto offers advice on creating an actionable, transparent, and blameless postmortem culture. Visit https://sre.google/prodcast for transcripts and links to further reading.

31 May 2022, 28 min

Incident Management with Adrienne Walcer

Adrienne Walcer discusses how to approach and organize incident management efforts throughout the production lifecycle. Visit https://sre.google/prodcast for transcripts and links to further reading. ...

24 May 2022, 39 min

On-Call Rotations with Andrew Widdowson (APW)

Andrew Widdowson (APW) shares strategies for successful on-call rotations. Visit https://sre.google/prodcast for transcripts and links to further reading.

17 May 2022, 43 min

Automation with Pierre Palatin

Pierre Palatin dives into different automation strategies, how to build confidence in your system, and why designing the UI may be your biggest challenge. Visit https://sre.google/prodcast for transcr...

10 May 2022, 1 h

Client-Transparent Migrations with Pavan Adharapurapu

Pavan Adharapurapu details how to approach large-scale migrations while optimizing for user experience. Visit https://sre.google/prodcast for transcripts and links to further reading.

3 May 2022, 40 min

Rethinking SLOs with Narayan Desai

Narayan Desai explains why SLOs can be problematic and proposes alternative methods for monitoring complex, large-scale systems. Visit https://sre.google/prodcast for transcripts and links to further ...

26 Apr 2022, 25 min