PLAY PODCASTS
S5E12 - Missing insights and the SRE big picture — with Niall Murphy
Season 5 · Episode 12

S5E12 - Missing insights and the SRE big picture — with Niall Murphy

A discussion with Niall Murphy on the evolution of Site Reliability Engineering (SRE) and the challenges of building distributed systems

JUXT Cast · JUXT — A Grid Dynamics Company

September 6, 20241h 0m

Audio is streamed directly from the publisher (pinecast.com) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Show Notes

Episode Notes

Our guest is Niall Murphy, CEO of Stanza - a company founded by a group of experienced SREs with a vision to provide the tools, coding platform, culture and community to give any organization industry-leading reliability. Niall previously worked at Google where he co-authored the book "Site Reliability Engineering: How Google Runs Production Systems" (2016).

In this podcast episode, we discussed Niall's extensive experience including his role within an important era for Google's infrastructure transformation beginning in the late 2000s, and the wider contemporary challenges in the SRE landscape.

Niall's reflections on operating distributed systems has lead him to the conclusion that there is still a profound missing gap in SRE tooling between discovering 'signals' and taking 'actions'.

The conversation begins by alluding to a couple of other recent podcasts we've recorded on distributed systems in 2024, one with Mark Burgess and the other with András Gerlits.

Happy listening!

Topics

Clojuretechtechnologyfunctionalprogrammingprogrammingsoftware