David Blank-Edelman and Michael Stiefel dive into why chasing a single “root cause” in outages is a wild goose chase: reliability, like latency or throughput, is an emergent system property. By pairing site reliability engineers with architects, you get real-world feedback on how your system really behaves—so you can build platforms that evolve, degrade gracefully, and retire cleanly when the time comes.
Instead of fixating on what went wrong, celebrate what went right and feed those insights back into your designs. Want the full scoop? Check out the interview transcript, subscribe to InfoQ’s Software Architects’ Newsletter for monthly nuggets, and catch their QCon events and podcasts for more SRE and architecture gold.
Watch on YouTube
Top comments (0)