What's the worst production failure you've seen? Scott Breitenother borrows a comedian's line to reframe the question: "When an escalator breaks, it just becomes stairs. When your data workload fails, it often just results in stale data." Early in his career, a failed pipeline meant panic. Page the team, drop everything, scramble to fix it. Over time, the real lesson was learning to separate the severity levels. A real error (wrong numbers going to an exec) is fundamentally different from a pipeline that didn't run and left the data three hours old instead of one. Most data failures fall into the second category. The dashboard is stale. The report is delayed. But the numbers, when they arrive, are correct. Understanding that distinction changes how teams build alerting, handle on-call rotations, and decide what actually deserves a 2am page. "I think we'll be OK." Listen to the full Data Renegades Podcast episode with Scott wherever you get your favorite podcasts. #DataEngineering #DataReliability #Analytics

Watch the full episode on youtube: https://youtu.be/qKFBaDWMxkk

Like
Reply

To view or add a comment, sign in

Explore content categories