Incident Response
Destigmatising Mistakes: A Game Launch Incident Review
Making a mistake can be a horrible feeling. Guilt, shame, fear and anxiety all rolled into one. So in order to try reduce this pressure, I’m sharing a recent mistake our team made.
6 minute read
It's Just a Monitoring Change
Have you ever had a seemingly innocuous change to one system affect another in a catastrophic way? If yes, you might notice a few familiar themes in this write-up. If no, then read it now, before it’s too late.
11 minute read
Lessons Learned for Incident Commanders
Incident command is a reasonably new area of focus for SBG. In a nutshell we have a nominated technical person known as the Incident Commander (IC) who gives direction in order to resolve an incident and restore service as quickly as possible.
This blog post contains some of the insights and ‘lessons learned’ by our teams from their experiences in live incidents and exercises (known internally as fire drills) as they work to improve their skills and reduce our Mean Time To Resolution
13 minute read
H2OhNoes! Five lessons we can learn from old-world utility firms on how to handle outages
Utility companies have customers. And just like us, those customers expect a ubiquitous, always-on service provision. Are there therefore any lessons we can learn from an old, established industry like a utility company on how to handle outages?
7 minute read