Incident Response

Destigmatising Mistakes: A Game Launch Incident Review

Making a mistake can be a horrible feeling. Guilt, shame, fear and anxiety all rolled into one. So in order to try reduce this pressure, I’m sharing a recent mistake our team made.

Category:

Incident Response

Time:

6 minute read

It's Just a Monitoring Change

Have you ever had a seemingly innocuous change to one system affect another in a catastrophic way? If yes, you might notice a few familiar themes in this write-up. If no, then read it now, before it’s too late.

Category:

Incident Response

Time:

11 minute read

Lessons Learned for Incident Commanders

Incident command is a reasonably new area of focus for SBG. In a nutshell we have a nominated technical person known as the Incident Commander (IC) who gives direction in order to resolve an incident and restore service as quickly as possible.

This blog post contains some of the insights and ‘lessons learned’ by our teams from their experiences in live incidents and exercises (known internally as fire drills) as they work to improve their skills and reduce our Mean Time To Resolution

Author:

Patrick Holmes

Category:

Incident Response

Time:

13 minute read

H2OhNoes! Five lessons we can learn from old-world utility firms on how to handle outages

Utility companies have customers. And just like us, those customers expect a ubiquitous, always-on service provision. Are there therefore any lessons we can learn from an old, established industry like a utility company on how to handle outages?

Author:

Dan Adams

Category:

Incident Response

Time:

7 minute read