IT Audit (8) IT Operations (29) IT Security (11)

Metrics That Matter - Part 3 - Change Success Rate

In my previous Metrics that Matter posts, I discussed how high performing IT organizations use mean time to repair (MTTR) and first fix rate (FFR). 

In this post, I'll excerpt from my recent article that covered another metric that stratified the high performers from medium and low performers that we have studied in the IT Controls Performance Study.

This key metric is change success rate.  The key finding from the study is that top performers have a more stringent definition of change success rate, AND have higher metrics.

A common measure of of change success rate includes the number of changes successfully deployed without creating an incident, service impairment or disruption of work.  However, top performers we studied focus on process as well as outcomes, and also include process exceptions as part of their definition of success.  To them, a change doesn’t necessarily have to cause an incident to be defined as unsuccessful.

For instance, when a change does not go accordingly to plan would also count as a process exception. This could be such as when a change exceeds the scheduled implementation time or varied from the deployment plan, requiring undocumented steps. High performers would count all these examples as failed changes. (We’ve observed that high performers use the term "change process exception" to define this condition, to distinguish this from a failed change.)

For a closer look at measures and variance of top, medium, and low performers, please reference this article. Link to full article.

Published Wednesday, May 16, 2007 8:57 AM by Gene Kim
Filed Under:

Comment Notification

If you would like to receive an email when updates are made to this post, please register here

Subscribe to this post's comments using RSS

Comments

# re: Metrics That Matter - Part 3 - Change Success Rate

The link to the full article is incorrect. It should be http://www.tripwire.com/company/articles.cfm?aid=19. not http://www.tripwire.com/company/articles.cfm?aid=18

Bad change :-)

Alec
Wednesday, July 11, 2007 1:01 AM by Alec Clews

# re: Metrics That Matter - Part 3 - Change Success Rate

Gene,

In your experience, do most organizations have stringent change verification processes? If they do not, it means that they will mark a change as successful, when infact the error appears after a few days or weeks.

So some incidents should point to incorrectly implemented changes. However by the time of the new incident the change ticket is often closed.

If change verification is not prevalent, it means that change failure rate is higher than has been noted.
Thursday, July 19, 2007 10:10 AM by Gaurav Rampal

# re: Metrics That Matter - Part 3 - Change Success Rate

Hello,

The best know metric is MTTRsolve. Anyone remotely associated with IT Operations is familiar with this. SLAs, SLGs and SLOs all use this metric.

Though Change Success Rate is a important metric, we need to show High Change Success Rate is directly proportional to low MTTR. Speaking in terms of low MTTR will get change detection and reconciliation  process and tools  higher buy-in across IT organizations.

In most cases, change tickets are marked successful, closed and reported before being verified. Resulting incidents, which could be days or weeks later, loose the link to the causal changes (which at this point is considered successful).

Thank you for your blog.




Tuesday, July 24, 2007 8:44 AM by Gaurav Rampal

What do you think?

(required) 
required 
(required)