June 24, 2002 12:00 AM

Measuring High Availability

Windows IT Pro
InstantDoc ID #25315

The term high availability is meaningless until you define how to measure it. IT shops measure high availability as the percentage of time that systems are available, but the provider and consumer of high-availability services must agree on what constitutes availability and how the time is measured. Typically, the availability percentage is calculated as follows:

x = (n - y) * 100/n

where x is the availability percentage, n is the total number of minutes in a given calendar month, and y is the total number of minutes that service is unavailable in that month. To calculate the availability percentage, you must know the total number of minutes in the service period, as well as the minutes of downtime that you can exclude from y. Typical exclusions are scheduled maintenance hours, planned downtime (e.g., to quarantine viruses, to react to a security threat), and acts of force majeure.

For example, a 31-day month contains 31 * 24 * 60, or 44,640, minutes. If a server is unavailable for 15 minutes because of an unexpected crash and automatic reboot, the availability percentage is 99.97 percent. If the server is also down for 3 hours for a scheduled hardware replacement, total downtime rises to 195 minutes and the availability drops to 99.56 percent if you don't exclude scheduled maintenance. (The availability percentage remains at 99.97 percent if you exclude scheduled maintenance.) Suppose you offer a $100,000 monthly guarantee on meeting a 99.9 percent service level agreement (SLA): 99.97 percent meets that target and 99.56 percent misses it, so an ambiguity with respect to exclusions might cost you every penny of that $100,000.
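The arithmetic in this example can be checked with a short Python sketch. The function name availability_percent and its excluded_minutes parameter are illustrative choices, not part of any standard tool, and the sketch assumes that excluded downtime is subtracted from y while n stays at the full month. Some agreements also subtract excluded time from n, which would change the result slightly.

```python
def availability_percent(total_minutes, downtime_minutes, excluded_minutes=0):
    """Return x = (n - y) * 100 / n.

    Assumption: excluded downtime (e.g., scheduled maintenance) is removed
    from y only; n remains the full number of minutes in the period.
    """
    counted_downtime = downtime_minutes - excluded_minutes
    return (total_minutes - counted_downtime) * 100 / total_minutes


minutes_in_month = 31 * 24 * 60   # 44,640 minutes in a 31-day month
crash_minutes = 15                # unexpected crash and automatic reboot
maintenance_minutes = 3 * 60      # scheduled hardware replacement

total_downtime = crash_minutes + maintenance_minutes

# Scheduled maintenance counted as downtime.
not_excluded = availability_percent(minutes_in_month, total_downtime)

# Scheduled maintenance excluded from the calculation.
excluded = availability_percent(
    minutes_in_month, total_downtime, excluded_minutes=maintenance_minutes
)

sla_target = 99.9
print(round(not_excluded, 2), not_excluded >= sla_target)  # 99.56 False
print(round(excluded, 2), excluded >= sla_target)          # 99.97 True
```

The same outage history passes or fails the 99.9 percent target depending only on whether the exclusion applies, which is why the agreement needs to spell it out.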

In addition to the duration of an outage, you also need to consider the frequency of outages. Suppose you want to offer a 99.5 percent SLA for a server, exclusive of scheduled maintenance. That availability percentage lets you have the server offline for a maximum of 3 hours and 43 minutes in a 31-day month. You could have one 3-hour, 43-minute outage, two 1-hour, 51-minute outages, or three 1-hour, 14-minute outages. Obviously, the more often a server crashes, the faster you must restore it each time to meet your monthly SLA.
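The trade-off between outage frequency and restore time can be sketched the same way. The helper names below are hypothetical, and the sketch assumes the monthly budget is split evenly across outages and that partial minutes are rounded down so the budget is never overstated.

```python
def downtime_budget_minutes(sla_percent, total_minutes):
    """Return the maximum downtime, in minutes, that still meets the SLA."""
    return total_minutes * (100 - sla_percent) / 100


def format_minutes(minutes):
    """Format a duration as hours and whole minutes, rounding down."""
    hours, mins = divmod(int(minutes), 60)
    return f"{hours} h {mins} min"


minutes_in_month = 31 * 24 * 60   # 44,640 minutes in a 31-day month
budget = downtime_budget_minutes(99.5, minutes_in_month)

# Assumption: the budget is divided evenly among the outages.
for outages in (1, 2, 3):
    per_outage = budget / outages
    print(f"{outages} outage(s): restore each within {format_minutes(per_outage)}")
```

Run as written, this prints 3 h 43 min, 1 h 51 min, and 1 h 14 min for one, two, and three outages, matching the figures above.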

Windows is a trademark of the Microsoft group of companies. Windows IT Pro is used by Penton Media Inc. under license from owner.