Plan: record downtimes over the period of a year so that I can figure out downtime percentage
ie, 99% uptime is .01*365.25 = 3.6525 days of downtime
99.999% uptime is .00001*365.25 = 5.25 minutes of downtime
"five nines" is what I would prefer. Two nines is what I get.
Work (roughly 10k users in AD; not sure how many duplicate accounts)
20100512: power outage, more than 10 minutes. No warning, no explanation
20100327: power outage, less than 10 minutes. No warning, no explanation
20100201: network outage, 12 hours notice. Planned for 11pm-midnight for router fix.
20100116: power outage, between 10 and 15 minutes. No warning, no explanation
20091210: internet unavailable for 10 minutes, 7pm. No warning, no explanation
20090907 - 09: network disconnected due to policy violation for office room (Monday 7pm to Wednesday 3pm). 8 people affected. Policy was incorrectly enforced and no communication was made, resulting in 43 hours of no internet. There were other resources available, but severe inconvenience plus much administrative communication lead to almost no work being completed during outage.
20090821: network services outage. Projected noon to 1pm, but turned out to be noon to 3pm. 30 minute warning.
20090810: [power outage]: between midnight and 8am, unknown duration (but longer than 10 minutes). No warning or explanation.
20090801: [power outage]: 7.5 hours. 7:10a-1:40p 1 week warning. all computers had 75 days of uptime prior to outage.
20090724: [cluster filesystem corruption]: needed to restart all jobs (180 CPUs)
20090516: [power outage]: 1.5 hours, no warning, no explanation.
20090508: [power outage]: 10 seconds (due to storm). UPS was able to cover outage
20090419: "network-wide issues" no access to external DNS. 4pm - ?
20090409: [wireless outage]: 10 am, 15 minutes. Notice that it was down and notice when it was back up. No warning.
20090323: [power outage]: 2 hours. No warning or explanation
20090315: [internet outage] midnight to 6am. IT upgrade
-three day warning
20090221: [internet outage] 11pm - 5am. IT upgrade
-two day warning
20090211: [wireless outage] 6-8am. IT upgrade
20090206: [power outage] about an hour. 7:45pm, so I shutdown safely (thanks, UPS) and went home
-no warning, no explanation
-server had 38 days of uptime prior to the outage
20090130: [internet outage, all users] 10 minutes
-no warning, no explanation
20081222: [power outage] 7am - 8:30am, 1.5 hours
20081230: [power outage] 3:45pm - 5:15pm, 1.5 hours
-20 minute warning lead-time
20081113: [power outage] 3 hours
20081108: [core router failure] 4:20pm. 15 minutes
20081104: [network outage] 8am - 11am
20081016: [networked drives unavailable] noon - 5am (17 hrs)
20080809: [power outage] 4 hours
-2 day warning lead-time