Later on recovery disk failure, the English name of the Case Study is necessary to do, of course, depending on the level of failure, each failure can not do both Case Study, unless adequate staff and time;
A document capability is the ability, general engineer of the document with little capability or general, but in fact there are general types of document templates, based on the content of the template can be filled more with less.
Failure to have a record, every company should have a wiki, these re-set should be recorded, you can learn a lot. Case Study will take a lot of time, but mid-level and major fault is still necessary.
Described below is re-set the whole routine:
the reason:
Host physical failure where the cloud host multiple servers at the same time lead to downtime.
The impact surface
The total amount of requests: 584472
Subsequent optimization
- The cloud host broken, distributed on a physical host nowhere.
The above is a simple failure recovery disk model, the first step is to restore the entire fault according to the time line beginning to end of the process, the second is to identify the problem (root cause), the third is to see what specific improvements and optimization, to avoid recurrence of similar failures.