29.01.2013 Views

WebSphere Application Server V7.0: Concepts ... - IBM Redbooks

WebSphere Application Server V7.0: Concepts ... - IBM Redbooks

WebSphere Application Server V7.0: Concepts ... - IBM Redbooks

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

4.6.4 Testing<br />

Alerting is just a first part of your incident and problem management. For further<br />

information and more details about incident and problem management refer to<br />

the ITIL® pages at the following Web page:<br />

http://www.itlibrary.org/index.php?page=ITIL<br />

As with each component in your environment, do not forget to test your<br />

monitoring infrastructure regularly. Especially if the implementation is new, test<br />

every single monitoring alert and make sure that your monitoring detects each<br />

condition of your system properly.<br />

Do not stop your testing when you see a monitoring situation raised. Test the<br />

whole process, including alerting and incident management and ensure that<br />

conditions are reset automatically as soon as the situation is back to normal.<br />

4.7 Planning for backup and recovery<br />

4.7.1 Risk analysis<br />

In general, computer hardware and software is reliable, but sometimes failures<br />

can occur and damage a machine, network device, software product,<br />

configuration, or more importantly, business data. Do not underestimate the risk<br />

of a human error that might lead to damage. It is important to plan for such<br />

occurrences. There are a number of stages to creating a backup and recovery<br />

plan, which is discussed in the following sections.<br />

The first step to creating a backup and recovery plan is to complete a<br />

comprehensive risk analysis. The goal is to discover which areas are the most<br />

critical and which hold the greatest risk. It is important to identify which business<br />

processes are the most important and are prioritized accordingly.<br />

4.7.2 Recovery strategy<br />

When critical areas have been identified, develop a strategy for recovering those<br />

areas. There are numerous backup and recovery strategies available that vary in<br />

recovery time and cost. In most cases, the cost increases as the recovery time<br />

decreases. The key to the proper strategy is to find the proper balance between<br />

recovery time and cost. The business impact is the determining factor in finding<br />

the proper balance. Business-critical processes need quick recovery time to<br />

minimize business losses. Therefore, the recovery costs are greater.<br />

Chapter 4. Infrastructure 117

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!