ExpressCluster X 3.1 for Linux Getting Started Guide - Nec

More documents

Recommendations

Info

Chapter 1 What is a cluster system?Eliminating single point of failureHaving a clear picture of the availability level required or aimed is important in building a highavailability system. This means when you design a system, you need to study cost effectiveness ofcountermeasures, such as establishing a redundant configuration to continue operations andrecovering operations within a short period of time, against various failures that can disturb systemoperations.Single point of failure (SPOF), as described previously, is a component where failure can lead tostop of the system. In a cluster system, you can eliminate the system’s SPOF by establishing serverredundancy. However, components shared among servers, such as shared disk may become aSPOF. The key in designing a high availability system is to duplicate or eliminate this sharedcomponent.A cluster system can improve availability but failover will take a few minutes for switchingsystems. That means time for failover is a factor that reduces availability. Solutions for thefollowing three, which are likely to become SPOF, will be discussed hereafter although technicalissues that improve availability of a single server such as ECC memory and redundant powersupply are important. Shared diskShared diskAccess path to the shared diskLANTypically a shared disk uses a disk array for RAID. Because of this, the bare drive of the disk doesnot become SPOF. The problem is the RAID controller is incorporated. Shared disks commonlyused in many cluster systems allow controller redundancy.In general, access paths to the shared disk must be duplicated to benefit from redundant RAIDcontroller. There are still things to be done to use redundant access paths in Linux (described laterin this chapter). If the shared disk has configuration to access the same logical disk unit (LUN)from duplicated multiple controllers simultaneously, and each controller is connected to one server,you can achieve high availability by failover between nodes when an error occurs in one of thecontrollers.FailoverSPOFRAID5RAID5HBA(SCSI Card, FC NIC)Access PathRAID ControllerArray Disk24Figure 1-9: Example of the shared disk RAID controller and access paths being SPOF (left)and an access path connected to a RAID controllerWith a failover cluster system of data mirror type, where no shared disk is used, you can create anideal system having no SPOF because all data is mirrored to the disk in the other server. Howeveryou should consider the following issues:ExpressCluster X 3.1 for Linux Getting Started Guide
Eliminating single point of failureDisk I/O performance in mirroring data over the network (especially writing performance)System performance during mirror resynchronization in recovery from server failure (mirrorcopy is done in the background)Time for mirror resynchronization (clustering cannot be done until mirror resynchronizationis completed)In a system with frequent data viewing and a relatively small volume of data, choosing the datamirror type for clustering is a key to increase availability.Access path to the shared diskIn a typical configuration of the shared disk type cluster system, the access path to the shared diskis shared among servers in the cluster. To take SCSI as an example, two servers and a shared diskare connected to a single SCSI bus. A failure in the access path to the shared disk can stop the entiresystem.What you can do for this is to have a redundant configuration by providing multiple access paths tothe shared disk and make them look as one path for applications. The device driver allowing such iscalled a path failover driver. Path failover drivers are often developed and released by shared diskvendors. Path failover drivers in Linux are still under development. For the time being, asdiscussed earlier, offering access paths to the shared disk by connecting a server on an arraycontroller on the shared disk basis is the way to ensure availability in Linux cluster systems.ApplicationPath FailoverDriverApplicationPath FailoverDriverFigure 1-10: Path failover driverSection I Introducing ExpressCluster25
Page 1 and 2: ExpressCluster ® X 3.1 for LinuxGe
Page 3: © Copyright NEC Corporation 2011.
Page 6 and 7: Section II Installing ExpressCluste
Page 8 and 9: Shutdown and reboot of individual s
Page 10 and 11: ExpressCluster X Documentation SetT
Page 12 and 13: Contacting NECFor the latest produc
Page 15 and 16: Chapter 1What is a cluster system?T
Page 17 and 18: High Availability (HA) clusterShare
Page 19: High Availability (HA) clusterData
Page 22 and 23: Chapter 1 What is a cluster system?
Page 26 and 27: Chapter 1 What is a cluster system?
Page 29 and 30: Chapter 2Using ExpressClusterThis c
Page 31 and 32: Software configuration of ExpressCl
Page 33 and 34: Software configuration of ExpressCl
Page 35 and 36: Network partition resolutionNetwork
Page 37 and 38: Failover mechanismFailover resource
Page 39 and 40: Failover mechanismDifferent Applica
Page 41 and 42: Failover mechanismHardware configur
Page 43 and 44: Failover mechanismWhat is cluster o
Page 45 and 46: What is a resource?NAS resource (na
Page 47 and 48: What is a resource?Websphere monito
Page 49: Section IIInstallingExpressClusterT
Page 52 and 53: Chapter 3 Installation requirements
Page 74 and 75:
Chapter 3 Installation requirements
Page 76 and 77:
Chapter 3 Installation requirements
Page 79 and 80:
Chapter 4Latest version information
Page 81 and 82:
System requirements for WebManager
Page 83 and 84:
Page 85 and 86:
Page 87 and 88:
Page 89 and 90:
Page 91 and 92:
Page 93 and 94:
Page 95 and 96:
Page 97 and 98:
Page 99 and 100:
Page 101 and 102:
Page 103 and 104:
Page 105 and 106:
Page 107 and 108:
Chapter 5Notes and RestrictionsThis
Page 109 and 110:
Designing a system configurationHar
Page 111 and 112:
Designing a system configurationHar
Page 113 and 114:
Designing a system configurationNet
Page 115 and 116:
Designing a system configuration4.
Page 117 and 118:
Designing a system configurationThe
Page 119 and 120:
Installing operating systemIt is po
Page 121 and 122:
Installing operating systemsoftdogT
Page 123 and 124:
Before installing ExpressClusterSer
Page 125 and 126:
Before installing ExpressClusterCha
Page 127 and 128:
Before installing ExpressClusterUse
Page 129 and 130:
Notes when creating ExpressCluster
Page 131 and 132:
Page 133 and 134:
Page 135 and 136:
Page 137 and 138:
After start operating ExpressCluste
Page 139 and 140:
Page 141 and 142:
Page 143 and 144:
Page 145 and 146:
Page 147 and 148:
Page 149 and 150:
Page 151:
Updating ExpressClusterUpdating Exp
Page 154 and 155:
Chapter 6 Upgrading ExpressClusterH
Page 157:
Appendix• Appendix A Glossary•
Page 160 and 161:
Appendix A GlossaryNodeHeartbeatPub
Page 162:
Appendix B Indexmonitorable and non
show all

ExpressCluster X 3.1 for Linux Getting Started Guide - Nec

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?