Dell Power Solutions

More documents

Recommendations

Info

HIGH-PERFORMANCE COMPUTINGBandwidth (MB/sec)12001000800600400200131Write bandwidthRead bandwidth1622503194456264871123Scalability (relative performance)87654321Write bandwidthRead bandwidth01/2 2/4 4/8 8/16Number of segment servers/number of clients00 1 2 3 4 5 6 7 8Number of segment servers (2 clients per server)Figure 7. Measured bandwidth for the I/O subsystem using IOzonesubsystem using one or more segment servers. Four configurationswere used to test the scalability of the I/O subsystem—onesegment server serving data to 2 clients, two segment serversfor 4 clients, four segment servers for 8 clients, and eight segmentservers for 16 clients. The I/O read bandwidth ranged from162 MB/sec for the single segment server to 1123 MB/sec for theeight segment servers. Similarly, the write bandwidth rangedfrom 131 MB/sec to 487 MB/sec for the different configurationsizes of the cluster.Figure 8 shows the scalability of the I/O subsystem as the sizeof the cluster increased from 2 clients to 16 clients and the numberof segment servers increased from one to eight—maintaining a2:1 client-to-segment-server ratio. These results show near-linearscalability of read bandwidth. Write bandwidth scales well for upto four segment servers, but beyond that, it is limited by overheadon the CX700 that results from the steps the array takes to providedata protection (as described in the “An overview of the Dell/EMCCX700 storage array” section in this article). The CX700 is designedto ensure that data written to the array (even data written only tothe write cache) will survive any single failure. If I/Os to the arraysatisfy certain alignment and size conditions, then the write cachecan be bypassed and systems can achieve higher write bandwidththan that obtained in this study.Figure 8. Performance scalability of the I/O subsystemAcknowledgmentsThe authors would like to thank the Dell HPCC team and ThomasEastham, Mike Wolak, and Wayne Paquette from IBRIX Inc. for theirimmense help in writing this article.Amina Saify is a member of the Scalable Systems Group at Dell. Amina has a bachelor’sdegree in Computer Science from Devi Ahilya University (DAVV) in India, and a master’s degreein Computer and Information Science from The Ohio State University.Ramesh Radhakrishnan, Ph.D., is a systems engineer on the Dell HPCC team. Hisareas of interest are computer architecture and performance analysis. Ramesh has a Ph.D.in Computer Engineering from The University of Texas at Austin.Sudhir Srinivasan, Ph.D., is the chief technology officer of IBRIX Inc. His interests includestorage systems, operating systems, and distributed computing. He has a Ph.D. in ComputerScience from the University of Virginia.Onur Celebioglu is an engineering manager in the Scalable Systems Group at Dell and isresponsible for developing HPC clustering products. His current areas of focus are networkingand HPC interconnects. Onur has an M.S. in Electrical and Computer Engineering fromCarnegie Mellon University.Building a high-performance computing clusterwith Dell systemsThe Dell HPCC team used a Dell/EMC storage array and the IBRIXfile system to help evaluate the performance scalability of an I/Osubsystem for a commonly used HPC cluster scenario. The findingsof this study indicate that Dell PowerEdge servers, combined witha Dell/EMC CX700 storage array and the optimized IBRIX parallelfile system, can provide a high-performing, scalable, and economicalcluster solution for HPC environments.Dell HPC clusters:www.dell.com/hpccFOR MORE INFORMATION132POWER SOLUTIONS Reprinted from Dell Power Solutions, February 2005. Copyright © 2005 Dell Inc. All rights reserved. February 2005
HIGH-PERFORMANCE COMPUTINGPlanning Considerations forJob Scheduling in HPC ClustersAs cluster installations continue growing to satisfy ever-increasing computing demands,advanced schedulers can help improve resource utilization and quality of service. Thisarticle discusses issues related to job scheduling on clusters and introduces schedulingalgorithms to help administrators select a suitable job scheduler.BY SAEED IQBAL, PH.D.; RINKU GUPTA; AND YUNG-CHIN FANGCluster installations primarily comprise two types ofstandards-based hardware components—servers andnetworking interconnects. Clusters are divided into twomajor classes: high-throughput computing clusters andhigh-performance computing clusters. High-throughputcomputing clusters usually connect a large number ofnodes using low-end interconnects. In contrast, highperformancecomputing clusters connect more powerfulcompute nodes using faster interconnects than highthroughputcomputing clusters. Fast interconnects aredesigned to provide lower latency and higher bandwidththan low-end interconnects.These two classes of clusters have different schedulingrequirements. In high-throughput computing clusters, themain goal is to maximize throughput—that is, jobs completedper unit of time—by reducing load imbalance amongcompute nodes in the cluster. Load balancing is particularlyimportant if the cluster has heterogeneous compute nodes.In high-performance computing clusters, an additional considerationarises: the need to minimize communicationoverhead by mapping applications appropriately to theavailable compute nodes. High-throughput computing clustersare suitable for executing loosely coupled parallel ordistributed applications, because such applications do nothave high communication requirements among computenodes during execution time. High-performance computingclusters are more suitable for tightly coupled parallelapplications, which have substantial communication andsynchronization requirements.A resource management system manages the processingload by preventing jobs from competing with eachother for limited compute resources. Typically, a resourcemanagement system comprises a resource manager and ajob scheduler (see Figure 1). Most resource managers havean internal, built-in job scheduler, but system administratorscan usually substitute an external scheduler for theinternal scheduler to enhance functionality. In either case,the scheduler communicates with the resource manager toobtain information about queues, loads on compute nodes,and resource availability to make scheduling decisions.Usually, the resource manager runs several daemonson the master node and compute nodes including a schedulerdaemon, which typically runs on the master node. Theresource manager also sets up a queuing system for usersto submit jobs—and users can query the resource managerto determine the status of their jobs. In addition, a resourcemanager maintains a list of available compute resourcesand reports the status of previously submitted jobs to theuser. The resource manager helps organize submitted jobsbased on priority, resources requested, and availability.As shown in Figure 1, the scheduler receives periodicinput from the resource manager regarding job queues andavailable resources, and makes a schedule that determinesthe order in which jobs will be executed. This is done whilewww.dell.com/powersolutions Reprinted from Dell Power Solutions, February 2005. Copyright © 2005 Dell Inc. All rights reserved. POWER SOLUTIONS 133
Page 1 and 2:
DELL POWER SOLUTIONS • FEBRUARY 2
Page 3 and 4:
POWERSOLUTIONSTHE MAGAZINE FOR DIRE
Page 5 and 6:
© 2005 Quantum Corporation. All ri
Page 7 and 8:
Dave is on vacation.He’s been not
Page 9 and 10:
UTILITY=AVAILABILITY.From SAP to BE
Page 11 and 12:
EXECUTIVE INSIGHTSleading third-par
Page 13 and 14:
NEW-GENERATION SERVER TECHNOLOGYis
Page 15 and 16:
NEW-GENERATION SERVER TECHNOLOGYAno
Page 17 and 18:
NEW-GENERATION SERVER TECHNOLOGYThe
Page 19 and 20:
The industry’s preeminent source
Page 21 and 22:
NEW-GENERATION SERVER TECHNOLOGYI/O
Page 23 and 24:
Will yours be there when you need i
Page 25 and 26:
NEW-GENERATION SERVER TECHNOLOGYas
Page 27 and 28:
NEW-GENERATION SERVER TECHNOLOGYThe
Page 29 and 30:
NEW-GENERATION SERVER TECHNOLOGYpor
Page 31 and 32:
More data? Less time?No problem.Del
Page 33 and 34:
NEW-GENERATION SERVER TECHNOLOGYser
Page 35:
NEW-GENERATION SERVER TECHNOLOGYDTK
Page 38 and 39:
NEW-GENERATION SERVER TECHNOLOGYpac
Page 40 and 41:
NEW-GENERATION SERVER TECHNOLOGYman
Page 42 and 43:
NEW-GENERATION SERVER TECHNOLOGYFig
Page 44 and 45:
NEW-GENERATION SERVER TECHNOLOGYSun
Page 46 and 47:
NEW-GENERATION SERVER TECHNOLOGYTab
Page 48 and 49:
NEW-GENERATION SERVER TECHNOLOGYMan
Page 50 and 51:
Page 52 and 53:
Page 54 and 55:
SYSTEMS MANAGEMENTsuch as:Dell Upda
Page 56 and 57:
SYSTEMS MANAGEMENTThis hardware-cen
Page 58 and 59:
SYSTEMS MANAGEMENTCLI taskFigure 4.
Page 60 and 61:
SYSTEMS MANAGEMENTManaging Dell Cli
Page 62 and 63:
SYSTEMS MANAGEMENTFigure 2. OMCA De
Page 64 and 65:
SYSTEMS MANAGEMENTAgentless Monitor
Page 66 and 67:
SYSTEMS MANAGEMENTWeb serverGlobal
Page 68 and 69:
SYSTEMS MANAGEMENTorganizations can
Page 70 and 71:
STORAGE• Multi-staged disk backup
Page 72 and 73:
STORAGEExec Advanced Disk-Based Bac
Page 74 and 75:
STORAGEPrimary disk (RAID)Figure 1.
Page 76 and 77:
STORAGESTORAGEFREESubscriptionReque
Page 78 and 79:
STORAGEcentralized backup can offer
Page 80 and 81:
STORAGEoccurs. Replication can be s
Page 82 and 83:
STORAGEBackup concepts, while firml
Page 84 and 85: SCALABLE ENTERPRISEFile Systemsfor
Page 86 and 87: SCALABLE ENTERPRISEServer 1 Server
Page 88 and 89: SCALABLE ENTERPRISEThe Promise ofUn
Page 90 and 91: SCALABLE ENTERPRISELANExternalcommu
Page 92 and 93: SCALABLE ENTERPRISErequired for dif
Page 94 and 95: SCALABLE ENTERPRISEFigure 1 shows v
Page 96 and 97: SCALABLE ENTERPRISEDeploying and Ma
Page 98 and 99: SCALABLE ENTERPRISEapplication serv
Page 100 and 101: SCALABLE ENTERPRISEExploitingAutoma
Page 102 and 103: SCALABLE ENTERPRISEFigure 2. Perfor
Page 104 and 105: SCALABLE ENTERPRISEMigrating Oracle
Page 106 and 107: SCALABLE ENTERPRISEexport and impor
Page 108 and 109: SCALABLE ENTERPRISE8> 'm:\expdata\o
Page 110 and 111: SCALABLE ENTERPRISEClientsPublic LA
Page 112 and 113: SCALABLE ENTERPRISEnodes, the clust
Page 114 and 115: HIGH-PERFORMANCE COMPUTING(Red Hat
Page 116 and 117: HIGH-PERFORMANCE COMPUTINGFor monit
Page 118 and 119: HIGH-PERFORMANCE COMPUTINGin a clus
Page 120 and 121: HIGH-PERFORMANCE COMPUTINGPerforman
Page 122 and 123: HIGH-PERFORMANCE COMPUTINGPowerEdge
Page 124 and 125: HIGH-PERFORMANCE COMPUTING2.50Power
Page 126 and 127: HIGH-PERFORMANCE COMPUTINGApplicati
Page 129 and 130: HIGH-PERFORMANCE COMPUTINGthe incre
Page 131 and 132: HIGH-PERFORMANCE COMPUTINGCompute n
Page 133: HIGH-PERFORMANCE COMPUTINGIBRIX Fus
Page 137 and 138: HIGH-PERFORMANCE COMPUTINGFeatureCo
Page 139 and 140: HIGH-PERFORMANCE COMPUTINGUnderstan
Page 141 and 142: HIGH-PERFORMANCE COMPUTINGprovides
Page 143 and 144: Oracle DatabaseWorld’s #1 Databas
show all

Dell Power Solutions

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?