Magellan Final Report - Office of Science - U.S. Department of Energy
Magellan Final Report
• Merge. Some scientific applications require merging two or more data sources. For example, observation data might need to be merged with simulation data. Such a merge operation results in an output data volume that is significantly larger than the input data size.
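The merge pattern above can be sketched as a simple key join, purely for illustration (the data sources and the shared key are hypothetical, not the report's actual datasets): when one key matches several records on each side, the joined output grows beyond either input.

```python
from collections import defaultdict

def merge(observations, simulations):
    """Join two data sources on a shared key. Each input is an
    iterable of (key, record) pairs. One key may match many records
    on each side, so the merged output volume can be significantly
    larger than either input."""
    by_key = defaultdict(list)
    for key, sim in simulations:
        by_key[key].append(sim)
    merged = []
    for key, obs in observations:
        # Pair every observation with every matching simulation record.
        for sim in by_key[key]:
            merged.append((key, obs, sim))
    return merged
```

With two observations and two simulations sharing one key, the join emits four records, illustrating the output growth the bullet describes.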
We developed a synthetic benchmark suite to represent each of these data operations. We used the 6TB Wikipedia dataset for our analysis, and a subset of this data for some experiments as noted below. For the filter operation, we constructed an index map of all the titles of the Wikipedia pages and their corresponding files. For the reorder operation, we perform a data conversion of the page records. For the merge operation, we create an index of all lines of the Wikipedia data.
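The title-to-file index used by the filter benchmark can be sketched as follows. This is a minimal illustration, not the report's Hadoop implementation: the file layout and the `(filename, title)` pairing are assumptions.

```python
def build_title_index(pages):
    """Map each page title to the file that contains it, so a filter
    query can locate the relevant files without a full scan.
    `pages` is an iterable of (filename, title) pairs."""
    index = {}
    for filename, title in pages:
        index[title] = filename
    return index

def filter_pages(index, wanted_titles):
    """Filter operation: select only the titles of interest, yielding
    an output significantly smaller than the input."""
    return {t: index[t] for t in wanted_titles if t in index}
```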
Results
We summarize the results from our experiments examining the effects of streaming, file systems, network, and replication.
Streaming. Existing scientific applications can benefit from the MapReduce framework using the streaming model. For our first experiment, we constructed the filter operation in both C and Java (using the Hadoop APIs) and compared the two to understand the overheads of streaming. Comparing the overheads of streaming is tricky, since language choice alone induces some differences in timing. Hadoop streaming also does not support Java programs, since Java programs can use the Hadoop API directly. Figure 10.5a shows the comparison of the timings of both programs running on a single node. We see that the C implementation is more efficient, and that its advantage grows as the data size increases. Figure 10.5b shows the comparison of the streaming C version with the native Hadoop version for varying file sizes. We notice that the performance of the two is similar for smaller file sizes. Since the C implementation is inherently more efficient, this similarity indicates that streaming adds overhead. As the file size increases, we see that the streaming overhead grows.
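Under Hadoop streaming, a mapper is simply a program that reads records on standard input and writes tab-separated key/value pairs on standard output; every record must cross this pipe, which is the source of the overhead measured above. A minimal filter mapper in this style might look like the sketch below (the substring predicate is a placeholder, not the benchmark's actual filter; under Hadoop streaming the function would be wired to `sys.stdin` and `sys.stdout`):

```python
def filter_mapper(stream, pattern, out):
    """Hadoop-streaming-style mapper: read lines from an input
    stream and emit a tab-separated key/value pair for each line
    matching the filter. Serializing every record through such a
    pipe is the per-record cost of the streaming model."""
    for line in stream:
        line = line.rstrip("\n")
        if pattern in line:
            out.write(f"{pattern}\t{line}\n")
```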
Effect of file system. Figure 10.6 shows the comparison of performance of the data operations on both GPFS and HDFS. For the filter operation, there is negligible difference between HDFS and GPFS performance up to 2TB. However, at 3TB, HDFS performs significantly better than GPFS. For the reorder and merge, GPFS seems to achieve better performance than HDFS, and the gap increases with increasing file sizes.
Figure 10.7 shows the comparison of HDFS and GPFS for all three data operations for a 2TB data set. The figure shows a breakdown of the processing time. The difference between HDFS and GPFS is negligible for the filter operation. We notice that for the reorder and merge, GPFS seems to achieve better performance than HDFS overall at this data size. However, HDFS performs better than GPFS at reads, whereas GPFS performs better than HDFS for the write part of the application at the given concurrency.
Our earlier results compared the performance for TeraGen on HDFS and GPFS with a varying number of maps. TeraGen is a write-intensive operation. Figure 10.8 shows the effect of varying the number of mappers on processing time for the filter operation on both HDFS and GPFS. We see that with an increasing number of maps, the performance of GPFS degrades significantly. The clearly noticeable performance variations are likely due to artifacts of Hadoop's scheduling.
Thus, HDFS seems to achieve better read performance than GPFS, and better write performance at higher concurrencies.
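For reference, the number of map tasks in a run like this is controlled through Hadoop's job configuration. A sketch of how it might be set for a streaming job in the Hadoop versions of this era follows; the jar path is a placeholder, the property name varies by release (newer releases use `mapreduce.job.maps`), and Hadoop treats the value as a hint that also depends on the input splits.

```shell
# Suggest 64 map tasks for a map-only streaming job (a hint, not a
# guarantee; actual task count also depends on input split sizes).
hadoop jar hadoop-streaming.jar \
    -D mapred.map.tasks=64 \
    -input /data/wikipedia -output /out/filter \
    -mapper ./filter_mapper -numReduceTasks 0
```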
Effect of network. Scientific applications in HPC centers traditionally use high-performance, low-latency networks. However, Hadoop has traditionally been run on commodity clusters based on Ethernet networks. The shuffle phase between the map and reduce phases is considered to be the most network-intensive operation, since all the keys are sorted and data belonging to a single key is sent to the same reducer, resulting in large data movement across the network. Figure 10.9 shows the comparison of the network on various data operations with varying file sizes. We observe that filter and reorder are not affected as much by the changes