Algorithms and Data Structures for External Memory

5.1.2 Load Balancing Across the Disks

In order to meet the sorting I/O bound (5.1), we must form the buckets at each level of recursion using O(n/D) I/Os, which is easy to do for the single-disk case. The challenge is in the more general multiple-disk case: each input I/O step and each output I/O step during the bucket formation must involve on average Θ(D) blocks. The file of items being partitioned is itself one of the buckets formed in the previous level of recursion. In order to read that file efficiently, its blocks must be spread uniformly among the disks, so that no one disk is a bottleneck. In summary, the challenge in distribution sort is to output the blocks of the buckets to the disks in an online manner and achieve a global load balance by the end of the partitioning, so that the bucket can be input efficiently during the next level of the recursion.

Partial striping is an effective technique for reducing the amount of information that must be stored in internal memory in order to manage the disks. The disks are grouped into clusters of size C, and data are output in "logical blocks" of size CB, one per cluster. Choosing C = √D will not change the sorting time by more than a constant factor, but as pointed out in Section 4.2, full striping (in which C = D) can be nonoptimal.

Vitter and Shriver [345] develop two randomized online techniques for the partitioning so that with high probability each bucket will be well balanced across the D disks. In addition, they use partial striping in order to fit in internal memory the pointers needed to keep track of the layouts of the buckets on the disks.

Their first partitioning technique applies when the size N of the file to partition is sufficiently large or when M/DB = Ω(log D), so that the number Θ(n/S) of blocks in each bucket is Ω(D log D). Each parallel output operation sends its D blocks in independent random order to a disk stripe, with all D! orders equally likely.
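As a rough illustration of why the random ordering matters, the following Python sketch models only the placement step: blocks arrive tagged with a bucket label and are written one stripe of D blocks at a time. It is a toy model under stated assumptions, not Vitter and Shriver's algorithm: partial striping and internal memory management are ignored, and the names `write_stripes` and `worst_disk_load`, the value D = 8, the seed, and the adversarial input are all hypothetical choices made for this example.

```python
import random
from collections import defaultdict


def write_stripes(block_buckets, num_disks, rng=None):
    """Place blocks on disks one stripe (num_disks blocks) at a time.

    block_buckets[i] is the bucket label of the i-th output block.
    If rng is given, the blocks of each stripe are sent to the disks in a
    uniformly random order; otherwise block j of a stripe goes to disk j.
    Returns load[bucket][disk] = number of that bucket's blocks on that disk.
    """
    load = defaultdict(lambda: [0] * num_disks)
    for start in range(0, len(block_buckets), num_disks):
        stripe = block_buckets[start:start + num_disks]
        disks = list(range(num_disks))
        if rng is not None:
            rng.shuffle(disks)  # all num_disks! orders equally likely
        for bucket, disk in zip(stripe, disks):
            load[bucket][disk] += 1
    return load


def worst_disk_load(load):
    """Largest number of blocks that any one bucket has on any one disk."""
    return max(max(per_disk) for per_disk in load.values())


D = 8            # number of disks (assumed for the example)
stripes = 2000   # each bucket gets 2000 blocks, well above D log D

# Adversarial input (assumed): position j of every stripe holds bucket j.
blocks = [j for _ in range(stripes) for j in range(D)]

fixed = write_stripes(blocks, D)
randomized = write_stripes(blocks, D, random.Random(12345))

print("fixed order, worst per-disk load: ", worst_disk_load(fixed))
print("random order, worst per-disk load:", worst_disk_load(randomized))
print("ideal per-disk load:              ", stripes // D)
```

With the fixed order, this input puts every block of bucket j on disk j, so reading a bucket back in the next level of recursion would use a single disk. With an independent random order per stripe, each bucket's blocks should end up close to evenly spread over the D disks.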
At the end of the partitioning, with high probability each bucket is evenly distributed among the disks. This situation is intuitively analogous to the classical occupancy problem, in which b balls are inserted independently and uniformly at random into d bins. It is well known that if the load factor b/d grows asymptotically faster than log d, the most densely populated bin contains b/d balls asymptotically on the average, which corresponds to an even distribution. However, if the load factor b/d is 1, the largest bin contains (ln d)/ln ln d balls on the average, whereas any individual bin contains an average of only one ball [341].¹ Intuitively, the blocks in a bucket act as balls and the disks act as bins. In our case, the parameters correspond to b = Ω(d log d), which suggests that the blocks in the bucket should be evenly distributed among the disks.

By further analogy to the occupancy problem, if the number of blocks per bucket is not Ω(D log D), then the technique breaks down and the distribution of each bucket among the disks tends to be uneven, causing a bottleneck for I/O operations. For these smaller values of N, Vitter and Shriver use their second partitioning technique: The file is streamed through internal memory in one pass, one memoryload at a time. Each memoryload is independently and randomly permuted and output back to the disks in the new order. In a second pass, the file is input one memoryload at a time in a "diagonally striped" manner. Vitter and Shriver show that with very high probability each individual "diagonal stripe" contributes about the same number of items to each bucket, so the blocks of the buckets in each memoryload can be assigned to the disks in a balanced round-robin manner using an optimal number of I/Os.

DeWitt et al. [140] present a randomized distribution sort algorithm in a similar model to handle the case when sorting can be done in two passes. They use a sampling technique to find the partitioning elements and route the items in each bucket to a particular processor. The buckets are sorted individually in the second pass.

An even better way to do distribution sort, and deterministically at that, is the BalanceSort method developed by Nodine and Vitter [273]. During the partitioning process, the algorithm keeps track of how evenly each bucket has been distributed so far among the disks. It maintains an invariant that guarantees good distribution across the disks for each bucket. For each bucket 1 ≤ b ≤ S and disk 1 ≤ d ≤ D, let num_b be the total number of items in bucket b processed so far during the partitioning and let num_b(d) be the number of those items

¹ We use the notation ln d to denote the natural (base e) logarithm log_e d.
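The occupancy analogy used earlier in this section can be checked empirically. The Python sketch below throws b balls uniformly at random into d bins and reports the ratio of the fullest bin to the average load b/d, once for load factor 1 and once for a load factor well above ln d. It is only a simulation of the balls-and-bins model, not of any sorting algorithm, and the function names, d = 256, the factor 50, and the seed are assumptions made for this example.

```python
import math
import random


def max_load(num_balls, num_bins, rng):
    """Throw num_balls into num_bins uniformly at random; return the fullest bin's count."""
    counts = [0] * num_bins
    for _ in range(num_balls):
        counts[rng.randrange(num_bins)] += 1
    return max(counts)


def imbalance(num_balls, num_bins, rng):
    """Ratio of the fullest bin to the average load num_balls / num_bins."""
    return max_load(num_balls, num_bins, rng) / (num_balls / num_bins)


rng = random.Random(2008)
d = 256  # number of bins, playing the role of disks (assumed)

# Load factor b/d = 1: the fullest bin is several times the average.
light = imbalance(d, d, rng)

# Load factor b/d = 50 ln d, well above log d: the fullest bin is near the average.
heavy = imbalance(int(50 * d * math.log(d)), d, rng)

print(f"load factor 1:       max/average = {light:.2f}")
print(f"load factor 50 ln d: max/average = {heavy:.2f}")
```

In runs of this kind the first ratio should come out at several times the average while the second stays close to 1, which mirrors the claim that buckets with Ω(D log D) blocks tend to be spread evenly while much smaller buckets do not.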