Data Structures and Algorithm Analysis - Computer Science at ...

More documents

Recommendations

Info

Sec. 9.4 Hashing 3319.4.4 Analysis of Closed HashingHow efficient is hashing? We can measure hashing performance in terms of thenumber of record accesses required when performing an operation. The primaryoperations of concern are insertion, deletion, and search. It is useful to distinguishbetween successful and unsuccessful searches. Before a record can be deleted, itmust be found. Thus, the number of accesses required to delete a record is equivalentto the number required to successfully search for it. To insert a record, anempty slot along the record’s probe sequence must be found. This is equivalent toan unsuccessful search for the record (recall that a successful search for the recordduring insertion should generate an error because two records with the same keyare not allowed to be stored in the table).When the hash table is empty, the first record inserted will always find its homeposition free. Thus, it will require only one record access to find a free slot. If allrecords are stored in their home positions, then successful searches will also requireonly one record access. As the table begins to fill up, the probability that a recordcan be inserted into its home position decreases. If a record hashes to an occupiedslot, then the collision resolution policy must locate another slot in which to storeit. Finding records not stored in their home position also requires additional recordaccesses as the record is searched for along its probe sequence. As the table fillsup, more and more records are likely to be located ever further from their homepositions.From this discussion, we see that the expected cost of hashing is a function ofhow full the table is. Define the load factor for the table as α = N/M, where Nis the number of records currently in the table.An estimate of the expected cost for an insertion (or an unsuccessful search)can be derived analytically as a function of α in the case where we assume thatthe probe sequence follows a random permutation of the slots in the hash table.Assuming that every slot in the table has equal probability of being the home slotfor the next record, the probability of finding the home position occupied is α. Theprobability of finding both the home position occupied and the next slot on theprobe sequence occupied is N(N−1)M(M−1). The probability of i collisions isN(N − 1) · · · (N − i + 1)M(M − 1) · · · (M − i + 1) .If N and M are large, then this is approximately (N/M) i . The expected numberof probes is one plus the sum over i ≥ 1 of the probability of i collisions, which isapproximately1 +∞∑(N/M) i = 1/(1 − α).i=1
332 Chap. 9 SearchingThe cost for a successful search (or a deletion) has the same cost as originallyinserting that record. However, the expected value for the insertion cost dependson the value of α not at the time of deletion, but rather at the time of the originalinsertion. We can derive an estimate of this cost (essentially an average over all theinsertion costs) by integrating from 0 to the current value of α, yielding a result of∫1 αα 011 − x dx = 1 α log e11 − α .It is important to realize that these equations represent the expected cost foroperations using the unrealistic assumption that the probe sequence is based on arandom permutation of the slots in the hash table (thus avoiding all expense resultingfrom clustering). Thus, these costs are lower-bound estimates in the averagecase. The true average cost under linear probing is 1 2 (1 + 1/(1 − α)2 ) for insertionsor unsuccessful searches and 1 2(1+1/(1−α)) for deletions or successful searches.Proofs for these results can be found in the references cited in Section 9.5.Figure 9.10 shows the graphs of these four equations to help you visualize theexpected performance of hashing based on the load factor. The two solid lines showthe costs in the case of a “random” probe sequence for (1) insertion or unsuccessfulsearch and (2) deletion or successful search. As expected, the cost for insertion orunsuccessful search grows faster, because these operations typically search furtherdown the probe sequence. The two dashed lines show equivalent costs for linearprobing. As expected, the cost of linear probing grows faster than the cost for“random” probing.From Figure 9.10 we see that the cost for hashing when the table is not too fullis typically close to one record access. This is extraordinarily efficient, much betterthan binary search which requires log n record accesses. As α increases, so doesthe expected cost. For small values of α, the expected cost is low. It remains belowtwo until the hash table is about half full. When the table is nearly empty, addinga new record to the table does not increase the cost of future search operationsby much. However, the additional search cost caused by each additional insertionincreases rapidly once the table becomes half full. Based on this analysis, the ruleof thumb is to design a hashing system so that the hash table never gets above halffull. Beyond that point performance will degrade rapidly. This requires that theimplementor have some idea of how many records are likely to be in the table atmaximum loading, and select the table size accordingly.You might notice that a recommendation to never let a hash table become morethan half full contradicts the disk-based space/time tradeoff principle, which strivesto minimize disk space to increase information density. Hashing represents an unusualsituation in that there is no benefit to be expected from locality of reference.In a sense, the hashing system implementor does everything possible to eliminatethe effects of locality of reference! Given the disk block containing the last record
Page 1 and 2:
Data Structures and AlgorithmAnalys
Page 3 and 4:
ivContents2.6.1 Direct Proof 372.6.
Page 5 and 6:
viContents6.1 General Tree Definiti
Page 7 and 8:
viiiContents10.5.2 B-Tree Analysis
Page 9 and 10:
xContents15.7 Optimal Sorting 50115
Page 12 and 13:
PrefaceWe study data structures so
Page 14 and 15:
PrefacexvA sophomore-level class wh
Page 16 and 17:
Prefacexviithe form ofcalls to meth
Page 18:
PrefacexixFor the second edition, I
Page 22 and 23:
1Data Structures and AlgorithmsHow
Page 24 and 25:
Sec. 1.1 A Philosophy of Data Struc
Page 26 and 27:
Sec. 1.1 A Philosophy of Data Struc
Page 28 and 29:
Sec. 1.2 Abstract Data Types and Da
Page 30 and 31:
Sec. 1.2 Abstract Data Types and Da
Page 32 and 33:
Sec. 1.3 Design Patterns 13that tra
Page 34 and 35:
Sec. 1.3 Design Patterns 15will cal
Page 36 and 37:
Sec. 1.4 Problems, Algorithms, and
Page 38 and 39:
Sec. 1.5 Further Reading 19vides po
Page 40 and 41:
Sec. 1.6 Exercises 21than 10,000 of
Page 42 and 43:
2Mathematical PreliminariesThis cha
Page 44 and 45:
Sec. 2.1 Sets and Relations 25A seq
Page 46 and 47:
Sec. 2.2 Miscellaneous Notation 27E
Page 48 and 49:
Sec. 2.3 Logarithms 29Unfortunately
Page 50 and 51:
Sec. 2.4 Summations and Recurrences
Page 52 and 53:
Sec. 2.4 Summations and Recurrences
Page 54 and 55:
Sec. 2.5 Recursion 35(a)(b)Figure 2
Page 56 and 57:
Sec. 2.6 Mathematical Proof Techniq
Page 58 and 59:
Page 60 and 61:
Page 62 and 63:
Page 64 and 65:
Sec. 2.8 Further Reading 45one inch
Page 66 and 67:
Sec. 2.9 Exercises 47(c) For nonzer
Page 68 and 69:
Sec. 2.9 Exercises 492.16 Write a r
Page 70 and 71:
Sec. 2.9 Exercises 512.38 How many
Page 72 and 73:
3Algorithm AnalysisHow long will it
Page 74 and 75:
Sec. 3.1 Introduction 55efficient.
Page 76 and 77:
Sec. 3.1 Introduction 57n! 2 n2n 25
Page 78 and 79:
Sec. 3.2 Best, Worst, and Average C
Page 80 and 81:
Sec. 3.3 A Faster Computer, or a Fa
Page 82 and 83:
Sec. 3.4 Asymptotic Analysis 633.4
Page 84 and 85:
Sec. 3.4 Asymptotic Analysis 65If s
Page 86 and 87:
Sec. 3.4 Asymptotic Analysis 67it i
Page 88 and 89:
Sec. 3.5 Calculating the Running Ti
Page 90 and 91:
Page 92 and 93:
Page 94 and 95:
Sec. 3.7 Common Misunderstandings 7
Page 96 and 97:
Sec. 3.8 Multiple Parameters 77the
Page 98 and 99:
Sec. 3.9 Space Bounds 79A data stru
Page 100 and 101:
Sec. 3.10 Speeding Up Your Programs
Page 102 and 103:
Sec. 3.11 Empirical Analysis 833.11
Page 104 and 105:
Sec. 3.13 Exercises 853.13 Exercise
Page 106 and 107:
Sec. 3.13 Exercises 87(f) sum = 0;f
Page 108:
Sec. 3.14 Projects 893.24 Prove tha
Page 112 and 113:
4Lists, Stacks, and QueuesIf your p
Page 114 and 115:
Sec. 4.1 Lists 95list ADT. Interfac
Page 116 and 117:
Sec. 4.1 Lists 97for (L.moveToStart
Page 118 and 119:
Sec. 4.1 Lists 99public void moveTo
Page 120 and 121:
Sec. 4.1 Lists 101/** Singly linked
Page 122 and 123:
Sec. 4.1 Lists 103headcurrtailhead2
Page 124 and 125:
Sec. 4.1 Lists 105/** Set curr at l
Page 126 and 127:
Sec. 4.1 Lists 107curr...23 10 12(a
Page 128 and 129:
Sec. 4.1 Lists 109/** Singly linked
Page 130 and 131:
Sec. 4.1 Lists 111Array-based lists
Page 132 and 133:
Sec. 4.1 Lists 113headcurrtail20 23
Page 134 and 135:
Sec. 4.1 Lists 115/** Insert "it" a
Page 136 and 137:
Sec. 4.2 Stacks 117The principle be
Page 138 and 139:
Sec. 4.2 Stacks 119/** Array-based
Page 140 and 141:
Sec. 4.2 Stacks 121top1top2Figure 4
Page 142 and 143:
Sec. 4.2 Stacks 123you might want t
Page 144 and 145:
Sec. 4.3 Queues 125/** Queue ADT */
Page 146 and 147:
Sec. 4.3 Queues 127frontfront20 512
Page 148 and 149:
Sec. 4.3 Queues 129/** Array-based
Page 150 and 151:
Sec. 4.4 Dictionaries 1314.3.3 Comp
Page 152 and 153:
Sec. 4.4 Dictionaries 133not get at
Page 154 and 155:
Sec. 4.4 Dictionaries 135/** Contai
Page 156 and 157:
Sec. 4.4 Dictionaries 137}/** @retu
Page 158 and 159:
Sec. 4.6 Exercises 139(b) L2.append
Page 160 and 161:
Sec. 4.7 Projects 1414.16 Re-implem
Page 162:
Sec. 4.7 Projects 143‘a’ ‘b
Page 165 and 166:
146 Chap. 5 Binary TreesABCDEFGHIFi
Page 167 and 168:
148 Chap. 5 Binary TreesAny number
Page 169 and 170:
150 Chap. 5 Binary Trees/** ADT for
Page 171 and 172:
152 Chap. 5 Binary Treestends to be
Page 173 and 174:
154 Chap. 5 Binary Trees5.3 Binary
Page 175 and 176:
156 Chap. 5 Binary TreesABCDEFGHIFi
Page 177 and 178:
158 Chap. 5 Binary Treesto its call
Page 179 and 180:
160 Chap. 5 Binary Trees5.3.2 Space
Page 181 and 182:
162 Chap. 5 Binary Trees012345 678
Page 183 and 184:
164 Chap. 5 Binary Trees12037422442
Page 185 and 186:
166 Chap. 5 Binary Trees37244273240
Page 187 and 188:
168 Chap. 5 Binary Treessubroot105
Page 189 and 190:
170 Chap. 5 Binary Treesin the case
Page 191 and 192:
172 Chap. 5 Binary Treessynonymous
Page 193 and 194:
174 Chap. 5 Binary Trees/** Heapify
Page 195 and 196:
176 Chap. 5 Binary TreesRH1H2Figure
Page 197 and 198:
178 Chap. 5 Binary Trees5.6 Huffman
Page 199 and 200:
180 Chap. 5 Binary TreesLetter C D
Page 201 and 202:
182 Chap. 5 Binary Trees/** Huffman
Page 203 and 204:
184 Chap. 5 Binary Trees/** Build a
Page 205 and 206:
186 Chap. 5 Binary Treesa reverse p
Page 207 and 208:
188 Chap. 5 Binary Trees18 bits, mo
Page 209 and 210:
190 Chap. 5 Binary Trees5.12 Write
Page 211 and 212:
192 Chap. 5 Binary Trees(c) What is
Page 213 and 214:
194 Chap. 5 Binary Trees5.7 Complet
Page 215 and 216:
196 Chap. 6 Non-Binary TreesRootRAn
Page 217 and 218:
198 Chap. 6 Non-Binary TreesRABCD E
Page 219 and 220:
200 Chap. 6 Non-Binary Trees/** Gen
Page 221 and 222:
202 Chap. 6 Non-Binary Treesspond t
Page 223 and 224:
204 Chap. 6 Non-Binary Treesditiona
Page 225 and 226:
206 Chap. 6 Non-Binary Treeslence o
Page 227 and 228:
208 Chap. 6 Non-Binary TreesR’RA
Page 229 and 230:
210 Chap. 6 Non-Binary TreesRRABABC
Page 231 and 232:
212 Chap. 6 Non-Binary Trees(a)(b)F
Page 233 and 234:
214 Chap. 6 Non-Binary Treestwo chi
Page 235 and 236:
216 Chap. 6 Non-Binary Treeschild v
Page 237 and 238:
218 Chap. 6 Non-Binary TreesCAFBEHG
Page 239 and 240:
220 Chap. 6 Non-Binary Treestree af
Page 242 and 243:
7Internal SortingWe sort many thing
Page 244 and 245:
Sec. 7.2 Three Θ(n 2 ) Sorting Alg
Page 246 and 247:
Page 248 and 249:
Page 250 and 251:
Sec. 7.3 Shellsort 231Insertion Bub
Page 252 and 253:
Sec. 7.4 Mergesort 233static
Page 254 and 255:
Sec. 7.4 Mergesort 235static
Page 256 and 257:
Sec. 7.5 Quicksort 237Quicksort is
Page 258 and 259:
Sec. 7.5 Quicksort 239static
Page 260 and 261:
Sec. 7.5 Quicksort 241is still unli
Page 262 and 263:
Sec. 7.6 Heapsort 243subarray, only
Page 264 and 265:
Sec. 7.7 Binsort and Radix Sort 245
Page 266 and 267:
Page 268 and 269:
Page 270 and 271:
Sec. 7.8 An Empirical Comparison of
Page 272 and 273:
Sec. 7.9 Lower Bounds for Sorting 2
Page 274 and 275:
Sec. 7.9 Lower Bounds for Sorting 2
Page 276 and 277:
Sec. 7.10 Further Reading 257• Eq
Page 278 and 279:
Sec. 7.11 Exercises 259algorithm to
Page 280 and 281:
Sec. 7.12 Projects 2617.18 Which of
Page 282:
Sec. 7.12 Projects 2637.6 It has be
Page 285 and 286:
266 Chap. 8 File Processing and Ext
Page 287 and 288:
Page 289 and 290:
Page 291 and 292:
Page 293 and 294:
Page 295 and 296:
Page 297 and 298:
Page 299 and 300: 280 Chap. 8 File Processing and Ext
Page 321 and 322: 302 Chap. 9 Searchingintroduced in
Page 323 and 324: 304 Chap. 9 Searchingvalue in L gre
Page 325 and 326: 306 Chap. 9 SearchingWe now show th
Page 327 and 328: 308 Chap. 9 Searchingrecords by exp
Page 329 and 330: 310 Chap. 9 SearchingThis is potent
Page 331 and 332: 312 Chap. 9 Searchingand the total
Page 333 and 334: 314 Chap. 9 SearchingThis method of
Page 335 and 336: 316 Chap. 9 Searchingbirthday is si
Page 337 and 338: 318 Chap. 9 Searching45674567319692
Page 341 and 342: 322 Chap. 9 Searching01HashTable100
Page 343 and 344: 324 Chap. 9 Searching/** Insert rec
Page 347 and 348: 328 Chap. 9 SearchingExample 9.9 Co
Page 349: 330 Chap. 9 Searching/** Dictionary
Page 353 and 354: 334 Chap. 9 Searchingthat is not in
Page 355 and 356: 336 Chap. 9 SearchingA good introdu
Page 357 and 358: 338 Chap. 9 Searching9.16 Assume th
Page 360 and 361: 10IndexingMany large-scale computin
Page 362 and 363: Sec. 10.1 Linear Indexing 343Linear
Page 364 and 365: Sec. 10.1 Linear Indexing 345JonesA
Page 366 and 367: Sec. 10.2 ISAM 347In−memoryTable
Page 368 and 369: Sec. 10.3 Tree-based Indexing 349Fi
Page 370 and 371: Sec. 10.4 2-3 Trees 351/** 2-3 tree
Page 372 and 373: Sec. 10.4 2-3 Trees 35318331223 30
Page 374 and 375: Sec. 10.5 B-Trees 355private TTNode
Page 376 and 377: Sec. 10.5 B-Trees 35724152033 45 48
Page 378 and 379: Sec. 10.5 B-Trees 3593318234810 12
Page 380 and 381: Sec. 10.5 B-Trees 361331012 233348
Page 382 and 383: Sec. 10.5 B-Trees 3634845 4748 50 5
Page 384 and 385: Sec. 10.6 Further Reading 365at mos
Page 386 and 387: Sec. 10.8 Projects 36710.7 Prove th
Page 388: PART IVAdvanced Data Structures369
Page 391 and 392: 372 Chap. 11 Graphs04123147123(a)(b
Page 393 and 394: 374 Chap. 11 Graphs0 1 2 3 40201 11
Page 395 and 396: 376 Chap. 11 Graphstime. This is a
Page 397 and 398: 378 Chap. 11 GraphsIt is reasonably
Page 399 and 400: 380 Chap. 11 Graphs}/** @return an
Page 401 and 402:
382 Chap. 11 Graphs}/** Set the wei
Page 403 and 404:
384 Chap. 11 GraphsABABCCDDFFEE(a)(
Page 405 and 406:
386 Chap. 11 Graphs/** Breadth firs
Page 407 and 408:
388 Chap. 11 Graphs/** Recursive to
Page 409 and 410:
390 Chap. 11 GraphsA103B2205D11C15E
Page 411 and 412:
392 Chap. 11 Graphs/** Dijkstra’s
Page 413 and 414:
394 Chap. 11 GraphsA7 5CB91 26ED 21
Page 415 and 416:
396 Chap. 11 GraphsPrim’s algorit
Page 417 and 418:
398 Chap. 11 GraphsInitialA B C D E
Page 419 and 420:
400 Chap. 11 Graphs(a) IF an undire
Page 421 and 422:
402 Chap. 11 Graphs11.8 Projects11.
Page 424 and 425:
12Lists and Arrays RevisitedSimple
Page 426 and 427:
Sec. 12.1 Multilists 407L1L2L3bcdL4
Page 428 and 429:
Sec. 12.2 Matrix Representations 40
Page 430 and 431:
Sec. 12.2 Matrix Representations 41
Page 432 and 433:
Sec. 12.3 Memory Management 413/**
Page 434 and 435:
Sec. 12.3 Memory Management 415Smal
Page 436 and 437:
Sec. 12.3 Memory Management 417Tag
Page 438 and 439:
Sec. 12.3 Memory Management 419tend
Page 440 and 441:
Sec. 12.3 Memory Management 421is e
Page 442 and 443:
Sec. 12.3 Memory Management 423AaBc
Page 444 and 445:
Sec. 12.4 Further Reading 4251a3b42
Page 446 and 447:
Sec. 12.6 Projects 42712.7 Write a
Page 448 and 449:
13Advanced Tree StructuresThis chap
Page 450 and 451:
Sec. 13.1 Tries 4310101010120001240
Page 452 and 453:
Sec. 13.1 Tries 433000xxxx00xxxxx0x
Page 454 and 455:
Sec. 13.2 Balanced Trees 4353737244
Page 456 and 457:
Sec. 13.2 Balanced Trees 437that is
Page 458 and 459:
Sec. 13.2 Balanced Trees 439GPDSASP
Page 460 and 461:
Sec. 13.3 Spatial Data Structures 4
Page 462 and 463:
Page 464 and 465:
Page 466 and 467:
Page 468 and 469:
Page 470 and 471:
Page 472 and 473:
Sec. 13.4 Further Reading 45313.3.4
Page 474 and 475:
Sec. 13.6 Projects 455(c) Show the
Page 476:
Sec. 13.6 Projects 45713.11 Select
Page 480 and 481:
14Analysis TechniquesOften it is ea
Page 482 and 483:
Sec. 14.1 Summation Techniques 463w
Page 484 and 485:
Sec. 14.1 Summation Techniques 465B
Page 486 and 487:
Sec. 14.2 Recurrence Relations 4671
Page 488 and 489:
Sec. 14.2 Recurrence Relations 469n
Page 490 and 491:
Sec. 14.2 Recurrence Relations 471E
Page 492 and 493:
Sec. 14.2 Recurrence Relations 473N
Page 494 and 495:
Sec. 14.2 Recurrence Relations 475T
Page 496 and 497:
Sec. 14.3 Amortized Analysis 477(ch
Page 498 and 499:
Sec. 14.4 Further Reading 479compar
Page 500 and 501:
Sec. 14.5 Exercises 48114.14 For th
Page 502:
Sec. 14.6 Projects 48314.24 Use mat
Page 505 and 506:
486 Chap. 15 Lower Boundsthe concep
Page 507 and 508:
488 Chap. 15 Lower Boundsat least o
Page 509 and 510:
490 Chap. 15 Lower BoundsABCGDEFFig
Page 511 and 512:
492 Chap. 15 Lower BoundsProof 1: T
Page 513 and 514:
494 Chap. 15 Lower BoundsFigure 15.
Page 515 and 516:
496 Chap. 15 Lower BoundsTo minimiz
Page 517 and 518:
498 Chap. 15 Lower BoundsThis recur
Page 519 and 520:
500 Chap. 15 Lower BoundsFigure 15.
Page 521 and 522:
502 Chap. 15 Lower Boundsnot only t
Page 523 and 524:
504 Chap. 15 Lower Bounds1234Figure
Page 525 and 526:
506 Chap. 15 Lower Bounds(a) Assume
Page 528 and 529:
16Patterns of AlgorithmsThis chapte
Page 530 and 531:
Sec. 16.1 Dynamic Programming 511Dy
Page 532 and 533:
Sec. 16.1 Dynamic Programming 513on
Page 534 and 535:
Sec. 16.2 Randomized Algorithms 515
Page 536 and 537:
Page 538 and 539:
Page 540 and 541:
Page 542 and 543:
Sec. 16.3 Numerical Algorithms 523
Page 544 and 545:
Sec. 16.3 Numerical Algorithms 525t
Page 546 and 547:
Sec. 16.3 Numerical Algorithms 527B
Page 548 and 549:
Sec. 16.3 Numerical Algorithms 529o
Page 550 and 551:
Sec. 16.3 Numerical Algorithms 531o
Page 552:
Sec. 16.6 Projects 533value depth5
Page 555 and 556:
536 Chap. 17 Limits to ComputationT
Page 557 and 558:
Page 559 and 560:
540 Chap. 17 Limits to ComputationS
Page 561 and 562:
542 Chap. 17 Limits to Computationa
Page 563 and 564:
544 Chap. 17 Limits to Computation3
Page 565 and 566:
Page 567 and 568:
548 Chap. 17 Limits to Computationp
Page 569 and 570:
550 Chap. 17 Limits to ComputationE
Page 571 and 572:
552 Chap. 17 Limits to Computation1
Page 573 and 574:
554 Chap. 17 Limits to ComputationM
Page 575 and 576:
556 Chap. 17 Limits to Computationw
Page 577 and 578:
558 Chap. 17 Limits to Computationx
Page 579 and 580:
560 Chap. 17 Limits to Computationt
Page 581 and 582:
562 Chap. 17 Limits to Computationm
Page 583 and 584:
564 Chap. 17 Limits to Computation(
Page 586 and 587:
Bibliography[AG06] Ken Arnold and J
Page 588 and 589:
BIBLIOGRAPHY 569[GI91] Zvi Galil an
Page 590 and 591:
BIBLIOGRAPHY 571[SJH93] Clifford A.
Page 592 and 593:
Index80/20 rule, 309, 333abstract d
Page 594 and 595:
INDEX 575image space, 430key space,
Page 596 and 597:
INDEX 577building, 172-177for memor
Page 598 and 599:
INDEX 579octree, 451Ω notation, 65
Page 600 and 601:
INDEX 581comparing algorithms, 224-
show all

Data Structures and Algorithm Analysis - Computer Science at ...

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?