Efficient Evaluation of Minimal Perfect Hash Functions

1.2 The Tarjan-Yao Displacement SchemeThe class of hash functions presented here can be seen as a variation of an early perfect class ofhash functions due to Tarjan and Yao [13]. Their class requires a universe of size u = O(n 2 )(which they improve to u = O(n c ) for constant c > 2). The idea is to split the universe into blocksof size O(n), each of which is assigned a “displacement” value. The ith element within the jth blockis mapped to i + d j , where d j is the displacement value of block j. Suitable displacement valuescan always be found, but in general displacement values (and thus hash table size) larger than nmay be required. A “harmonic decay” condition on the distribution of elements within the blocksensures that suitable displacement values in the range {0, . . . , n} can be found, and that they canin fact be found “greedily” in decreasing order of the number of elements within the blocks. Toachieve harmonic decay, Tarjan and Yao first perform a displacement “orthogonal” to the other.The central observation of this paper is that a reduction of the universe to size O(n 2 ), as well asharmonic decay, can be achieved using universal hash functions. Or equivalently, that buckets in a(universal) hash table can be resolved using displacements.ijdji+djFigure 1: Tarjan-Yao displacement scheme2 A Perfect Class of Hash FunctionsThe concept of universality [1] plays an important role in the analysis of our class. We use thefollowing notation.Definition 1 A class of functions H r = {h 1 , . . . , h k }, h i : U → {0, . . . , r − 1}, is c-universal if forany x, y ∈ U, x ≠ y,Pri[h i (x) = h i (y)] ≤ c/r .It is (c, 2)-universal if for any x, y ∈ U, x ≠ y, and p, q ∈ {0, . . . , r − 1},Pri[h i (x) = p and h i (y) = q] ≤ c/r 2 .Many such classes with constant c are known, see e.g. [4]. For our application the important thing tonote is that there are universal classes that allow efficient storage and evaluation of their functions.More specifically, O(log u) (and even O(log n + log log u)) bits of storage suffice, and a constantnumber of simple arithmetic and bit operations are enough to evaluate the functions. Furthermore,3

Previous page

Next page

3

4

5

6

7

8

10

Efficient Evaluation of Minimal Perfect Hash Functions

Create successful ePaper yourself

Delete template?

Save as template?