Applying OLAP Pre-Aggregation Techniques to ... - Jacobs University

More documents

Recommendations

Info

80 5. Pre-Aggregation Support Beyond Basic Aggregate Operations For example, scale(CalF ires, [2, 2, 2], nn) defines a scaling operation by a factor of two on each dimension, using nearest neighbor as resampling method on a 3D dataset identified as CalF ires. 5.2.2 Pre-Aggregation Selection Problem Definition 5.4 (Pre-Aggregates Selection Problem) – Given a query workload Q and a storage space constraint C, the pre-aggregates selection problem is to select a set P ⊆ Q of queries such that P minimizes the overall costs of computing Q while the storage space required by P does not exceed the limit given by C. ✷ Considering existing view selection strategies in data warehousing/OLAP, the following selection criteria are suggested for pre-aggregates: • Frequency. Pre-aggregates yield particularly significant increases in processing speed when scaling operations are executed with high frequency within a workload. • Storage space. The storage space constraint of a candidate scaling operation must be at least the size of the storage required by the query in the workload with the smallest scale vector. This guarantees that for any query in the workload at least one pre-aggregate can be used for its computation. • Benefit. A scaling operation may be used to compute the same and other dependent queries in the workload. A metric is therefore used to calculate the cost savings gained by using a candidate scaling operation. To evaluate the cost, we use the model presented in Section 4.2. We call this the benefit of a pre-aggregate set and normalize the benefit against the base object’s storage volume. Frequency The frequency of query q, denoted by F (q), is the number of occurrences of a given query in a workload: F (q) = N(q)/ |Q| (5.2) where N(q) is a function that returns the number of occurrences of a given query in workload Q. Storage Space The storage space of a given query denoted by S(q), represents the storage space required to save the result of query q and it is determined by the number of cells composing the output object defined in query q.
5.2 Conceptual Framework 81 Benefit The benefit of a candidate scale operation for pre-aggregation q, is computed by adding the savings in query cost for each scaling operation in the workload dependent on q, including all queries identical to q. That is, query q may contribute to saving processing costs for the same or similar queries in the workload. In both cases, specific matching conditions must be satisfied. Full-Match Conditions. Let q be a candidate query for pre-aggregation and p a query in workload Q. Let p and q both be scaling operations as defined in Eq. 5.1. There is a full-match between q and p if and only if: • the value of parameter objName[] in the scale function defined for q is the same as in p • the value of parameter ⃗s in the scale function defined for q is the same as in p • the value of parameter resMeth in the scale function defined for q is the same as in p Partial-Match Conditions. Let q be a candidate query for pre-aggregation and p be a query in the workload Q. There is a partial-match between p and q if and only if: • the value of parameter objName[] in the scale function defined for q is the same as in p • the value of parameter resMeth in the scale function defined for q is the same as in p • the parameter ⃗s for both q and p is of the same dimensionality • vector values defined in ⃗s for q are higher than those defined in p Definition 5.5 (Benefit) – Let T ∈ Q be a subset of scaling operations that can be fully or partially computed using query q. The benefit of query q per unit space, denoted by B(q), is the sum of the computational cost savings gained by selecting query q for pre-aggregation. ✷ B(q) = ((F (q) ∗ C(q)) + ∑ t∈T (F (t) ∗ C r (t, q)))/size(q) (5.3) where F (q) represents the frequency of query q in the workload, C ( q) is the cost of computing query q on the original dataset, C r (t, q) is the relative cost of computing query t from q, and size(q) is a function that returns the number of cells composing the spatial domain component of a query q.
Page 1:
Applying OLAP Pre-Aggregation Techn
Page 5 and 6:
Acknowledgments I would like to exp
Page 7 and 8:
Abstract Large multidimensional arr
Page 9 and 10:
Contents 1 Introduction and Problem
Page 11 and 12:
List of Figures 2.1 3D Array . . .
Page 13 and 14:
List of Tables 3.1 UNO and FAO Suit
Page 15 and 16:
Chapter 1 Introduction and Problem
Page 17 and 18:
Relevant and complementary question
Page 19 and 20:
1.2 Publications Related to this Th
Page 21 and 22:
Chapter 2 Background and Related Wo
Page 23 and 24:
2.1 Array Databases 17 Figure 2.2 s
Page 25 and 26:
2.1 Array Databases 19 toward the s
Page 27 and 28:
2.1 Array Databases 21 • Bilinear
Page 29 and 30:
2.1 Array Databases 23 given image
Page 31 and 32:
2.2 On-Line Analytical Processing (
Page 33 and 34:
2.2 On-Line Analytical Processing (
Page 35 and 36: 2.2 On-Line Analytical Processing (
Page 37 and 38: 2.2 On-Line Analytical Processing (
Page 39 and 40: 2.3 Discussion 33 spatial-vector da
Page 41 and 42: 2.3 Discussion 35 • Both applicat
Page 43 and 44: Chapter 3 Fundamental Geo-Raster Op
Page 45 and 46: 3.2 Geo-Raster Operations 39 3.1.2
Page 47 and 48: 3.2 Geo-Raster Operations 41 multip
Page 49 and 50: 3.2 Geo-Raster Operations 43 Table
Page 51 and 52: 3.2 Geo-Raster Operations 45 turn i
Page 53 and 54: 3.2 Geo-Raster Operations 47 (a) Or
Page 55 and 56: 3.2 Geo-Raster Operations 49 Query
Page 57 and 58: 3.2 Geo-Raster Operations 51 contai
Page 59 and 60: 3.2 Geo-Raster Operations 53 is the
Page 61 and 62: 3.2 Geo-Raster Operations 55 3.2.4
Page 63 and 64: 3.2 Geo-Raster Operations 57 As in
Page 65 and 66: 3.2 Geo-Raster Operations 59 Local
Page 67 and 68: 3.3 Summary 61 Slicing The slicing
Page 69 and 70: Chapter 4 Answering Basic Aggregate
Page 71 and 72: 4.1 Framework 65 pre-aggregated res
Page 73 and 74: 4.2 Cost Model 67 By partitioning t
Page 75 and 76: 4.2 Cost Model 69 Cost of independe
Page 77 and 78: 4.3 Implementation 71 Algorithm 1 Q
Page 79 and 80: 4.4 Experimental Results 73 Query E
Page 81 and 82: 4.5 Summary 75 pre-aggregates: inde
Page 83 and 84: Chapter 5 Pre-Aggregation Support B
Page 85: 5.2 Conceptual Framework 79 Figure
Page 89 and 90: 5.4 Answering Scaling Operations Us
Page 91 and 92: 5.5 Experimental Results 85 Algorit
Page 93 and 94: 5.5 Experimental Results 87 (a) Que
Page 95 and 96: 5.5 Experimental Results 89 (a) Sel
Page 97 and 98: 5.5 Experimental Results 91 vectors
Page 99 and 100: 5.5 Experimental Results 93 root no
Page 101 and 102: 5.5 Experimental Results 95 Figure
Page 107 and 108: 5.6 Summary 101 we considered non-u
Page 109 and 110: Chapter 6 Conclusion One of the big
Page 111 and 112: 6.1 Future Work 105 more non-spatio
Page 113 and 114: Bibliography [1] Blakeley J. A., La
Page 115 and 116: BIBLIOGRAPHY 109 [22] Moon B., Vega
Page 117 and 118: BIBLIOGRAPHY 111 [47] ESRI Inc. Arc
Page 119 and 120: BIBLIOGRAPHY 113 [73] Stefanovic N.
Page 121: BIBLIOGRAPHY 115 [97] Kotidis Y. an
show all

Applying OLAP Pre-Aggregation Techniques to ... - Jacobs University

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?