11.03.2014 Views

Applying OLAP Pre-Aggregation Techniques to ... - Jacobs University

Applying OLAP Pre-Aggregation Techniques to ... - Jacobs University

Applying OLAP Pre-Aggregation Techniques to ... - Jacobs University

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 6<br />

Conclusion<br />

One of the biggest challenges of database technology is <strong>to</strong> effectively and efficiently<br />

provide solutions for extremely large volumes of multidimensional array data archiving<br />

and management. This thesis focuses on investigating the problem of applying<br />

<strong>OLAP</strong> pre-aggregation technology <strong>to</strong> speed up aggregate query processing in array<br />

databases for GIS and remote-sensing imaging applications.<br />

We presented a study of fundamental imaging operations in GIS. By using a formal<br />

algebraic framework, Array Algebra, we were able <strong>to</strong> classify GIS operations<br />

according <strong>to</strong> three basic algebraic opera<strong>to</strong>rs, and thus identify a set of operations that<br />

can benefit from pre-aggregation techniques. We argued that <strong>OLAP</strong> pre-aggregation<br />

techniques cannot be applied in a straight-forward manner <strong>to</strong> array databases for our<br />

target applications. The reason is that although similar, data structures in both application<br />

domains differ in fundamental aspects. In <strong>OLAP</strong>, multidimensional data spaces<br />

are spanned by axes where cell values sit on the grid at intersection points. This<br />

is paralleled by raster image data that are discretized during acquisition. Thus, the<br />

structure of an <strong>OLAP</strong> data cube is rather similar <strong>to</strong> a raster array. Dimension hierarchies<br />

in <strong>OLAP</strong> serve <strong>to</strong> group value ranges along an axis. Querying data by referring<br />

<strong>to</strong> coordinates on the measure axes yields ground data, whereas queries using axes<br />

higher up in a dimension hierarchy will return aggregated values. A main differentiating<br />

criterion between data in <strong>OLAP</strong> and raster image data is density: <strong>OLAP</strong> data<br />

are sparse, typically 5%, whereas, raster image datasets are 100% dense. Note also<br />

that dimensions in <strong>OLAP</strong> are treated as business perspectives such as products and/or<br />

s<strong>to</strong>res, and these are non-spatial dimensions, which contrast with the spatial nature of<br />

raster image datasets. There are, however, core similarities that motivated us <strong>to</strong> further<br />

research <strong>OLAP</strong> pre-aggregation techniques. For example, we observed that array<br />

databases and <strong>OLAP</strong> systems both employ multidimensional data models <strong>to</strong> organize<br />

their data. Also, the operations convey a high degree of similarity: a roll-up (aggregate)<br />

operation in <strong>OLAP</strong> is very similar <strong>to</strong> a scaling operation in a raster domain.<br />

Moreover, both application domains make use of pre-aggregation approaches <strong>to</strong> speed<br />

up query processing, however, each has different levels of maturity and scalability.<br />

We presented a framework that focuses on computing basic aggregate operations<br />

using pre-aggregated data. We argued that the decision of computing an aggregate<br />

103

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!