23.11.2013 Aufrufe

tekom-Jahrestagung 2012 - ActiveDoc

tekom-Jahrestagung 2012 - ActiveDoc

tekom-Jahrestagung 2012 - ActiveDoc

MEHR ANZEIGEN
WENIGER ANZEIGEN

Erfolgreiche ePaper selbst erstellen

Machen Sie aus Ihren PDF Publikationen ein blätterbares Flipbook mit unserer einzigartigen Google optimierten e-Paper Software.

Sprachtechnologie / Language Technology<br />

Data Cubes<br />

A valuable way to conceptualize the gathering of data for BI is the data<br />

cube model. A data cube, also known as an OLAP (Online Analytical<br />

Processing) cube, is an extension into three or more dimensions of the<br />

type of two-dimensional, row and column data captured in spreadsheets,<br />

whereby the third dimension represents an additional variable.<br />

For example, a spreadsheet might capture products sold in rows, and the<br />

locales in which they were sold in columns. Extending this data into a<br />

third dimension would be like stacking the spreadsheet for this year on<br />

top of that from last year, and these two on top of the spreadsheet from<br />

the year before last, and so on.<br />

Applied to the context of the language industry, one may envision a<br />

cube made up of many individual cubes, each of which represents the<br />

intersection of the three dimensions. For simplicity’s sake, the cube in<br />

following example is 3x3x3, but this does not have to be the case. The<br />

top horizontal layers in the cube represent the languages into which<br />

translations were done. The vertical front-to-back layers represent new<br />

words, fuzzy matched words, and repeated words in each language, and<br />

the vertical left-to-right layers represent the years in which translations<br />

were done. Each small cube represents a number at the intersection of<br />

the three dimensions, for example, how many new words were translated<br />

into Arabic in 2011.<br />

Although it is certainly possible to drill down to them, the individual<br />

intersection cubes are not necessarily interesting in and of themselves.<br />

They provide greater insight for management decisions when they are<br />

compared in sequence to other like cubes. In this example, the drilldown<br />

looks at the intersection of fuzzy matched words in Arabic across<br />

all three years, as indicated by the red cubes.<br />

By rotating the cube and eliminating the extraneous data, we are left<br />

with the sequence of numbers across all three years.<br />

<strong>tekom</strong>-<strong>Jahrestagung</strong> <strong>2012</strong><br />

253

Hurra! Ihre Datei wurde hochgeladen und ist bereit für die Veröffentlichung.

Erfolgreich gespeichert!

Leider ist etwas schief gelaufen!