01.05.2017 Views

632598256894

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Field names are shown with syntax of tablename.columnname. The Count and Sum keyword<br />

perform aggregation based on the items listed in the GROUP BY clause. The AS keyword indicates<br />

that the column should be titled something other than its default caption. The FROM keyword is used<br />

to indicate the tables to be used. The INNER JOIN clause indicates how the data in the tables should<br />

be integrated (linked). The HAVING clause shows the restriction—in this case only customers whose<br />

count of bikes sold is >1. The ORDER BY clause is the sort order.<br />

Data Warehouse<br />

Data warehousing attempts to reconcile and integrate data from legacy applications software with<br />

today‟s newer technology. As mentioned earlier, industry is rife with older legacy systems that are<br />

currently cost prohibitive to replace. Most of these older systems are mission-critical operational<br />

transaction systems and satisfy most of the operational needs of the company. However, they are built<br />

on technology that cannot support the kinds of decision support tools that management requires. Many<br />

of these systems use older file structures or obsolete database management systems and are almost<br />

incapable of accessing and manipulating data.<br />

As an alternative to replacing these systems, data warehousing provides a state-of-the-art database<br />

management system that is fed data from the older legacy systems. However, data does get duplicated,<br />

which can potentially cause a synchronization problem between the data in the warehouse and the data<br />

in the older legacy systems. Consequently, IT management must put stringent controls in place. Still,<br />

the benefits outweigh the potential problems, for the data warehouse comes with all of the high-tech<br />

tools that will enable management to create a plethora of queries and reports. Most of the newer<br />

business intelligence tools mentioned earlier require a storage capability similar to the data warehouse.<br />

Any organization that implements a data warehouse must create a corporate data directory for the<br />

data that will reside in the data warehouse. This metadata includes attributes (name, size, data type,<br />

etc.); data about the data (where it is located, which application owns it, what data it is associated<br />

with); and descriptive information (context, quality, condition). Software tools follow these rules to<br />

extract, transform, and load operational data into the data warehouse. For example, the metadata will<br />

tell an end user that the data field called “sales” means “net sales” and comes from the general ledger<br />

application as the aggregate of all delivered orders for the period. Ideally, data elements should have<br />

names that are agreed to by those who work with the data—so maybe, in this example, the field would<br />

be called “net sales” to begin with.<br />

Exhibit 20.8 Snowflake schema.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!