The Data Lake Survival Guide
2o2JwuQ
2o2JwuQ
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>The</strong> <strong>Data</strong> <strong>Lake</strong> <strong>Survival</strong> <strong>Guide</strong><br />
Real implementations are much more complex, usually involving a data staging area<br />
where data is placed prior to ingestion into the data warehouse. This may be necessary<br />
for operational reasons such as the data warehouse needing to limit data ingest to<br />
particular times. Alternatively data may need to be cleaned or restructured before<br />
ingest. In some instances, because data took too long to flow through to a data mart,<br />
yet another database - called an Operational <strong>Data</strong> Store (ODS) - would be created to<br />
provide a more timely service to BI dashboards.<br />
<strong>The</strong> need for such awkward manoeuvres might have been eliminated in time by the<br />
increasing power of hardware. However, this approach was constrained by other factors<br />
all of which we will identify and discuss in detail later.<br />
<strong>The</strong> Value of <strong>Data</strong><br />
Businesses do not remain static. <strong>The</strong>ir processes change and evolve, their business<br />
models change and the markets they serve are gradually reshaped. Precisely how this<br />
happens varies, but generally we can think of there being a simple feedback loop, which<br />
governs the process. We illustrate this in Figure 2.<br />
<strong>The</strong> feedback loop has three steps:<br />
• Plan (business process design and implementation)<br />
• Run operational business processes<br />
• Review operational business processes<br />
<strong>The</strong>re may be many manual elements in this; it is rarely<br />
driven by IT, although IT normally contributes. BI and<br />
analytics have the clear role of providing information either<br />
to assist operational business processes or to assist planning<br />
and change management activities.<br />
Planning &<br />
Change<br />
Management<br />
Operational<br />
Activity<br />
Conceptually, there is nothing new at all about this view of<br />
company behavior and the role of BI and Analytics. What<br />
has drawn attention to Big <strong>Data</strong> and the BI & Analytics<br />
applications it supports, is that the technology parameters<br />
have shifted dramatically, as we shall discuss later in<br />
this report. If the Big <strong>Data</strong> opportunity is pursued, the<br />
efficacy of this corporate feedback loop will be improved.<br />
Organizations will be more “data driven” than before and<br />
their success will be more dependent on making effective<br />
Business<br />
Intelligence<br />
& Analytics<br />
Figure 2. Change<br />
use of technology for this purpose. This is why some businesses are now chanting the<br />
mantra, “data driven, data driven, data driven.”<br />
<strong>The</strong> triumph in 1997 IBM’s Deep Blue computer in a chess match with world chess<br />
champion, Gary Kasparov, the later victory in 2011 by IBM’s Watson computer system<br />
against three Jeopardy champions and the recent (2016) victory by Google AI against the<br />
world Go champion have demonstrated beyond argument that computer intelligence<br />
can now outstrip the most intelligent humans in well-defined contexts.<br />
2