18.12.2012 Views

Proceedings

Proceedings

Proceedings

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

of it, to dig out meaningful relationships and patterns. It is important to note that<br />

predictive analytics is more than statistics – some call it “statistics on steroids” (W.<br />

Eckerson, 2007). It’s true that nearly all analytical modelers use descriptive statistics<br />

to understand the nature of data that has to be analyzed, but there are a lot of<br />

predictive techniques (neural networks, decision trees, genetic algorithms) that take<br />

advantage of increased computer processing power to perform complex calculations<br />

that often require multiple passes through extremely large volumes of data.<br />

At that point, we can ask: what’s the difference than between predictive analytics and<br />

data mining? Because the term data mining has been used - especially by the software<br />

companies, while academics and researchers have used the term “knowledge<br />

discovery” instead - to describe the techniques and processes involved in creating<br />

predictive models. Both data mining and predictive analytics apply sophisticated<br />

mathematics to data in order to solve business problems. But when we talk about data<br />

mining, we are usually referring to an analytical toolset that automatically searches for<br />

useful patterns in large data sets. On the other side, predictive analytics is an analyst –<br />

guided (not automatic) discipline that uses data patterns to make forward –looking<br />

predictions by evaluating multiple data patterns. Data mining searches for clues, while<br />

predictive analytics delivers answers that can guide to a “what next” action.<br />

Figure 5. Data mining versus predictive analytics<br />

Data<br />

mining<br />

Explore<br />

~ 1115 ~<br />

Predictive<br />

analitics<br />

Answers<br />

What next?<br />

Data mining is often one stage in developing a predictive model; it’s automated<br />

techniques are used to isolate the most data variables within a vast field of<br />

possibilities. These variables are used to build a mathematical model that predicts the<br />

future behavior consistently.<br />

The term data mining it‘s so “out of fashion” that vendors and consultants now<br />

embrace the term “predictive analytics” or “advanced analytics” or just “analytics” to<br />

describe the nature of tools they offer. But not all the analytics are predictive! In fact,<br />

there are only two major types of predictive analytics: supervised learning (the<br />

process of creating predictive models using a set of historical data that contains the<br />

results we want to predict) and unsupervised learning (previously known results are<br />

not used to train its models).

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!