16.08.2013 Views

course structure - DSpace at CUSAT

course structure - DSpace at CUSAT

course structure - DSpace at CUSAT

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

E15 - D<strong>at</strong>a Mining<br />

(July 2008)<br />

Prerequisite: E3 - Applied Probability and St<strong>at</strong>istics<br />

Unit 1<br />

An overview of d<strong>at</strong>a mining: D<strong>at</strong>a Mining: applic<strong>at</strong>ions, Knowledge discovery,<br />

Challenges, D<strong>at</strong>a mining tasks, Examples.<br />

D<strong>at</strong>a: Different types of d<strong>at</strong>a, Quality of d<strong>at</strong>a, D<strong>at</strong>a preprocessing methods. Measures of<br />

similarity and dissimilarity of d<strong>at</strong>a.<br />

Unit 2<br />

The Iris d<strong>at</strong>a sets, Summary st<strong>at</strong>istics, Visualiz<strong>at</strong>ion: Motiv<strong>at</strong>ions, General concepts,<br />

Techniques of visualiz<strong>at</strong>ion, Visualizing higher dimensional d<strong>at</strong>a, Overview of OLAP<br />

and multidimensional d<strong>at</strong>a analysis.<br />

Unit 3<br />

Basic concepts of classific<strong>at</strong>ion: Definition, Descriptive and Predictive modeling,<br />

General approach to solving a classific<strong>at</strong>ion problem, Decision Trees, Model overfitting<br />

Evalu<strong>at</strong>ing the performance of a classifier, Methods for Comparing classifiers.<br />

Altern<strong>at</strong>ive Techniques Of Classific<strong>at</strong>ion: Rule based classifier, Nearest<br />

neighbor classifiers, Baysiean classifiers, Artificial neural networks.<br />

Unit 4<br />

Associ<strong>at</strong>ion analysis: Basic concepts: Problem Definition ,Frequent Item set gener<strong>at</strong>ion,<br />

Rule gener<strong>at</strong>ion, compact item sets, Altern<strong>at</strong>ive methods for gener<strong>at</strong>ing frequent item<br />

sets. Evalu<strong>at</strong>ion of associ<strong>at</strong>ion p<strong>at</strong>terns.<br />

Unit 5<br />

Cluster analysis: Basic concepts And algorithms: K means, Agglomer<strong>at</strong>ive hierarchical<br />

clustering, DBSCAN, Cluster evalu<strong>at</strong>ion .<br />

Basics Of anomaly detection: Preliminaries, St<strong>at</strong>istical approaches.<br />

Text Book:<br />

Pang-Ning Tan, Michael Steinbach, Vipin Kumar, ‘Introduction to D<strong>at</strong>a Mining’,<br />

Pearson, 2006 .<br />

References:<br />

1 Ian H. Witten, Eibe Frank, ‘D<strong>at</strong>a Mining: Practical Machine Learning Tools<br />

and Techniques’, 2 nd Ed., Morgan Kaufmann, 2005.<br />

2 Arun K. Pujari, ‘D<strong>at</strong>a Mining Techniques’, Universities Press, 2006.<br />

******

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!