E15 - D<strong>at</strong>a Mining (July 2008) Prerequisite: E3 - Applied Probability and St<strong>at</strong>istics Unit 1 An overview of d<strong>at</strong>a mining: D<strong>at</strong>a Mining: applic<strong>at</strong>ions, Knowledge discovery, Challenges, D<strong>at</strong>a mining tasks, Examples. D<strong>at</strong>a: Different types of d<strong>at</strong>a, Quality of d<strong>at</strong>a, D<strong>at</strong>a preprocessing methods. Measures of similarity and dissimilarity of d<strong>at</strong>a. Unit 2 The Iris d<strong>at</strong>a sets, Summary st<strong>at</strong>istics, Visualiz<strong>at</strong>ion: Motiv<strong>at</strong>ions, General concepts, Techniques of visualiz<strong>at</strong>ion, Visualizing higher dimensional d<strong>at</strong>a, Overview of OLAP and multidimensional d<strong>at</strong>a analysis. Unit 3 Basic concepts of classific<strong>at</strong>ion: Definition, Descriptive and Predictive modeling, General approach to solving a classific<strong>at</strong>ion problem, Decision Trees, Model overfitting Evalu<strong>at</strong>ing the performance of a classifier, Methods for Comparing classifiers. Altern<strong>at</strong>ive Techniques Of Classific<strong>at</strong>ion: Rule based classifier, Nearest neighbor classifiers, Baysiean classifiers, Artificial neural networks. Unit 4 Associ<strong>at</strong>ion analysis: Basic concepts: Problem Definition ,Frequent Item set gener<strong>at</strong>ion, Rule gener<strong>at</strong>ion, compact item sets, Altern<strong>at</strong>ive methods for gener<strong>at</strong>ing frequent item sets. Evalu<strong>at</strong>ion of associ<strong>at</strong>ion p<strong>at</strong>terns. Unit 5 Cluster analysis: Basic concepts And algorithms: K means, Agglomer<strong>at</strong>ive hierarchical clustering, DBSCAN, Cluster evalu<strong>at</strong>ion . Basics Of anomaly detection: Preliminaries, St<strong>at</strong>istical approaches. Text Book: Pang-Ning Tan, Michael Steinbach, Vipin Kumar, ‘Introduction to D<strong>at</strong>a Mining’, Pearson, 2006 . References: 1 Ian H. Witten, Eibe Frank, ‘D<strong>at</strong>a Mining: Practical Machine Learning Tools and Techniques’, 2 nd Ed., Morgan Kaufmann, 2005. 2 Arun K. Pujari, ‘D<strong>at</strong>a Mining Techniques’, Universities Press, 2006. ******
E16 - Software Project Management Unit 1 Introduction to Management – Management: Science, Theory and Practice – Definition of Management: It’s n<strong>at</strong>ure and purpose, The aim of all managers Science or Art? The Systems approach to Oper<strong>at</strong>ional Management, The functions of Managers. Software Engineering Project Management: Major issues of Software Engineering, Functions and activities of Management, planning, organizing, staffing, directing and controlling a software Engineering Project. Unit 2 Project Evalu<strong>at</strong>ion: Str<strong>at</strong>egic Assessment, technical Assessment, Cost-benefit analysis, cash flow forecasting, cost-benefit evalu<strong>at</strong>ion techniques, Risk evalu<strong>at</strong>ion. Selection of an appropri<strong>at</strong>e project approach: Choosing Technologies, technical plan contents list, choice of process models, <strong>structure</strong> versus speed of delivery, The W<strong>at</strong>erfall model, The V-process model, the spiral model, Software prototyping, other ways of c<strong>at</strong>egorizing software prototypes. Controlling changes during prototyping, incremental delivery, dynamic systems development method, Extreme programming Managing iter<strong>at</strong>ive processes, selecting the most appropri<strong>at</strong>e process model. Unit 3 Software Effort Estim<strong>at</strong>ion: Problems with over and under estim<strong>at</strong>es, The basis for Software estim<strong>at</strong>ing, Software effort estim<strong>at</strong>ion techniques, expert judgment, estim<strong>at</strong>ing by analogy, Albrecht function point analysis, function points Mark II, Object points, a procedural code –oriented approach, COCOMO: A parametric model. Unit 4 Activity planning: The objectives of activity planning, When to plan, Project Schedules, Projects and activities, Sequencing and scheduling activities, Network planning models, Formul<strong>at</strong>ing a network model, Adding the time dimension, The forward pass, The backward pass, Identifying the critical p<strong>at</strong>h, Activity flo<strong>at</strong>, Shortening the project dur<strong>at</strong>ion, Identifying critical activities, Activity-on-arrow networks. Risk Management: The n<strong>at</strong>ure of risk, types of risks, Managing Risk, Hazard Identific<strong>at</strong>ion, Hazard Analysis, Risk planning and control, Evalu<strong>at</strong>ing risks to the schedule. Unit 5 Monitoring and control: Cre<strong>at</strong>ing the framework, Collecting the d<strong>at</strong>a, Visualizing progress, Cost monitoring, Earned value, prioritizing monitoring, Getting the project back to target, change control Managing Contracts: Types of contracts, Stages in contract placement, Typical terms of a contract, Contract Management, acceptance. Managing people and organizing teams: Understanding behavior, Organiz<strong>at</strong>ional behavior: a background, Selecting the right person for the job, instruction in the best methods, Motiv<strong>at</strong>ion, The Oldham-Hackman job characteristics Model, Working in groups, Becoming a team, Decision making, Leadership, Organiz<strong>at</strong>ional <strong>structure</strong>s, Stress, Health and Safety. References: 1. Bob Hughes & Mike Cotterell, ‘Software Project Management’, T<strong>at</strong>a McGraw-Hill, 2004. 2. ‘Software Engineering Project Management’, Edited by Richard H Thayer, Wiley- IEEE, Computer Society Press, 2004. 3. Walker Royce, ‘Software Project Management: A unified framework’, Pearson Educ<strong>at</strong>ion, 2003. 4. Roger S Pressman, ‘Software Engineering: a Practitioner’s approach’, 6 th Ed., T<strong>at</strong>a McGraw-Hill, 2004. 5. Donald J Reifer, ‘Software Management’, 6 th Ed., Wiley-IEEE Computer Society Press, 2002. ******