Introduction To Data Mining Instructor's Solution Manual
(d) What is the best split (among a1, a2, and a3) according to the information gain?
Answer:
According to information gain, a1 produces the best split.
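As a quick cross-check, here is a minimal Python sketch of the information-gain computation. It assumes the branch class counts implied by parts (e) and (f) below — a1 splits the nine instances into (3+, 1−) and (1+, 4−), a2 into (2+, 3−) and (2+, 2−) — and omits a3, whose value distribution is not reproduced in this excerpt:

```python
import math

def entropy(counts):
    """Entropy (base 2) of a list of per-class counts."""
    n = sum(counts)
    return -sum(c / n * math.log2(c / n) for c in counts if c > 0)

def info_gain(parent, children):
    """Parent entropy minus the size-weighted entropy of the children."""
    n = sum(parent)
    weighted = sum(sum(ch) / n * entropy(ch) for ch in children)
    return entropy(parent) - weighted

parent = [4, 5]  # (+, -) counts at the node before splitting
print(info_gain(parent, [[3, 1], [1, 4]]))  # split on a1: ~0.2294
print(info_gain(parent, [[2, 3], [2, 2]]))  # split on a2: ~0.0072
```

The gain for a1 (about 0.2294) is much larger than the gain for a2 (about 0.0072), consistent with the stated answer.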
(e) What is the best split (between a1 and a2) according to the classification error rate?
Answer:
For attribute a1: error rate = 2/9.
For attribute a2: error rate = 4/9.
Therefore, according to error rate, a1 produces the best split.
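These error rates follow from predicting the majority class in each branch: a1 misclassifies 1 + 1 = 2 of the 9 instances, a2 misclassifies 2 + 2 = 4. A small sketch using the same count representation as above (helper name illustrative):

```python
def error_rate(children):
    """Weighted classification error when each child predicts its majority class."""
    n = sum(sum(ch) for ch in children)
    return sum((sum(ch) - max(ch)) / n for ch in children)

print(error_rate([[3, 1], [1, 4]]))  # a1: 2/9 ~ 0.2222
print(error_rate([[2, 3], [2, 2]]))  # a2: 4/9 ~ 0.4444
```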
(f) What is the best split (between a1 and a2) according to the Gini index?
Answer:
For attribute a1, the Gini index is
\[
\frac{4}{9}\Big[1 - (3/4)^2 - (1/4)^2\Big] + \frac{5}{9}\Big[1 - (1/5)^2 - (4/5)^2\Big] = 0.3444.
\]
For attribute a2, the Gini index is
\[
\frac{5}{9}\Big[1 - (2/5)^2 - (3/5)^2\Big] + \frac{4}{9}\Big[1 - (2/4)^2 - (2/4)^2\Big] = 0.4889.
\]
Since the Gini index for a1 is smaller, it produces the better split.
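The same branch counts reproduce both Gini values; a short sketch along the same lines (Python assumed, helper names illustrative):

```python
def gini(counts):
    """Gini index of a list of per-class counts: 1 minus summed squared proportions."""
    n = sum(counts)
    return 1 - sum((c / n) ** 2 for c in counts)

def weighted_gini(children):
    """Size-weighted Gini index over the child nodes of a split."""
    n = sum(sum(ch) for ch in children)
    return sum(sum(ch) / n * gini(ch) for ch in children)

print(weighted_gini([[3, 1], [1, 4]]))  # a1: 0.3444
print(weighted_gini([[2, 3], [2, 2]]))  # a2: 0.4889
```

Both printed values agree with the hand computation above.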
4. Show that the entropy of a node never increases after splitting it into smaller successor nodes.
Answer:
Let $Y = \{y_1, y_2, \ldots, y_c\}$ denote the $c$ classes and $X = \{x_1, x_2, \ldots, x_k\}$ denote the $k$ attribute values of an attribute $X$. Before a node is split on $X$, the entropy is
\[
E(Y) = -\sum_{j=1}^{c} P(y_j)\,\log_2 P(y_j) = -\sum_{j=1}^{c}\sum_{i=1}^{k} P(x_i, y_j)\,\log_2 P(y_j), \tag{4.1}
\]
where we have used the fact that $P(y_j) = \sum_{i=1}^{k} P(x_i, y_j)$ from the law of total probability.
After splitting on $X$, the entropy for each child node $X = x_i$ is
\[
E(Y \mid x_i) = -\sum_{j=1}^{c} P(y_j \mid x_i)\,\log_2 P(y_j \mid x_i). \tag{4.2}
\]
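As an empirical complement to the proof (not part of the manual's argument), the sketch below randomly splits a parent node's class counts into two children and verifies that the size-weighted child entropy, $\sum_i P(x_i)\,E(Y \mid x_i)$, never exceeds $E(Y)$:

```python
import math, random

def entropy(counts):
    """Entropy (base 2) of a list of per-class counts."""
    n = sum(counts)
    return -sum(c / n * math.log2(c / n) for c in counts if c > 0)

random.seed(0)
for _ in range(1000):
    parent = [random.randint(1, 20) for _ in range(3)]   # 3 classes
    left = [random.randint(0, c) for c in parent]        # split each class count
    right = [p - l for p, l in zip(parent, left)]
    n = sum(parent)
    children = [ch for ch in (left, right) if sum(ch) > 0]
    weighted = sum(sum(ch) / n * entropy(ch) for ch in children)
    assert weighted <= entropy(parent) + 1e-12           # entropy never increases
```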