Introduction To Data Mining Instructor's Solution Manual

Recommendations

Info

10 Chapter 2 <strong>Data</strong> 13. Consider the problem of finding the K nearest neighbors of a data object. A programmer designs Algorithm 2.1 for this task. Algorithm 2.1 Algorithm for finding K nearest neighbors. 1: for i =1tonumber of data objects do 2: Find the distances of the i th object to all other objects. 3: Sort these distances in decreasing order. (Keep track of which object is associated with each distance.) 4: return the objects associated with the first K distances of the sorted list 5: end for (a) Describe the potential problems with this algorithm if there are duplicate objects in the data set. Assume the distance function will only return a distance of 0 for objects that are the same. There are several problems. First, the order of duplicate objects on a nearest neighbor list will depend on details of the algorithm and the order of objects in the data set. Second, if there are enough duplicates, the nearest neighbor list may consist only of duplicates. Third, an object may not be its own nearest neighbor. (b) How would you fix this problem? There are various approaches depending on the situation. One approach is to to keep only one object for each group of duplicate objects. In this case, each neighbor can represent either a single object or a group of duplicate objects. 14. The following attributes are measured for members of a herd of Asian elephants: weight, height, tusk length, trunk length, and ear area. Based on these measurements, what sort of similarity measure from Section 2.4 would you use to compare or group these elephants? Justify your answer and explain any special circumstances. These attributes are all numerical, but can have widely varying ranges of values, depending on the scale used to measure them. Furthermore, the attributes are not asymmetric and the magnitude of an attribute matters. These latter two facts eliminate the cosine and correlation measure. Euclidean distance, applied after standardizing the attributes to have a mean of 0 and a standard deviation of 1, would be appropriate. 15. You are given a set of m objects that is divided into K groups, where the i th group is of size mi. If the goal is to obtain a sample of size n
11 (a) We randomly select n ∗ mi/m elements from each group. (b) We randomly select n elements from the data set, without regard for the group to which an object belongs. The first scheme is guaranteed to get the same number of objects from each group, while for the second scheme, the number of objects from each group will vary. More specifically, the second scheme only guarantes that, on average, the number of objects from each group will be n ∗ mi/m. 16. Consider a document-term matrix, where tfij is the frequency of the i th word (term) in the j th document and m is the number of documents. Consider the variable transformation that is defined by tf ′ ij = tfij ∗ log m , (2.1) dfi where dfi is the number of documents in which the ith term appears and is known as the document frequency of the term. This transformation is known as the inverse document frequency transformation. (a) What is the effect of this transformation if a term occurs in one document? In every document? Terms that occur in every document have 0 weight, while those that occur in one document have maximum weight, i.e., log m. (b) What might be the purpose of this transformation? This normalization reflects the observation that terms that occur in every document do not have any power to distinguish one document from another, while those that are relatively rare do. 17. Assume that we apply a square root transformation to a ratio attribute x to obtain the new attribute x ∗ . As part of your analysis, you identify an interval (a, b) inwhichx ∗ has a linear relationship to another attribute y. (a) What is the corresponding interval (a, b) in terms of x? (a2 ,b2 ) (b) Give an equation that relates y to x. In this interval, y = x2 . 18. This exercise compares and contrasts some similarity and distance measures. (a) For binary data, the L1 distance corresponds to the Hamming distance; that is, the number of bits that are different between two binary vectors. The Jaccard similarity is a measure of the similarity between two binary vectors. Compute the Hamming distance and the Jaccard similarity between the following two binary vectors.
Page 1: Introduction to Data Mining Instruc
Page 5 and 6: Introduction 1 1. Discuss whether o
Page 7: 3. For each of the following data s
Page 10 and 11: 6 Chapter 2 Data (i) Ability to pas
Page 12 and 13: 8 Chapter 2 Data (a) How would you
Page 16 and 17: 12 Chapter 2 Data x = 0101010001 y
Page 18 and 19: 14 Chapter 2 Data Same as previous
Page 20 and 21: 16 Chapter 2 Data 23. Given a simil
Page 22 and 23: 18 Chapter 2 Data However, note tha
Page 24 and 25: 20 Chapter 3 Exploring Data the vie
Page 26 and 27: 22 Chapter 3 Exploring Data The bes
Page 28 and 29: 24 Chapter 3 Exploring Data 16. Con
Page 30 and 31: 26 Chapter 4 Classification The pre
Page 32 and 33: 28 Chapter 4 Classification (b) Wha
Page 34 and 35: 30 Chapter 4 Classification where P
Page 36 and 37: 32 Chapter 4 Classification The inf
Page 38 and 39: 34 Chapter 4 Classification X C1 C2
Page 40 and 41: 36 Chapter 4 Classification Answer:
Page 42 and 43: 38 Chapter 4 Classification 0 A B C
Page 44 and 45: 40 Chapter 4 Classification • Eac
Page 46 and 47: 42 Chapter 4 Classification Table 4
Page 49 and 50: Classification: Alternative Techniq
Page 51 and 52: Answer: For this problem, P = 500,
Page 53 and 54: 49 For R2, the expected frequency f
Page 55 and 56: Answer: If the examples for R1 are
Page 57 and 58: 53 (b) Use the estimate of conditio
Page 59 and 60: Table 5.2. Data set for Exercise 8.
Page 61 and 62: 57 (b) The prior probabilities are
Page 63 and 64: Answer: 59 P (B = good, F = empty,
Page 65 and 66:
15. For each of the Boolean functio
Page 67 and 68:
When t =0.5, the confusion matrix f
Page 69 and 70:
Y =0 Y =1 Y =2 + 0 30 0 − 200 200
Page 71 and 72:
The total cost is F = p(a + d)+q(b
Page 73 and 74:
Notice that the dual Lagrangian dep
Page 75 and 76:
Association Analysis: Basic Concept
Page 77 and 78:
(d) Use the results in part (c) to
Page 79 and 80:
ζ({A, B, C}) = min � c(A −→
Page 81 and 82:
Since s(A, B, C) ≤ s(A, B) and mi
Page 83 and 84:
{1, 3, 4, 5},{1, 3, 4, 6},{2, 3, 4,
Page 85 and 86:
L1 1,4,7 2,5,8 3,6,9 L5 L6 1,4,7 L7
Page 87 and 88:
ab ac null a b c d e ad abc abd abe
Page 89 and 90:
Table 6.4. Example of market basket
Page 91 and 92:
items. We will apply the Apriori al
Page 93 and 94:
89 where β =1− P (A) +2P (A, B).
Page 95 and 96:
91 (d) How does M behave when P (B)
Page 97 and 98:
C = 0 C = 1 Table 6.5. A Contingenc
Page 99 and 100:
Association Analysis: Advanced Conc
Page 101 and 102:
97 The number of frequent itemsets
Page 103 and 104:
Pressure Answer: The graph of Tempe
Page 105 and 106:
101 for A.) For each rule that corr
Page 107 and 108:
When bin − width =4: Where Table
Page 109 and 110:
105 approximate each interval by it
Page 111 and 112:
107 - Portuguese - Russian - Spanis
Page 113 and 114:
109 (b) Consider the approach where
Page 115 and 116:
111 (c) List all the 4-subsequences
Page 117 and 118:
• s =< {1, 2, 3, 4, 5, 6}{1, 2, 3
Page 119 and 120:
115 When there is no timing constra
Page 121 and 122:
b a a a a a b a a b a a b a a a (a)
Page 123 and 124:
a b 1 1 1 1 G 1 G 3 c e 1 d a 1 e 1
Page 125 and 126:
Table 7.17. Example of numeric data
Page 127:
123 (d) How should the support coun
Page 130 and 131:
126 Chapter 8 Cluster Analysis: Bas
Page 132 and 133:
Page 134 and 135:
Page 136 and 137:
Page 138 and 139:
Page 140 and 141:
Page 142 and 143:
Page 144 and 145:
Page 146 and 147:
Page 148 and 149:
Page 151 and 152:
Cluster Analysis: Additional Issues
Page 153 and 154:
149 The main advantage of this appr
Page 155 and 156:
y 6 4 2 0 -2 -4 -6 -8 -10 -8 -6 -4
Page 157 and 158:
Table 9.1. Two nearest neighbors of
Page 159:
155 Proof: We know d(b, c) ≤ d(b,
Page 162 and 163:
158 Chapter 10 Anomaly Detection ni
Page 164 and 165:
160 Chapter 10 Anomaly Detection tr
Page 166 and 167:
162 Chapter 10 Anomaly Detection Th
Page 168 and 169:
164 Chapter 10 Anomaly Detection 15
show all

Introduction To Data Mining Instructor's Solution Manual

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?