anytime algorithms for learning anytime classifiers saher ... - Technion

More documents

Recommendations

Info

Technion - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008 Test costs a1 a2 a3 a4 a5 a6 10 10 10 10 10 10 MC costs FP FN 100 100 r = 1 T1 T2 5 95 + - EE = 7.3 a3 - + a1 a5 - + 95 + 5 + 95 + 5 - 95 - 5 - EE = 7.3 EE = 7.3 EE = 7.3 mcost (T1) = 7.3*4*100$ / 400 = 7.3$ tcost (T1) = 20$ total(T1) = 27.3$ 0 50 + - EE = 1.4 a4 - + a2 a6 - + 100 + 0 + 100 + 50 - 50 - 50 - EE = 54.1 EE = 1.4 EE = 54.1 mcost (T2) = (1.4*2 + 54.1*2) * 100$ / 400 = 27.7$ tcost (T2) = 20$ total (T2) = 47.7$ Figure 4.4: Evaluation of tree samples in ACT. The leftmost column defines the costs: 6 attributes with identical cost and uniform error penalties. T1 was sampled for a1 and T2 for a2. EE stands for the expected error. Because the total cost of T1 is lower, ACT would prefer to split on a1. Test costs a1 a2 a3 a4 a5 a6 40 10 10 10 10 10 MC costs FP FN 100 100 r = 1 T1 T2 a3 - + a1 5 + 95 + 5 + 95 + 95 - 5 - 95 - 5 - EE = 7.3 EE = 7.3 EE = 7.3 EE = 7.3 mcost (T1) = 7.3*4*100$ / 400 = 7.3$ tcost (T1) = 50$ total(T1) = 57.3$ a5 - + a4 - + a2 0 + 100 + 0 + 100 + 50 - 50 - 50 - 50 - EE = 1.4 EE = 54.1 EE = 1.4 EE = 54.1 a6 - + mcost (T2) = (1.4*2 + 54.1*2) * 100$ / 400 = 27.7$ tcost (T2) = 20$ total (T2) = 47.7$ Figure 4.5: Evaluation of tree samples in ACT. The leftmost column defines the costs: 6 attributes with identical cost (except for the expensive a1) and uniform error penalties. T1 was sampled for a1 and T2 for a2. Because the total cost of T2 is lower, ACT would prefer to split on a2. respectively. The expected error costs of T1 and T2 are: 1 mcost(T1) � = 1 4 · 7.3 (4 · EE (100, 5, 0.25)) · 100$ = · 100$ = 7.3$ 400 400 mcost(T2) � = 1 (2 · EE (50, 0, 0.25) · 100$ + 2 · EE (150, 50, 0.25) · 100$) 400 = 2 · 1.4 + 2 · 54.1 · 100$ = 27.7$ 400 When both test and error costs are involved, ACT considers their sum. Since the test cost of both trees is identical (20$), ACT would prefer to split on a1. 1 In this example we set cf to 0.25, as in C4.5. In Section 4.5 we discuss how to tune cf. 76
Technion - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008 Test costs a1 a2 a3 a4 a5 a6 10 10 10 10 10 10 MC costs FP FN 1 199 r = 1 T1 T2 a3 - + a1 5 + 95 + 5 + 95 + 95 - 5 - 95 - 5 - EE = 7.3 EE = 7.3 EE = 7.3 EE = 7.3 a5 - + mcost (T1) = (7.3*2*1$ + 7.3*2*199$) / 400 = 7.3$ tcost (T1) = 20$ total (T1) = 27.3$ a4 - + a2 0 + 100 + 0 + 100 + 50 - 50 - 50 - 50 - EE = 1.4 EE = 54.1 EE = 1.4 EE = 54.1 a6 - + mcost (T2) = (54.1*2*1$ + 1.4*2*199$) / 400 = 1.7$ tcost (T2) = 20$ total (T2) = 21.7$ Figure 4.6: Evaluation of tree samples in ACT. The leftmost column defines the costs: 6 attributes with identical cost and nonuniform error penalties. T1 was sampled for a1 and T2 for a2. Because the total cost of T2 is lower, ACT would prefer to split on a2. If, however, the cost of a1 were 40$, as in Figure 4.5, tcost(T1) would become 50$ and the total cost of T1 would become 57.3$, while that of T2 would remain 47.7$. Hence, in this case ACT would split on a2. To illustrate how ACT handles nonuniform error penalties, let us assume that the cost of all attributes is again 10$, while the cost of a false positive (FP) is 1$ and the cost of a false negative (FN) is 199$. Let the trees in Figure 4.6, denoted T1 and T2, be those sampled for a1 and a2 respectively. As in the first example, only misclassification costs play a role because the test costs of both trees is the same. Although on average the misclassification cost is also 100$, ACT now evaluates these trees differently: mcost(T1) � = 1 (2 · EE (100, 5, 0.25) · 1$ + 2 · EE (100, 5, 0.25) · 199$) 400 = 2 · 7.3 · 1$ + 2 · 7.3.1 · 199$ = 7.3$ 400 mcost(T2) � = 1 (2 · EE (50, 0, 0.25) · 199$ + 2 · EE (100, 50, 0.25) · 1$) 400 = 2 · 1.4 · 199$ + 2 · 54.1 · 1$ = 1.7$ 400 Therefore, in the nonuniform setup, ACT would prefer a2. This makes sense because in the given setup we prefer trees that may result in more false positives but reduce the number of expensive false negatives. 77
Page 1 and 2:
Technion - Computer Science Departm
Page 3 and 4:
Page 5 and 6:
Page 7 and 8:
Page 9 and 10:
Page 11 and 12:
Page 13 and 14:
Page 15 and 16:
Page 17 and 18:
Page 19 and 20:
Page 21 and 22:
Page 23 and 24:
Page 25 and 26:
Page 27 and 28:
Page 29 and 30:
Page 31 and 32:
Page 33 and 34:
Page 35 and 36:
Page 37 and 38:
Page 39 and 40:
Page 41 and 42: Technion - Computer Science Departm
Page 91: Technion - Computer Science Departm
Page 143 and 144:
Page 145 and 146:
Page 147 and 148:
Page 149 and 150:
Page 151 and 152:
Page 153 and 154:
Page 155 and 156:
Page 157 and 158:
Page 159 and 160:
Page 161 and 162:
Page 163 and 164:
Page 165 and 166:
Page 167 and 168:
Page 169 and 170:
Page 171 and 172:
Page 173 and 174:
Page 175 and 176:
Page 177 and 178:
Page 179 and 180:
Page 181 and 182:
Page 183 and 184:
Page 185 and 186:
Page 187 and 188:
Page 189 and 190:
Page 191 and 192:
show all

anytime algorithms for learning anytime classifiers saher ... - Technion

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?