anytime algorithms for learning anytime classifiers saher ... - Technion

Technion - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008

Table 5.1: Characteristics of the datasets used to evaluate TATA

                             Attributes            Max
Dataset          Instances   Nom. (bin.)   Num.    domain   Classes
Breast Cancer    277         9 (3)         0       13       2
Bupa             345         0 (0)         5       -        2
Car              1728        6 (0)         0       4        4
Flare            323         10 (5)        0       7        4
Glass            214         0 (0)         9       -        7
Heart            296         8 (4)         5       4        2
Hepatitis        154         13 (13)       6       2        2
Iris             150         0 (0)         4       -        3
KRK              28056       6 (0)         0       8        17
Monks-1          124+432     6 (2)         0       4        2
Monks-2          169+432     6 (2)         0       4        2
Monks-3          122+432     6 (2)         0       4        2
Multiplexer-20   615         20 (20)       0       2        2
Multi-XOR        200         11 (11)       0       2        2
Multi-AND-OR     200         11 (11)       0       2        2
Nursery          8703        8 (8)         0       5        5
Pima             768         0 (0)         8       -        2
TAE              151         4 (1)         1       26       3
Tic-Tac-Toe      958         9 (0)         0       3        2
Titanic          2201        3 (2)         0       4        2
Thyroid          3772        15 (15)       5       2        3
Voting           232         16 (16)       0       2        2
Wine             178         0 (0)         13      -        3
XOR 3D           200         0 (0)         6       -        2
XOR-5            200         10 (10)       0       2        2

administer any test and thus their performance is identical. At the other end, when ρc ≥ ρc_max, the attribute costs are effectively not a constraint. In this case TATA(r = 5) performed best, confirming the results reported in Chapter 4 for the case where misclassification costs were dominant.

The more interesting ρc values are those in between. Table 5.2 lists the normalized area under the misclassification cost curve over the range from 33% to 99% of ρc_max. In agreement with the curves, the results indicate that TATA(r = 5) has the best overall performance. The Wilcoxon test (Demšar, 2006), which compares classifiers over multiple datasets, finds TATA(r = 5) to be significantly better than all the other algorithms.

As expected, all five algorithms improve as ρc increases because they can use more features. For ρc values slightly larger than ρc_min, EG2, which is cost-sensitive, performs better than C4.5. The reason is that EG2 takes attribute costs into account and hence prefers lower-cost attributes. As ρc increases and the cost constraints are relaxed, C4.5 becomes better than EG2.
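The normalized area under the misclassification cost curve mentioned above can be illustrated with a short sketch. It assumes the curve is given as measured (contract, cost) points, is piecewise linear between them, and that "normalized" means dividing the area by the width of the range, so the result is the average misclassification cost over that range. These assumptions, the function names, and the sample numbers are illustrative only and are not taken from the thesis.

```python
from bisect import bisect_left


def interpolate(xs, ys, x):
    """Linearly interpolate the curve (xs, ys) at x.

    xs must be strictly increasing; values outside the measured
    range are clamped to the nearest endpoint.
    """
    if x <= xs[0]:
        return ys[0]
    if x >= xs[-1]:
        return ys[-1]
    i = bisect_left(xs, x)  # first index with xs[i] >= x
    x0, x1 = xs[i - 1], xs[i]
    y0, y1 = ys[i - 1], ys[i]
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


def normalized_area(xs, ys, lo, hi):
    """Average height of the curve over [lo, hi] (trapezoid rule).

    Assumption: normalization divides the area by (hi - lo).
    """
    if not lo < hi:
        raise ValueError("lo must be smaller than hi")
    # Range endpoints plus every measured point strictly inside the range.
    pts = [lo] + [x for x in xs if lo < x < hi] + [hi]
    vals = [interpolate(xs, ys, x) for x in pts]
    area = sum(
        (pts[i + 1] - pts[i]) * (vals[i] + vals[i + 1]) / 2
        for i in range(len(pts) - 1)
    )
    return area / (hi - lo)


# Hypothetical curve: contract as % of rho_c_max vs. misclassification cost.
contracts = [0, 20, 40, 60, 80, 100]
costs = [50, 42, 35, 29, 24, 21]
score = normalized_area(contracts, costs, 33, 99)
```

A lower score means a learner keeps misclassification cost low across the whole range of contracts rather than at a single contract value, which is why a single number per learner can summarize each curve.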
[Figure: misclassification cost (y-axis) versus maximal classification cost as % of total cost (x-axis), with curves for C4.5, EG2, EG2$, TATA(r=0), and TATA(r=5).]

Figure 5.11: Results for pre-contract classification: the misclassification cost for different preallocated testing costs, as percentage of the total cost. The results are averaged over all 100 datasets.

Table 5.2: Comparing the misclassification cost for different testing cost contracts. The numbers represent the average over 100 datasets. The last column indicates whether the advantage of TATA(r = 5) is statistically significant according to the Wilcoxon test (α = 5%).

              Area under MC(ρc),
Learner       0.33 ρc_max to ρc_max   Wilcoxon
TATA(r = 5)   21.12
C4.5          28.84                   √
TATA(r = 0)   26.93                   √
EG2           31.21                   √
EG2$          30.48                   √

It is interesting to compare the TDIDT$ variants of C4.5 and EG2 to their TDIDT counterparts. Both TDIDT$ variants clearly exhibit better anycost behavior up to the point where all relevant attributes can be used (ρc ∼ ρc_max), where the performance of each pair becomes identical. The advantage of the TDIDT$ variants arises because they do not choose tests that violate the cost limits and are therefore not forced to stop the induction process early.

A comparison of TATA(r = 0) to TATA(r = 5) indicates that the latter is clearly better: while TATA(r = 0) chooses split attributes greedily, TATA(r = 5)