anytime algorithms for learning anytime classifiers saher ... - Technion

More documents

Recommendations

Info

Technion - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008 Average Size Average Accuracy 100 90 80 70 60 50 40 30 20 10 IIDT(0.1) Skewing Sequential Skewing 0 0 0.05 0.1 0.15 0.2 0.25 100 90 80 70 60 50 Time [sec] IIDT(0.1) Skewing Sequential Skewing Bagging-ID3 Bagging-LSID3 0 0.05 0.1 0.15 0.2 0.25 Time [sec] Figure 3.33: Anytime behavior of modern learners on the XOR-5 dataset Empirical Comparison We used our own implementation for IIDT, skewing, and bagging, and the commercial version for GATree. 15 The skewing and sequential skewing versions were run with linearly increasing parameters. The generalized skewing algorithm was run with exponentially increasing parameters. The performance of the ensemble method was tested for exponentially increasing committee sizes (1, 2, 4, 8, . . .). Figures 3.33, 3.34, and 3.35 compare IIDT to bagging with ID3 as a base learner, bagging with LSID3(r = 1), and skewing on the XOR-5, Multiplexer-20, and Tic-tac-toe tasks respectively. Note that the results for ID3 are identical to those of bagging-ID3 with a single tree in the committee and hence are not plotted independently. Since GATree was run on a different machine, we report 15 The experiments with GATree were run on a Pentium IV 2.8 GHz machine with the Windows XP operating system. The reported times are as output by the application itself. 66
Technion - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008 Average Size Average Accuracy 180 160 140 120 100 80 60 40 20 IIDT(0.1) Skewing Sequential Skewing 0 0 1 2 3 4 5 6 7 100 90 80 70 60 50 Time [sec] IIDT(0.1) Skewing Sequential Skewing Bagging-ID3 Bagging-LSID3 0 1 2 3 4 5 6 7 Time [sec] Figure 3.34: Anytime behavior of modern learners on the Multiplexer-20 dataset its results separately later in this section. The graphs for the first 2 problems, which are known to be hard, show that IIDT clearly outperforms the other methods both in terms of tree size and accuracy. In both cases IIDT reaches almost perfect accuracy (99%), while bagging- ID3 and skewing topped at 55% for the first problem and 75% for the second. The inferior performance of bagging-ID3 on the XOR-5 and Multiplexer-20 tasks is not surprising. The trees that form the committee were induced greedily and hence could not discover these difficult concepts, even when they were combined. Similar results were obtained when running bagging over C4.5 and RTG. However, when our LSID3(r = 1) was used as a base learner, performance was significantly better than that of the greedy committees. Still, IIDT performed significantly better than bagging-LSID3, indicating that for difficult concepts, it is better to invest more resources for improving a single tree than for adding more trees of lower quality to the committee. 67
Page 1 and 2:
Technion - Computer Science Departm
Page 3 and 4:
Page 5 and 6:
Page 7 and 8:
Page 9 and 10:
Page 11 and 12:
Page 13 and 14:
Page 15 and 16:
Page 17 and 18:
Page 19 and 20:
Page 21 and 22:
Page 23 and 24:
Page 25 and 26:
Page 27 and 28:
Page 29 and 30:
Page 31 and 32: Technion - Computer Science Departm
Page 81: Technion - Computer Science Departm
Page 133 and 134:
Page 135 and 136:
Page 137 and 138:
Page 139 and 140:
Page 141 and 142:
Page 143 and 144:
Page 145 and 146:
Page 147 and 148:
Page 149 and 150:
Page 151 and 152:
Page 153 and 154:
Page 155 and 156:
Page 157 and 158:
Page 159 and 160:
Page 161 and 162:
Page 163 and 164:
Page 165 and 166:
Page 167 and 168:
Page 169 and 170:
Page 171 and 172:
Page 173 and 174:
Page 175 and 176:
Page 177 and 178:
Page 179 and 180:
Page 181 and 182:
Page 183 and 184:
Page 185 and 186:
Page 187 and 188:
Page 189 and 190:
Page 191 and 192:
show all

anytime algorithms for learning anytime classifiers saher ... - Technion

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?