anytime algorithms for learning anytime classifiers saher ... - Technion

More documents

Recommendations

Info

Technion - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008 Average Size Average Accuracy 40 38 36 34 32 30 28 26 24 0 50 100 150 200 250 300 72 70 68 66 Time [sec] ID3 C4.5 IIDT(1) IIDT(0.1) 64 ID3 C4.5 IIDT(1) 62 IIDT(0.1) 0 50 100 150 200 250 300 350 Time [sec] Figure 3.28: Anytime behavior of IIDT on the Glass dataset 3.30 show the anytime performance of IIDT in terms of tree size and accuracy for the Glass, XOR-10, and Tic-tac-toe datasets. Each graph represents an average of 100 runs (for the 10 ×10 cross-validation). Unlike the graphs given in the previous section, these are interruptible anytime graphs, i.e., for each point, the y coordinate reflects the performance if the algorithm was interrupted at the associated x coordinate. In the contract algorithm graphs, however, each point reflects the performance if the algorithm was initially allocated the time represented by the x coordinate. In all cases, the two anytime versions indeed exploit the additional resources and produce both smaller and more accurate trees. Since our algorithm replaces a subtree only if the new one is smaller, all size graphs decrease monotonically. The most interesting anytime behavior is for the difficult XOR-10 problem. There, the tree size decreases from 4000 leaves to almost the optimal size (1024), and the accuracy increases from 50% (which is the accuracy achieved by ID3 and C4.5) to almost 100%. The shape of the graphs is typical to those of anytime algorithms 60
Technion - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008 Average size Average Accuracy 4000 3500 3000 2500 2000 1500 ID3 C4.5 IIDT(1) IIDT(0.1) 1000 0 100 200 300 400 500 600 100 95 90 85 80 75 70 65 60 55 50 45 Time [sec] 0 100 200 300 400 500 600 Time [sec] ID3 C4.5 IIDT(1) IIDT(0.1) Figure 3.29: Anytime behavior of IIDT on the 10-XOR dataset with diminishing returns. The improvement in the accuracy of IIDT (at the latest point it was measured) over ID3 and C4.5 was found by t-test (α = 0.05) to be significant for the Glass and XOR-10 datasets. The performance of IIDT on Tic-tac-toe slightly degrades over time. We believe that similarly to LSID3, IIDT can perform much better if binary splits are used. The difference in performance of the two anytime algorithms is interesting. IIDT(0.1), with the lower granularity parameter, indeed produces smoother anytime graphs (with lower volatility), which allows for better control and better predictability of return. Moreover, in large portions of the time axis, the IIDT(0.1) graph shows better performance than that of IIDT(1). This is due to more sophisticated node selection in the former. Recall that g = 1 means that the algorithm always selects the entire tree for improvement. The smoothness of the IIDT(0.1) graphs is somehow misleading because it represents an average of 100 runs, with each step taking place at a different time (this is in contrast to the graph for IIDT(1), where the steps are at roughly the 61
Page 1 and 2:
Technion - Computer Science Departm
Page 3 and 4:
Page 5 and 6:
Page 7 and 8:
Page 9 and 10:
Page 11 and 12:
Page 13 and 14:
Page 15 and 16:
Page 17 and 18:
Page 19 and 20:
Page 21 and 22:
Page 23 and 24:
Page 25 and 26: Technion - Computer Science Departm
Page 75: Technion - Computer Science Departm
Page 127 and 128:
Page 129 and 130:
Page 131 and 132:
Page 133 and 134:
Page 135 and 136:
Page 137 and 138:
Page 139 and 140:
Page 141 and 142:
Page 143 and 144:
Page 145 and 146:
Page 147 and 148:
Page 149 and 150:
Page 151 and 152:
Page 153 and 154:
Page 155 and 156:
Page 157 and 158:
Page 159 and 160:
Page 161 and 162:
Page 163 and 164:
Page 165 and 166:
Page 167 and 168:
Page 169 and 170:
Page 171 and 172:
Page 173 and 174:
Page 175 and 176:
Page 177 and 178:
Page 179 and 180:
Page 181 and 182:
Page 183 and 184:
Page 185 and 186:
Page 187 and 188:
Page 189 and 190:
Page 191 and 192:
show all

anytime algorithms for learning anytime classifiers saher ... - Technion

Create successful ePaper yourself

Delete template?

Save as template?