18.11.2012 Views

anytime algorithms for learning anytime classifiers saher ... - Technion

anytime algorithms for learning anytime classifiers saher ... - Technion

anytime algorithms for learning anytime classifiers saher ... - Technion

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Technion</strong> - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008<br />

Table 4.1: Characteristics of the datasets used to evaluate ACT<br />

Attributes Max<br />

Dataset Instances Nom. (bin.) Num. domain Classes<br />

Breast Cancer 277 9 (3) 0 13 2<br />

Bupa 345 0 (0) 5 - 2<br />

Car 1728 6 (0) 0 4 4<br />

Flare 323 10 (5) 0 7 4<br />

Glass 214 0 (0) 9 - 7<br />

Heart 296 8(4) 5 4 2<br />

Hepatitis 154 13(13) 6 2 2<br />

Iris 150 0 (0) 4 - 3<br />

KRK 28056 6(0) 0 8 17<br />

Monks-1 124+432 6 (2) 0 4 2<br />

Monks-2 169+432 6 (2) 0 4 2<br />

Monks-3 122+432 6 (2) 0 4 2<br />

Multiplexer-20 615 20 (20) 0 2 2<br />

Multi-XOR 200 11 (11) 0 2 2<br />

Multi-AND-OR 200 11 (11) 0 2 2<br />

Nursery 8703 8(8) 0 5 5<br />

Pima 768 0(0) 8 - 2<br />

TAE 151 4(1) 1 26 3<br />

Tic-Tac-Toe 958 9 (0) 0 3 2<br />

Titanic 2201 3(2) 0 4 2<br />

Thyroid 3772 15(15) 5 2 3<br />

Voting 232 16 (16) 0 2 2<br />

Wine 178 0 (0) 13 - 3<br />

XOR 3D 200 0 (0) 6 - 2<br />

XOR-5 200 10 (10) 0 2 2<br />

Cost-insensitive <strong>learning</strong> <strong>algorithms</strong> focus on accuracy and there<strong>for</strong>e are expected<br />

to per<strong>for</strong>m well when testing costs are negligible relative to misclassification<br />

costs. However, when testing costs are significant, ignoring them<br />

would result in expensive <strong>classifiers</strong>. There<strong>for</strong>e, evaluating cost-sensitive learners<br />

requires a wide spectrum of misclassification costs. For each problem out<br />

of the 105, we created 5 instances, with uni<strong>for</strong>m misclassification costs mc =<br />

100, 500, 1000, 5000, 10000. Later on, we also consider nonuni<strong>for</strong>m misclassification<br />

costs.<br />

4.6.2 Methodology<br />

We start our experimental evaluation by comparing ACT, given a fixed resource<br />

allocation, with several other cost-sensitive and cost-insensitive <strong>algorithms</strong>. Next<br />

we examine the <strong>anytime</strong> behavior of ACT, and finally, we evaluate the <strong>algorithms</strong><br />

with two modifications on the problem instances: random test cost assignment<br />

and nonuni<strong>for</strong>m misclassification costs.<br />

82

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!