anytime algorithms for learning anytime classifiers saher ... - Technion
anytime algorithms for learning anytime classifiers saher ... - Technion
anytime algorithms for learning anytime classifiers saher ... - Technion
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>Technion</strong> - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008<br />
Average accuracy<br />
Average tree size<br />
100<br />
90<br />
80<br />
70<br />
60<br />
50<br />
220<br />
200<br />
180<br />
160<br />
140<br />
120<br />
100<br />
80<br />
60<br />
40<br />
20<br />
0<br />
LSID3(5)<br />
Skewing(30)<br />
Sequential Skewing<br />
ID3<br />
10 15 20 25 30 35 40 45 50 55 60<br />
Number of attributes (only 4 are relevant)<br />
LSID3(5)<br />
Skewing(30)<br />
Sequential Skewing<br />
ID3<br />
10 15 20 25 30 35 40 45 50 55 60<br />
Number of attributes (only 4 are relevant)<br />
Figure 6.1: The sensitivity of LSID3 and skewing to irrelevant attributes. The<br />
concept is XOR-4 while all the other attributes are irrelevant. The x-axis represents<br />
the total number of attributes.<br />
A similar problem occurs if we fix the number of relevant attributes but increase<br />
the number of irrelevant ones. In that case, the space of possible sub-paths<br />
becomes larger and the probability of creating a good cluster decreases. To test<br />
this hypothesis, we compared the sensitivity of LSID3, skewing, and Sequential<br />
skewing to the number of irrelevant attributes. LSID3 was run with r = 5 while<br />
the skewing <strong>algorithms</strong> were run with their default parameters. The target concept<br />
(XOR-4) and the number of examples (512) were fixed, while the number<br />
of irrelevant attributes ranged from 6 to 56 (thus, the total number of attributes<br />
ranged from 10 to 60). Figure 6.1 displays the results.<br />
The graphs indicate that LSID3 continues to attain high accuracy and produce<br />
almost optimal trees even when the number of irrelevant attributes increases.<br />
The degradation in LSID3 per<strong>for</strong>mance is noticeable only when the number of<br />
123