18.11.2012 Views

anytime algorithms for learning anytime classifiers saher ... - Technion

anytime algorithms for learning anytime classifiers saher ... - Technion

anytime algorithms for learning anytime classifiers saher ... - Technion

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Technion</strong> - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008<br />

Average accuracy<br />

Average tree size<br />

100<br />

90<br />

80<br />

70<br />

60<br />

50<br />

220<br />

200<br />

180<br />

160<br />

140<br />

120<br />

100<br />

80<br />

60<br />

40<br />

20<br />

0<br />

LSID3(5)<br />

Skewing(30)<br />

Sequential Skewing<br />

ID3<br />

10 15 20 25 30 35 40 45 50 55 60<br />

Number of attributes (only 4 are relevant)<br />

LSID3(5)<br />

Skewing(30)<br />

Sequential Skewing<br />

ID3<br />

10 15 20 25 30 35 40 45 50 55 60<br />

Number of attributes (only 4 are relevant)<br />

Figure 6.1: The sensitivity of LSID3 and skewing to irrelevant attributes. The<br />

concept is XOR-4 while all the other attributes are irrelevant. The x-axis represents<br />

the total number of attributes.<br />

A similar problem occurs if we fix the number of relevant attributes but increase<br />

the number of irrelevant ones. In that case, the space of possible sub-paths<br />

becomes larger and the probability of creating a good cluster decreases. To test<br />

this hypothesis, we compared the sensitivity of LSID3, skewing, and Sequential<br />

skewing to the number of irrelevant attributes. LSID3 was run with r = 5 while<br />

the skewing <strong>algorithms</strong> were run with their default parameters. The target concept<br />

(XOR-4) and the number of examples (512) were fixed, while the number<br />

of irrelevant attributes ranged from 6 to 56 (thus, the total number of attributes<br />

ranged from 10 to 60). Figure 6.1 displays the results.<br />

The graphs indicate that LSID3 continues to attain high accuracy and produce<br />

almost optimal trees even when the number of irrelevant attributes increases.<br />

The degradation in LSID3 per<strong>for</strong>mance is noticeable only when the number of<br />

123

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!