anytime algorithms for learning anytime classifiers saher ... - Technion
anytime algorithms for learning anytime classifiers saher ... - Technion
anytime algorithms for learning anytime classifiers saher ... - Technion
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>Technion</strong> - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008<br />
Tree size<br />
Average Accuracy<br />
4500<br />
4000<br />
3500<br />
3000<br />
2500<br />
2000<br />
1500<br />
1000<br />
500<br />
(3)<br />
0<br />
0 200 400 600 800 1000 1200<br />
0.9<br />
0.85<br />
0.8<br />
0.75<br />
0.7<br />
0.65<br />
0.6<br />
0.55<br />
0.5<br />
(1) I-IIDT-TS(Et)<br />
(2) I-IIDT-EE(Et)<br />
(3) C4.5(Et)<br />
(4) ID3(Et)<br />
(5) IIDT-TS(E0)<br />
Time [sec]<br />
(1) I-IIDT-TS(Et)<br />
(2) I-IIDT-EE(Et)<br />
(3) C4.5(Et)<br />
(4) ID3(Et)<br />
(5) IIDT-TS(E0)<br />
(1)<br />
(4)<br />
(5)<br />
(2)<br />
0.45<br />
0 200 400 600 800 1000 1200<br />
Time [sec]<br />
Figure 3.32: Learning the XOR-10 problem incrementally<br />
Overview of the Compared Methods<br />
Page and Ray (2003) introduced skewing as an alternative to lookahead <strong>for</strong> addressing<br />
problematic concepts such as parity functions. At each node, the algorithm<br />
skews the set of examples and produces several versions of it, each with<br />
different weights <strong>for</strong> the instances. The algorithm chooses to split on the attribute<br />
that exceeds a pre-set gain threshold <strong>for</strong> the greatest number of weightings. Skewing<br />
was reported to per<strong>for</strong>m well on hard concepts such as XOR-n, mainly when<br />
the dataset is large enough relative to the number of attributes. Skewing can be<br />
viewed as a contract algorithm parameterized by w, the number of weightings.<br />
In order to convert skewing into an interruptible algorithm, we apply the general<br />
conversion method described in Section 3.6.1.<br />
Two improvements to the original skewing algorithm were presented, namely<br />
sequential skewing (Page & Ray, 2004), which skews one variable at a time instead<br />
of all of them simultaneously, and generalized skewing (Ray & Page, 2005),<br />
64<br />
(1)<br />
(2)