18.11.2012 Views

anytime algorithms for learning anytime classifiers saher ... - Technion

anytime algorithms for learning anytime classifiers saher ... - Technion

anytime algorithms for learning anytime classifiers saher ... - Technion

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Technion</strong> - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008<br />

Tree size<br />

Average Accuracy<br />

4500<br />

4000<br />

3500<br />

3000<br />

2500<br />

2000<br />

1500<br />

1000<br />

500<br />

(3)<br />

0<br />

0 200 400 600 800 1000 1200<br />

0.9<br />

0.85<br />

0.8<br />

0.75<br />

0.7<br />

0.65<br />

0.6<br />

0.55<br />

0.5<br />

(1) I-IIDT-TS(Et)<br />

(2) I-IIDT-EE(Et)<br />

(3) C4.5(Et)<br />

(4) ID3(Et)<br />

(5) IIDT-TS(E0)<br />

Time [sec]<br />

(1) I-IIDT-TS(Et)<br />

(2) I-IIDT-EE(Et)<br />

(3) C4.5(Et)<br />

(4) ID3(Et)<br />

(5) IIDT-TS(E0)<br />

(1)<br />

(4)<br />

(5)<br />

(2)<br />

0.45<br />

0 200 400 600 800 1000 1200<br />

Time [sec]<br />

Figure 3.32: Learning the XOR-10 problem incrementally<br />

Overview of the Compared Methods<br />

Page and Ray (2003) introduced skewing as an alternative to lookahead <strong>for</strong> addressing<br />

problematic concepts such as parity functions. At each node, the algorithm<br />

skews the set of examples and produces several versions of it, each with<br />

different weights <strong>for</strong> the instances. The algorithm chooses to split on the attribute<br />

that exceeds a pre-set gain threshold <strong>for</strong> the greatest number of weightings. Skewing<br />

was reported to per<strong>for</strong>m well on hard concepts such as XOR-n, mainly when<br />

the dataset is large enough relative to the number of attributes. Skewing can be<br />

viewed as a contract algorithm parameterized by w, the number of weightings.<br />

In order to convert skewing into an interruptible algorithm, we apply the general<br />

conversion method described in Section 3.6.1.<br />

Two improvements to the original skewing algorithm were presented, namely<br />

sequential skewing (Page & Ray, 2004), which skews one variable at a time instead<br />

of all of them simultaneously, and generalized skewing (Ray & Page, 2005),<br />

64<br />

(1)<br />

(2)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!