18.11.2012 Views

anytime algorithms for learning anytime classifiers saher ... - Technion

anytime algorithms for learning anytime classifiers saher ... - Technion

anytime algorithms for learning anytime classifiers saher ... - Technion

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Technion</strong> - Computer Science Department - Ph.D. Thesis PHD-2008-12 - 2008<br />

Procedure TATA-Choose-Attribute(E, A, r, ρ c )<br />

If r = 0<br />

Return C4.5$-Choose-Attribute(E, A, ρ c )<br />

Foreach θ ∈ {θ ∈ A|cost(θ) < ρ c }<br />

V ← Outcomes(θ)<br />

Foreach vi ∈ V<br />

Ei ← {e ∈ E | θ(e) = vi}<br />

T ← C4.5$(Ei, A, ρ c − cost(θ))<br />

mini ← ExpectedMC(T)<br />

Repeat r − 1 times<br />

T ← Stochastic-C4.5$(Ei, A, ρ c − cost(θ))<br />

mini ← min (mini,ExpectedMC(T))<br />

totalθ ← � |V |<br />

i=1 mini<br />

Return θ <strong>for</strong> which totalθ is minimal<br />

Figure 5.4: Attribute selection in pre-contract-TATA. ExpectedMC(T) returns<br />

the expected misclassification cost of T.<br />

by the reconstruction process.<br />

5.2 Contract: When Allocation is Made Right<br />

Be<strong>for</strong>e Classification<br />

The pre-contract classification scenario assumes that ρ c , the bound on testing<br />

costs, is known to the learner. In many real-life scenarios, however, we do not<br />

know ρ c be<strong>for</strong>e building the model and there<strong>for</strong>e we need <strong>classifiers</strong> that either<br />

get ρ c as a parameter be<strong>for</strong>e proceeding with classification (contract classification)<br />

or can do their best until stopped and queried <strong>for</strong> a decision (interruptible<br />

classification). Note that TDIDT$-based <strong>algorithms</strong> cannot be used as is because<br />

ρ c is unavailable at the time of <strong>learning</strong>. Obviously, C4.5, EG2, and ACT, can<br />

be slightly modified, by storing default classifications at each internal node, to<br />

produce contract and interruptible trees because they do not need the value of ρ c .<br />

However, they are not designed to exploit a given testing budget. There<strong>for</strong>e, we<br />

are looking <strong>for</strong> a learner that has the advantages of pre-contract-TATA without<br />

getting the value of ρ c as parameter.<br />

105

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!