08.10.2016 Views

Foundations of Data Science

2dLYwbK

2dLYwbK

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

√<br />

L log(L(T ))<br />

|S|) will be at most err S (h L ) + O( ), where h<br />

|S|<br />

L = argmin h:L(h)=L err S (h)<br />

is the pruning <strong>of</strong> T with L leaves <strong>of</strong> lowest error on S.<br />

3. In the above question, we assumed L was given. Explain √how we can ] remove this assumption<br />

and achieve a bound <strong>of</strong> min L<br />

[err S (h L ) + O( L log(L(T ))<br />

by instantiating<br />

|S|<br />

)<br />

L(T ) copies <strong>of</strong> the above algorithm (one for each value <strong>of</strong> L) and then combining<br />

these algorithms using the experts algorithm (in this case, none <strong>of</strong> them will be<br />

sleeping).<br />

Exercise 6.9 Kernels; (Section 6.6) Prove Theorem 6.10.<br />

Exercise 6.10 What is the VC-dimension <strong>of</strong> right corners with axis aligned edges that<br />

are oriented with one edge going to the right and the other edge going up?<br />

Exercise 6.11 (VC-dimension; Section 6.9) What is the VC-dimension V <strong>of</strong> the<br />

class H <strong>of</strong> axis-parallel boxes in R d ? That is, H = {h a,b : a, b ∈ R d } where h a,b (x) = 1<br />

if a i ≤ x i ≤ b i for all i = 1, . . . , d and h a,b (x) = −1 otherwise.<br />

1. Prove that the VC-dimension is at least your chosen V by giving a set <strong>of</strong> V points<br />

that is shattered by the class (and explaining why it is shattered).<br />

2. Prove that the VC-dimension is at most your chosen V by proving that no set <strong>of</strong><br />

V + 1 points can be shattered.<br />

Exercise 6.12 (VC-dimension, Perceptron, and Margins; Sections 6.5.3, 6.9)<br />

Say that a set <strong>of</strong> points S is shattered by linear separators <strong>of</strong> margin γ if every labeling<br />

<strong>of</strong> the points in S is achievable by a linear separator <strong>of</strong> margin at least γ. Prove that no<br />

set <strong>of</strong> 1/γ 2 + 1 points in the unit ball is shattered by linear separators <strong>of</strong> margin γ.<br />

Hint: think about the Perceptron algorithm and try a pro<strong>of</strong> by contradiction.<br />

Exercise 6.13 (Linear separators) Suppose the instance space X is {0, 1} d and consider<br />

the target function c ∗ that labels an example x as positive if the least index i for<br />

which x i = 1 is odd, else labels x as negative. In other words, c ∗ (x) = “if x 1 = 1 then<br />

positive else if x 2 = 1 then negative else if x 3 = 1 then positive else ... else negative”.<br />

Show that the rule can be represented by a linear threshold function.<br />

Exercise 6.14 (Linear separators; harder) Prove that for the problem <strong>of</strong> Exercise<br />

6.13, we cannot have a linear separator with margin at least 1/f(d) where f(d) is bounded<br />

above by a polynomial function <strong>of</strong> d.<br />

Exercise 6.15 VC-dimension Prove that the VC-dimension <strong>of</strong> circles in the plane is<br />

three.<br />

Exercise 6.16 VC-dimension Show that the VC-dimension <strong>of</strong> arbitrary right triangles<br />

in the plane is seven.<br />

235

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!