21.06.2014 Views

Subsampling estimates of the Lasso distribution.

Subsampling estimates of the Lasso distribution.

Subsampling estimates of the Lasso distribution.

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

6.2 High dimensional setting 63<br />

(d) Set <strong>the</strong> adaptive <strong>Lasso</strong> solution to<br />

ˆβ (m)<br />

b<br />

and <strong>the</strong>n set L n,b,m and L n,r,m to<br />

= diag(w1 −1 , . . . , w−1 p n<br />

) ˜β (m)<br />

b<br />

L n,b,m = √ ( )<br />

b ˆβ (m)<br />

b − ˆβ n<br />

and<br />

L n,r,m = √ ( )<br />

r ˆβ (m)<br />

b − ˆβ n<br />

respectively. Here, r = b/(1 − b/n) is <strong>the</strong> finite sample corrected subsample size,<br />

cf. (Politis et al., 1999, Section 10.3.1).<br />

(v) Determine separately for each j ∈ {1, . . . , p n } <strong>the</strong> following empirical quantiles <strong>of</strong><br />

L (j)<br />

n,b,· and L(j) n,r,·, that is, L (j)<br />

n,b,(·)<br />

and L(j)<br />

n,r,(·)<br />

being <strong>the</strong> ordered statistics, set<br />

c (j)<br />

n,b<br />

(1 − α) = L(j)<br />

n,b,(⌊(1−α)·B⌋) ,<br />

c (j)<br />

n,b<br />

(α/2) = L(j)<br />

n,b,(⌊α/2·B⌋) ,<br />

and <strong>the</strong> analogous for L (j)<br />

n,r,(·) .<br />

c (j)<br />

n,b<br />

(1 − α/2) = L(j)<br />

n,b,(⌊(1−α/2)·B⌋) .<br />

(vi) For each j ∈ {1, . . . , p}, define <strong>the</strong> confidence intervals<br />

[<br />

I (j)<br />

1 =<br />

[<br />

I (j)<br />

2 =<br />

[<br />

I (j)<br />

3 =<br />

n − √ 1 c (j)<br />

n<br />

β (j)<br />

β (j)<br />

)<br />

n,r(1 − α), ∞<br />

n − √ 1 c (j)<br />

n<br />

n,r(1 − α/2), β n (j) − √ 1 c (j)<br />

n<br />

β (j)<br />

n<br />

−<br />

[<br />

I (j)<br />

4 = β n<br />

(j) −<br />

1<br />

√ n −<br />

√<br />

b<br />

c (j)<br />

n,b (1 − α), ∞ )<br />

1<br />

√ √ c (j) (1 − α/2), β(j) −<br />

n − b<br />

n,b<br />

n<br />

]<br />

n,r(α/2)<br />

1<br />

√ n −<br />

√<br />

b<br />

c (j)<br />

n,b (α/2) ]<br />

Remark. Note that <strong>the</strong> procedure above does not follow <strong>the</strong> subsampling scheme in <strong>the</strong><br />

strict sense since <strong>the</strong> adaptive <strong>Lasso</strong> weights are not recomputed over subsamples, however<br />

we could note that using weights obtained on <strong>the</strong> whole sample to compute subsamples<br />

<strong>estimates</strong> actually yields better results.<br />

Coverage rates <strong>of</strong> <strong>the</strong> two sided confidence interval I (·)<br />

2 are illustrated in Figure 6.4. First,<br />

we note that <strong>the</strong> results are quite robust to violation <strong>of</strong> <strong>the</strong> partial orthogonality condition.<br />

Then we see that <strong>the</strong> coverage rate for relevant coefficients are slightly below <strong>the</strong> nominal<br />

level and that <strong>the</strong> false positive rates for zero coefficients are conservative. A possible<br />

reason for <strong>the</strong> former is <strong>the</strong> bias introduced by <strong>the</strong> <strong>Lasso</strong>. The conservative false rejection

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!