21.06.2014 Views

Subsampling estimates of the Lasso distribution.

Subsampling estimates of the Lasso distribution.

Subsampling estimates of the Lasso distribution.

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

62 Numerical results<br />

• C i,j = 0.8 |i−j| , i, j = 1, . . . , 800 in model E’.<br />

Models D and E respect <strong>the</strong> partial orthogonality condition between relevant and irrelevant<br />

parameters while D’ and E’ violate it.<br />

Estimation procedure<br />

For a sample (Y i , x i ) n i=1 <strong>of</strong> simulated observations, we proceed as follows to construct<br />

confidence intervals for <strong>the</strong> coefficinents:<br />

(i) Center and scale <strong>the</strong> observations, that is, consider<br />

Ỹ i = Y i − n −1<br />

⎛<br />

n ∑<br />

˜x ij = ⎝x ij − n −1<br />

Y l , i = 1, . . . , n<br />

l=1<br />

⎞<br />

∑ n (<br />

⎠ n −1<br />

n ∑<br />

(x lj −<br />

lj<br />

l=1 k=1<br />

)<br />

n∑ −1<br />

x kj ) 2 , j = 1, . . . , p n , i = 1, . . . , n.<br />

(ii) Set weights w j as<br />

w j = n −1<br />

n ∑<br />

i=1<br />

Ỹ i˜x ij , j = 1, . . . , p n .<br />

(iii) Compute <strong>the</strong> <strong>Lasso</strong> path for <strong>the</strong> data set with scaled covariates, that is, for<br />

(Ỹi, diag(w 1 , . . . , w pn )˜x i ) n i=1.<br />

Then choose <strong>the</strong> penalization parameter by K-fold cross validation (we choose K=10)<br />

based on (Ỹi, diag(w 1 , . . . , w pn )x i ) n i=1 . Denote it by λ n,CV . Let ˜β n be <strong>the</strong> <strong>Lasso</strong> solution<br />

corresponding to (Ỹi, diag(w 1 , . . . , w pn )x i ) n i=1 and to <strong>the</strong> penalization parameter<br />

λ n,CV . Finally, set <strong>the</strong> adaptive <strong>Lasso</strong> solution to<br />

ˆβ n = diag(w −1<br />

1 , . . . , w−1 p n<br />

) ˜β n .<br />

(iv) Repeat <strong>the</strong> folowing steps for m = 1, . . . , B:<br />

(a) Generate a random subsample I m ⊂ {1, . . . , n} <strong>of</strong> size b by drawing without<br />

replacement.<br />

(b) Center and scale observations with index i ∈ I m to obtain a data set (Ỹ (m)<br />

i , x (m)<br />

i ) i∈Im .<br />

(m)<br />

(c) Compute <strong>the</strong> <strong>Lasso</strong> solution path for <strong>the</strong> data set (Ỹi , diag(w1 , . . . , w pn )x (m)<br />

i<br />

with covariates scaled by weights obtained in step ii. Let ˜β (m)<br />

b be <strong>the</strong> <strong>Lasso</strong> solution<br />

corresponding to (Ỹi , diag(w1 , . . . , w pn )x (m)<br />

i ) i∈Im and to <strong>the</strong> rescaled<br />

(m)<br />

penalization parameter λ b,CV = λ n,CV (b/n) 0.4 .<br />

) i∈Im

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!