21.06.2014 Views

Subsampling estimates of the Lasso distribution.

Subsampling estimates of the Lasso distribution.

Subsampling estimates of the Lasso distribution.

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

2.1 Convergence in probability 5<br />

for every x ∈ C, where f is a real (random) variable on C. Fur<strong>the</strong>r, assume that f is<br />

uniquely minimized at α ∈ C and let {α n } n be a sequence <strong>of</strong> minimizers <strong>of</strong> {f n } n , i.e.<br />

α n ∈ arg min<br />

x∈C<br />

f n (·), n ∈ N.<br />

Then,<br />

α n → P α.<br />

Pro<strong>of</strong>. The pro<strong>of</strong> rests on (Hjort and Pollard, 1993, Lemma 2) and subsequent remarks.<br />

For arbitrary δ > 0, define<br />

and<br />

∆ n (δ) =<br />

h(δ) =<br />

sup |f n (x) − f(x)|<br />

‖x−α‖≤δ<br />

inf f n(x) − f n (α)<br />

‖x−α‖=δ<br />

First, we show that<br />

P (‖α n − α‖ ≥ δ) ≤ P<br />

(∆ n (δ) ≥ 1 2 h(δ) )<br />

. (2.1.0.2)<br />

For every x ∈ R p with ‖x − α‖ > δ <strong>the</strong>re exist some c with ‖c − α‖ = δ and t ∈ [0, 1] such<br />

that c = tx + (1 − t)α. By convexity <strong>of</strong> <strong>the</strong> functions f n , we have<br />

f n (c) ≤ tf n (x) + (1 − t)f n (α).<br />

for every n ∈ N. Writing r n (x) = f(x) − f n (x), we obtain<br />

t(f n (x) − f n (α)) ≥ f n (c) − f n (α)<br />

= f(c) − f(α) + r n (c) − r n (α)<br />

≥ h(δ) − 2∆ n (δ)<br />

In particular,<br />

{∆ n (δ) < 1 }<br />

2 h(δ) ⊆ {f n (x) > f n (α)}<br />

for every x with ‖x − α‖ > δ and every n ∈ N. Since α minimizes f n , this implies that<br />

{‖α n − α‖} ⊆<br />

{∆ n (δ) ≥ 1 }<br />

2 h(δ)<br />

and 2.1.0.2 follows.<br />

Next, we show that<br />

∆ n (δ) = o P (1) (2.1.0.3)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!