Subsampling estimates of the Lasso distribution.
Subsampling estimates of the Lasso distribution.
Subsampling estimates of the Lasso distribution.
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
2.1 Convergence in probability 5<br />
for every x ∈ C, where f is a real (random) variable on C. Fur<strong>the</strong>r, assume that f is<br />
uniquely minimized at α ∈ C and let {α n } n be a sequence <strong>of</strong> minimizers <strong>of</strong> {f n } n , i.e.<br />
α n ∈ arg min<br />
x∈C<br />
f n (·), n ∈ N.<br />
Then,<br />
α n → P α.<br />
Pro<strong>of</strong>. The pro<strong>of</strong> rests on (Hjort and Pollard, 1993, Lemma 2) and subsequent remarks.<br />
For arbitrary δ > 0, define<br />
and<br />
∆ n (δ) =<br />
h(δ) =<br />
sup |f n (x) − f(x)|<br />
‖x−α‖≤δ<br />
inf f n(x) − f n (α)<br />
‖x−α‖=δ<br />
First, we show that<br />
P (‖α n − α‖ ≥ δ) ≤ P<br />
(∆ n (δ) ≥ 1 2 h(δ) )<br />
. (2.1.0.2)<br />
For every x ∈ R p with ‖x − α‖ > δ <strong>the</strong>re exist some c with ‖c − α‖ = δ and t ∈ [0, 1] such<br />
that c = tx + (1 − t)α. By convexity <strong>of</strong> <strong>the</strong> functions f n , we have<br />
f n (c) ≤ tf n (x) + (1 − t)f n (α).<br />
for every n ∈ N. Writing r n (x) = f(x) − f n (x), we obtain<br />
t(f n (x) − f n (α)) ≥ f n (c) − f n (α)<br />
= f(c) − f(α) + r n (c) − r n (α)<br />
≥ h(δ) − 2∆ n (δ)<br />
In particular,<br />
{∆ n (δ) < 1 }<br />
2 h(δ) ⊆ {f n (x) > f n (α)}<br />
for every x with ‖x − α‖ > δ and every n ∈ N. Since α minimizes f n , this implies that<br />
{‖α n − α‖} ⊆<br />
{∆ n (δ) ≥ 1 }<br />
2 h(δ)<br />
and 2.1.0.2 follows.<br />
Next, we show that<br />
∆ n (δ) = o P (1) (2.1.0.3)