Relax and Randomize: From Value to Algorithms


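However, this argmin calculation is identical to the one in the proof of Proposition 4 (with $C = 1$ and $T - t = 0$), and the solution is given by

$$
f_t^* = f_t = \frac{-\sum_{i=1}^{t-1} x_i + 4\sqrt{2}\sum_{i=t+1}^{T} \epsilon_i x_i}{\sqrt{\left\| -\sum_{i=1}^{t-1} x_i + 4\sqrt{2}\sum_{i=t+1}^{T} \epsilon_i x_i \right\|_2^2 + 1}} .
$$

Thus we conclude the proof.

Proof of Lemma 11. We start by showing that the relaxation is admissible for the game in which we pick the prediction $\hat y_t$ and the adversary then directly picks the gradient $\partial\ell(\hat y_t, y_t)$. Throughout this proof we use the notation $L_{t-1}(f) = \sum_{i=1}^{t-1} \partial\ell(\hat y_i, y_i) \cdot f[i]$. To this end, note that

$$
\begin{aligned}
&\inf_{\hat y_t} \sup_{\partial\ell(\hat y_t, y_t)} \Big\{ \partial\ell(\hat y_t, y_t) \cdot \hat y_t + \mathrm{Rel}_T\big(\mathcal{F} \mid \partial\ell(\hat y_1, y_1), \ldots, \partial\ell(\hat y_t, y_t)\big) \Big\} \\
&\quad = \inf_{\hat y_t} \sup_{\partial\ell(\hat y_t, y_t)} \Big\{ \partial\ell(\hat y_t, y_t) \cdot \hat y_t + \mathbb{E}_\epsilon \Big[ \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - \sum_{i=1}^{t} \partial\ell(\hat y_i, y_i) \cdot f[i] \Big\} \Big] \Big\} \\
&\quad \le \inf_{\hat y_t} \sup_{r_t \in [-L, L]} \Big\{ r_t \cdot \hat y_t + \mathbb{E}_\epsilon \Big[ \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) - r_t \cdot f[t] \Big\} \Big] \Big\} .
\end{aligned}
$$

The supremum over $r_t \in [-L, L]$ is achieved at the endpoints, since the expression is convex in $r_t$. Therefore, the last expression is equal to

$$
\begin{aligned}
&\inf_{\hat y_t} \sup_{r_t \in \{-L, L\}} \Big\{ r_t \cdot \hat y_t + \mathbb{E}_\epsilon \sup_{f \in \mathcal{F}} \Big[ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) - r_t \cdot f[t] \Big] \Big\} \\
&\quad = \inf_{\hat y_t} \sup_{p_t \in \Delta(\{-L, L\})} \mathbb{E}_{r_t \sim p_t} \Big[ r_t \cdot \hat y_t + \mathbb{E}_\epsilon \sup_{f \in \mathcal{F}} \Big[ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) - r_t \cdot f[t] \Big] \Big] \\
&\quad = \sup_{p_t \in \Delta(\{-L, L\})} \inf_{\hat y_t} \mathbb{E}_{r_t \sim p_t} \Big[ r_t \cdot \hat y_t + \mathbb{E}_\epsilon \sup_{f \in \mathcal{F}} \Big[ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) - r_t \cdot f[t] \Big] \Big] ,
\end{aligned}
$$

where the last step is due to the minimax theorem.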
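The last quantity is equal to

$$
\begin{aligned}
&\sup_{p_t \in \Delta(\{-L, L\})} \mathbb{E}_{r_t \sim p_t} \Big[ \mathbb{E}_\epsilon \Big[ \inf_{\hat y_t} \mathbb{E}_{r_t \sim p_t}[r_t] \cdot \hat y_t + \sup_{f \in \mathcal{F}} \Big( 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) - r_t \cdot f[t] \Big) \Big] \Big] \\
&\quad \le \sup_{p_t \in \Delta(\{-L, L\})} \mathbb{E}_{r_t \sim p_t} \Big[ \mathbb{E}_\epsilon \Big[ \sup_{f \in \mathcal{F}} \Big( 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) + \big( \mathbb{E}_{r_t \sim p_t}[r_t] - r_t \big) \cdot f[t] \Big) \Big] \Big] \\
&\quad \le \sup_{p_t \in \Delta(\{-L, L\})} \mathbb{E}_{r_t, r_t' \sim p_t} \Big[ \mathbb{E}_\epsilon \sup_{f \in \mathcal{F}} \Big[ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) + (r_t' - r_t) \cdot f[t] \Big] \Big] \\
&\quad = \sup_{p_t \in \Delta(\{-L, L\})} \mathbb{E}_{r_t, r_t' \sim p_t} \Big[ \mathbb{E}_\epsilon \sup_{f \in \mathcal{F}} \Big[ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) + \epsilon_t (r_t' - r_t) \cdot f[t] \Big] \Big] .
\end{aligned}
$$

Here the second inequality is Jensen's inequality applied to an independent copy $r_t'$ of $r_t$, and the final equality holds because $r_t' - r_t$ is symmetric, so multiplying it by an independent Rademacher sign $\epsilon_t$ does not change its distribution. By passing to the worst-case choice of $r_t, r_t'$ (which is achieved at the endpoints because of convexity), we obtain a further upper bound

$$
\begin{aligned}
&\sup_{r_t, r_t' \in \{L, -L\}} \mathbb{E}_\epsilon \sup_{f \in \mathcal{F}} \Big[ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) + \epsilon_t (r_t' - r_t) \cdot f[t] \Big] \\
&\quad \le \sup_{r_t \in \{L, -L\}} \mathbb{E}_\epsilon \sup_{f \in \mathcal{F}} \Big[ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) + 2 \epsilon_t r_t \cdot f[t] \Big] \\
&\quad = \sup_{r_t \in \{L, -L\}} \mathbb{E}_\epsilon \sup_{f \in \mathcal{F}} \Big[ 2L \sum_{i=t}^{T} \epsilon_i f[i] - L_{t-1}(f) \Big] \\
&\quad = \mathrm{Rel}_T\big(\mathcal{F} \mid \partial\ell(\hat y_1, y_1), \ldots, \partial\ell(\hat y_{t-1}, y_{t-1})\big) .
\end{aligned}
$$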
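The admissibility inequality derived above can be checked numerically on a small instance. The following is a minimal sketch, not part of the original proof: it assumes a finite class stored as a matrix `F` with entries in $[-1, 1]$ (one row per $f$, one column per round), computes the relaxation exactly by enumerating all sign patterns, and uses the prediction that equalizes the two endpoint values $r_t = \pm L$. The helper names `relaxation` and `admissibility_gap` are hypothetical.

```python
import itertools
import numpy as np

def relaxation(F, grads, L):
    """Exact Rel_T(F | grads) = E_eps sup_f [2L sum_{i>t} eps_i f[i] - sum_{i<=t} grads_i f[i]].

    F is an (n, T) array (assumed finite class); grads holds the first t gradients.
    The expectation is computed by enumerating all 2^(T-t) sign patterns.
    """
    F = np.asarray(F, dtype=float)
    grads = np.asarray(grads, dtype=float)
    t, T = grads.shape[0], F.shape[1]
    past = F[:, :t] @ grads                      # linearized cumulative loss per f
    sign_patterns = list(itertools.product([-1.0, 1.0], repeat=T - t))
    total = 0.0
    for eps in sign_patterns:
        future = F[:, t:] @ np.array(eps, dtype=float)
        total += np.max(2.0 * L * future - past)
    return total / len(sign_patterns)

def admissibility_gap(F, past_grads, L):
    """Return (y_hat, lhs, rhs) for one round of the admissibility condition."""
    rel_plus = relaxation(F, list(past_grads) + [L], L)    # adversary plays r_t = +L
    rel_minus = relaxation(F, list(past_grads) + [-L], L)  # adversary plays r_t = -L
    # Prediction equalizing the two endpoint values of r_t * y + Rel(... , r_t).
    y_hat = (rel_minus - rel_plus) / (2.0 * L)
    lhs = max(L * y_hat + rel_plus, -L * y_hat + rel_minus)
    rhs = relaxation(F, past_grads, L)
    return y_hat, lhs, rhs

rng = np.random.default_rng(0)
F = rng.uniform(-1.0, 1.0, size=(6, 5))           # 6 hypotheses, T = 5 rounds
y_hat, lhs, rhs = admissibility_gap(F, [0.4, -0.7], L=1.0)
print(y_hat, lhs, rhs)
```

On such an instance `lhs` should not exceed `rhs`, up to floating-point error. Only the endpoints $r_t = \pm L$ are evaluated, which relies on the convexity in $r_t$ noted in the proof.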
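Thus we see that the relaxation is admissible. Now the corresponding prediction is given by

$$
\begin{aligned}
\hat y_t &= \operatorname*{argmin}_{\hat y} \sup_{r_t \in [-L, L]} \Big\{ r_t \hat y + \mathbb{E}_\epsilon \Big[ \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - \sum_{i=1}^{t-1} \partial\ell(\hat y_i, y_i) f[i] - r_t f[t] \Big\} \Big] \Big\} \\
&= \operatorname*{argmin}_{\hat y} \sup_{r_t \in \{-L, L\}} \Big\{ r_t \hat y + \mathbb{E}_\epsilon \Big[ \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - \sum_{i=1}^{t-1} \partial\ell(\hat y_i, y_i) f[i] - r_t f[t] \Big\} \Big] \Big\} .
\end{aligned}
$$

The last step holds because the term inside the supremum over $r_t$ is convex in $r_t$, so the supremum is attained at the endpoints of the interval. The minimum over $\hat y$ is attained when the two terms of the supremum are equalized, that is, $\hat y_t$ is the prediction that satisfies

$$
\hat y_t = \mathbb{E}_\epsilon \Big[ \sup_{f \in \mathcal{F}} \Big\{ \sum_{i=t+1}^{T} \epsilon_i f[i] - \frac{1}{2L} \sum_{i=1}^{t-1} \partial\ell(\hat y_i, y_i) f[i] + \frac{1}{2} f[t] \Big\} - \sup_{f \in \mathcal{F}} \Big\{ \sum_{i=t+1}^{T} \epsilon_i f[i] - \frac{1}{2L} \sum_{i=1}^{t-1} \partial\ell(\hat y_i, y_i) f[i] - \frac{1}{2} f[t] \Big\} \Big] .
$$

Finally, since the relaxation is admissible, we can conclude that the regret of the algorithm is bounded as

$$
\mathrm{Reg}_T \le \mathrm{Rel}_T(\mathcal{F}) = 2L \, \mathbb{E}_\epsilon \Big[ \sup_{f \in \mathcal{F}} \sum_{t=1}^{T} \epsilon_t f[t] \Big] .
$$

This concludes the proof.

Proof of Lemma 12. The proof is similar to that of Lemma 11, with a few more twists. We want to establish admissibility of the relaxation given in (21) with respect to the randomized strategy $q_t$ we provided. To this end, note that

$$
\begin{aligned}
&\sup_{y_t} \Big\{ \mathbb{E}_{\hat y_t \sim q_t} [\ell(\hat y_t, y_t)] + \mathbb{E}_\epsilon \Big[ \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_t(f) \Big\} \Big] \Big\} \\
&\quad = \sup_{y_t} \Big\{ \mathbb{E}_\epsilon [\ell(\hat y_t(\epsilon), y_t)] + \mathbb{E}_\epsilon \Big[ \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_t(f) \Big\} \Big] \Big\} \\
&\quad \le \mathbb{E}_\epsilon \Big[ \sup_{y_t} \Big\{ \ell(\hat y_t(\epsilon), y_t) + \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_t(f) \Big\} \Big\} \Big]
\end{aligned}
$$

by Jensen's inequality, with the usual notation $L_t(f) = \sum_{i=1}^{t} \ell(f[i], y_i)$. Further, by convexity of the loss, we may pass to the upper bound

$$
\begin{aligned}
&\mathbb{E}_\epsilon \Big[ \sup_{y_t} \Big\{ \partial\ell(\hat y_t(\epsilon), y_t) \, \hat y_t(\epsilon) + \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) - \partial\ell(\hat y_t(\epsilon), y_t) f[t] \Big\} \Big\} \Big] \\
&\quad \le \mathbb{E}_\epsilon \Big[ \sup_{y_t} \Big\{ \mathbb{E}_{r_t} [r_t \cdot \hat y_t(\epsilon)] + \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) - \mathbb{E}_{r_t} [r_t \cdot f[t]] \Big\} \Big\} \Big] ,
\end{aligned}
$$

where $r_t$ is a $\{\pm L\}$-valued random variable with mean $\partial\ell(\hat y_t(\epsilon), y_t)$; we write $r_t \sim \partial\ell(\hat y_t(\epsilon), y_t)$ for this distribution.

With the help of Jensen's inequality, and passing to the worst-case $r_t$ (observe that this is legal for any given $\epsilon$), we have an upper bound

$$
\begin{aligned}
&\mathbb{E}_\epsilon \Big[ \sup_{y_t} \Big\{ \mathbb{E}_{r_t \sim \partial\ell(\hat y_t(\epsilon), y_t)} [r_t \cdot \hat y_t(\epsilon)] + \mathbb{E}_{r_t \sim \partial\ell(\hat y_t(\epsilon), y_t)} \Big[ \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) - r_t \cdot f[t] \Big\} \Big] \Big\} \Big] \\
&\quad \le \mathbb{E}_\epsilon \Big[ \sup_{r_t \in \{\pm L\}} \Big\{ r_t \cdot \hat y_t(\epsilon) + \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - L_{t-1}(f) - r_t \cdot f[t] \Big\} \Big\} \Big] . \qquad (44)
\end{aligned}
$$

Now the strategy we defined is

$$
\hat y_t(\epsilon) = \operatorname*{argmin}_{\hat y_t} \sup_{r_t \in \{\pm L\}} \Big\{ r_t \cdot \hat y_t + \sup_{f \in \mathcal{F}} \Big\{ 2L \sum_{i=t+1}^{T} \epsilon_i f[i] - \sum_{i=1}^{t-1} \ell(f[i], y_i) - r_t \cdot f[t] \Big\} \Big\}
$$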
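For a finite class, the minimization that defines $\hat y_t(\epsilon)$ reduces to equalizing the two branches $r_t = L$ and $r_t = -L$, in the same way as in the proof of Lemma 11. The sketch below illustrates this under stated assumptions: the class is a hypothetical matrix `F` (rows are hypotheses, columns are rounds), `cum_loss` holds $\sum_{i=1}^{t-1} \ell(f[i], y_i)$ for each row, and the minimization over $\hat y_t$ is taken to be unconstrained. The function names are illustrative and do not come from the paper.

```python
import numpy as np

def prediction_given_signs(F, cum_loss, t, L, eps):
    """y_hat_t(eps) for a finite class, for one fixed draw of the future signs.

    F: (n, T) array; cum_loss: (n,) past cumulative losses; t: current round (1-indexed);
    eps: (T - t,) array of +/-1 signs for rounds t+1, ..., T.
    """
    F = np.asarray(F, dtype=float)
    base = 2.0 * L * (F[:, t:] @ np.asarray(eps, dtype=float)) - np.asarray(cum_loss, dtype=float)
    f_t = F[:, t - 1]                      # values f[t] of every hypothesis at round t
    s_minus = np.max(base - L * f_t)       # inner supremum when r_t = +L
    s_plus = np.max(base + L * f_t)        # inner supremum when r_t = -L
    # Minimizer of max(L*y + s_minus, -L*y + s_plus): the point where both are equal.
    return (s_plus - s_minus) / (2.0 * L)

def randomized_prediction(F, cum_loss, t, L, rng):
    """Draw Rademacher signs for the future rounds and return y_hat_t(eps)."""
    T = np.asarray(F).shape[1]
    eps = rng.choice([-1.0, 1.0], size=T - t)
    return prediction_given_signs(F, cum_loss, t, L, eps)

F = np.array([[1.0, 0.5],
              [-1.0, 0.2]])
y_fixed = prediction_given_signs(F, cum_loss=[0.0, 0.0], t=1, L=1.0, eps=[1.0])
y_random = randomized_prediction(F, [0.0, 0.0], 1, 1.0, np.random.default_rng(0))
print(y_fixed, y_random)
```

Each call to `randomized_prediction` costs one pass over the class, whereas the deterministic prediction of Lemma 11 takes an expectation over all sign sequences.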
