Relax and Randomize: From Value to Algorithms


A Proofs

Proof of Proposition 1. By definition,

$$\sum_{t=1}^{T} \mathbb{E}_{f_t \sim q_t}\, l(f_t, x_t) - \inf_{f \in \mathcal{F}} \sum_{t=1}^{T} l(f, x_t) \le \sum_{t=1}^{T} \mathbb{E}_{f_t \sim q_t}\, l(f_t, x_t) + \mathrm{Rel}_T(\mathcal{F} \mid x_1, \ldots, x_T).$$

Peeling off the $T$-th expected loss, we have

$$\begin{aligned}
\sum_{t=1}^{T} \mathbb{E}_{f_t \sim q_t}\, l(f_t, x_t) + \mathrm{Rel}_T(\mathcal{F} \mid x_1, \ldots, x_T)
&\le \sum_{t=1}^{T-1} \mathbb{E}_{f_t \sim q_t}\, l(f_t, x_t) + \left\{ \mathbb{E}_{f_T \sim q_T}\, l(f_T, x_T) + \mathrm{Rel}_T(\mathcal{F} \mid x_1, \ldots, x_T) \right\} \\
&\le \sum_{t=1}^{T-1} \mathbb{E}_{f_t \sim q_t}\, l(f_t, x_t) + \mathrm{Rel}_T(\mathcal{F} \mid x_1, \ldots, x_{T-1}),
\end{aligned}$$

where we used the fact that $q_T$ is an admissible algorithm for this relaxation, and thus the last inequality holds for any choice $x_T$ of the opponent. Repeating the process, we obtain

$$\sum_{t=1}^{T} \mathbb{E}_{f_t \sim q_t}\, l(f_t, x_t) - \inf_{f \in \mathcal{F}} \sum_{t=1}^{T} l(f, x_t) \le \mathrm{Rel}_T(\mathcal{F}).$$

We remark that the left-hand side of this inequality is random, while the right-hand side is not. Since the inequality holds for any realization of the process, it also holds in expectation. The inequality $\mathcal{V}_T(\mathcal{F}) \le \mathrm{Rel}_T(\mathcal{F})$ holds by unwinding the value recursively and using admissibility of the relaxation. The high-probability bound is an immediate consequence of (6) and the Hoeffding-Azuma inequality for bounded martingales. The last statement is immediate.

Proof of Proposition 2. Denote $L_t(f) = \sum_{s=1}^{t} l(f, x_s)$.
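The first step of the proof is an application of the minimax theorem (we assume the necessary conditions hold):

$$\begin{aligned}
&\inf_{q_t \in \Delta(\mathcal{F})} \sup_{x_t \in \mathcal{X}} \left\{ \mathbb{E}_{f_t \sim q_t}[l(f_t, x_t)] + \sup_{\mathbf{x}} \mathbb{E}_{\epsilon_{t+1:T}} \sup_{f \in \mathcal{F}} \left[ 2 \sum_{s=t+1}^{T} \epsilon_s\, l(f, \mathbf{x}_{s-t}(\epsilon_{t+1:s-1})) - L_t(f) \right] \right\} \\
&\quad = \sup_{p_t \in \Delta(\mathcal{X})} \inf_{f_t \in \mathcal{F}} \left\{ \mathbb{E}_{x_t \sim p_t}[l(f_t, x_t)] + \mathbb{E}_{x_t \sim p_t} \sup_{\mathbf{x}} \mathbb{E}_{\epsilon_{t+1:T}} \sup_{f \in \mathcal{F}} \left[ 2 \sum_{s=t+1}^{T} \epsilon_s\, l(f, \mathbf{x}_{s-t}(\epsilon_{t+1:s-1})) - L_t(f) \right] \right\}.
\end{aligned}$$

For any $p_t \in \Delta(\mathcal{X})$, the infimum over $f_t$ of the above expression is equal to

$$\begin{aligned}
&\mathbb{E}_{x_t \sim p_t} \sup_{\mathbf{x}} \mathbb{E}_{\epsilon_{t+1:T}} \sup_{f \in \mathcal{F}} \left[ 2 \sum_{s=t+1}^{T} \epsilon_s\, l(f, \mathbf{x}_{s-t}(\epsilon_{t+1:s-1})) - L_{t-1}(f) + \inf_{f_t \in \mathcal{F}} \mathbb{E}_{x_t \sim p_t}[l(f_t, x_t)] - l(f, x_t) \right] \\
&\quad \le \mathbb{E}_{x_t \sim p_t} \sup_{\mathbf{x}} \mathbb{E}_{\epsilon_{t+1:T}} \sup_{f \in \mathcal{F}} \left[ 2 \sum_{s=t+1}^{T} \epsilon_s\, l(f, \mathbf{x}_{s-t}(\epsilon_{t+1:s-1})) - L_{t-1}(f) + \mathbb{E}_{x_t \sim p_t}[l(f, x_t)] - l(f, x_t) \right] \\
&\quad \le \mathbb{E}_{x_t, x'_t \sim p_t} \sup_{\mathbf{x}} \mathbb{E}_{\epsilon_{t+1:T}} \sup_{f \in \mathcal{F}} \left[ 2 \sum_{s=t+1}^{T} \epsilon_s\, l(f, \mathbf{x}_{s-t}(\epsilon_{t+1:s-1})) - L_{t-1}(f) + l(f, x'_t) - l(f, x_t) \right].
\end{aligned}$$

We now argue that the independent $x_t$ and $x'_t$ have the same distribution $p_t$, and thus we can introduce a random sign $\epsilon_t$. The above expression then equals

$$\begin{aligned}
&\mathbb{E}_{x_t, x'_t \sim p_t} \mathbb{E}_{\epsilon_t} \sup_{\mathbf{x}} \mathbb{E}_{\epsilon_{t+1:T}} \sup_{f \in \mathcal{F}} \left[ 2 \sum_{s=t+1}^{T} \epsilon_s\, l(f, \mathbf{x}_{s-t}(\epsilon_{t+1:s-1})) - L_{t-1}(f) + \epsilon_t \left( l(f, x'_t) - l(f, x_t) \right) \right] \\
&\quad \le \sup_{x_t, x'_t \in \mathcal{X}} \mathbb{E}_{\epsilon_t} \sup_{\mathbf{x}} \mathbb{E}_{\epsilon_{t+1:T}} \sup_{f \in \mathcal{F}} \left[ 2 \sum_{s=t+1}^{T} \epsilon_s\, l(f, \mathbf{x}_{s-t}(\epsilon_{t+1:s-1})) - L_{t-1}(f) + \epsilon_t \left( l(f, x'_t) - l(f, x_t) \right) \right],
\end{aligned}$$

where we upper bounded the expectation by the supremum. Splitting the resulting expression into two parts, we arrive at the upper bound of

$$2 \sup_{x_t \in \mathcal{X}} \mathbb{E}_{\epsilon_t} \sup_{\mathbf{x}} \mathbb{E}_{\epsilon_{t+1:T}} \sup_{f \in \mathcal{F}} \left[ \sum_{s=t+1}^{T} \epsilon_s\, l(f, \mathbf{x}_{s-t}(\epsilon_{t+1:s-1})) - \frac{1}{2} L_{t-1}(f) + \epsilon_t\, l(f, x_t) \right] = \mathcal{R}_T(\mathcal{F} \mid x_1, \ldots, x_{t-1}).$$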
The last equality is easy to verify, as we are effectively adding a root $x_t$ to the two subtrees, for $\epsilon_t = +1$ and $\epsilon_t = -1$, respectively.

One can see that the proof of admissibility corresponds to one step minimax swap and symmetrization in the proof of [14]. In contrast, in the latter paper, all $T$ minimax swaps are performed at once, followed by $T$ symmetrization steps.

Proof of Proposition 3. Let us first prove that the relaxation is admissible with the Exponential Weights algorithm as an admissible algorithm. Let $L_t(f) = \sum_{i=1}^{t} l(f, x_i)$. Let $\lambda^*$ be the optimal value in the definition of $\mathrm{Rel}_T(\mathcal{F} \mid x_1, \ldots, x_{t-1})$. Then

$$\begin{aligned}
&\inf_{q_t \in \Delta(\mathcal{F})} \sup_{x_t \in \mathcal{X}} \left\{ \mathbb{E}_{f \sim q_t}[l(f, x_t)] + \mathrm{Rel}_T(\mathcal{F} \mid x_1, \ldots, x_t) \right\} \\
&\quad \le \inf_{q_t \in \Delta(\mathcal{F})} \sup_{x_t \in \mathcal{X}} \left\{ \mathbb{E}_{f \sim q_t}[l(f, x_t)] + \frac{1}{\lambda^*} \log \left( \sum_{f \in \mathcal{F}} \exp\left(-\lambda^* L_t(f)\right) \right) + 2\lambda^* (T - t) \right\}.
\end{aligned}$$

Let us upper bound the infimum by a particular choice of $q$ which is the exponential weights distribution

$$q_t(f) = \exp\left(-\lambda^* L_{t-1}(f)\right) / Z_{t-1}, \qquad Z_{t-1} = \sum_{f \in \mathcal{F}} \exp\left(-\lambda^* L_{t-1}(f)\right).$$

By [6, Lemma A.1],

$$\begin{aligned}
\frac{1}{\lambda^*} \log \left( \sum_{f \in \mathcal{F}} \exp\left(-\lambda^* L_t(f)\right) \right)
&= \frac{1}{\lambda^*} \log \left( \mathbb{E}_{f \sim q_t} \exp\left(-\lambda^* l(f, x_t)\right) \right) + \frac{1}{\lambda^*} \log Z_{t-1} \\
&\le -\mathbb{E}_{f \sim q_t}\, l(f, x_t) + \frac{\lambda^*}{2} + \frac{1}{\lambda^*} \log Z_{t-1}.
\end{aligned}$$

Hence,

$$\begin{aligned}
\inf_{q_t \in \Delta(\mathcal{F})} \sup_{x_t \in \mathcal{X}} \left\{ \mathbb{E}_{f \sim q_t}[l(f, x_t)] + \mathrm{Rel}_T(\mathcal{F} \mid x_1, \ldots, x_t) \right\}
&\le \frac{1}{\lambda^*} \log \left( \sum_{f \in \mathcal{F}} \exp\left(-\lambda^* L_{t-1}(f)\right) \right) + 2\lambda^* (T - t + 1) \\
&= \mathrm{Rel}_T(\mathcal{F} \mid x_1, \ldots, x_{t-1})
\end{aligned}$$

by the optimality of $\lambda^*$. The bound can be improved by a factor of 2 for some loss functions, since it will disappear from the definition of sequential Rademacher complexity.

We conclude that the Exponential Weights algorithm is an admissible strategy for the relaxation (9). The final regret bound follows immediately from the bound on sequential Rademacher complexity (which, in this case, is simply the supremum of a martingale difference process indexed by $N$ elements; see e.g. [14]).
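The admissibility step in the proof of Proposition 3 can be checked numerically. The following is a minimal Python sketch, not the authors' implementation. It assumes a finite class of $N$ experts, losses in $[-1, 1]$, and a single fixed $\lambda$ for all rounds (the relaxation in the text optimizes $\lambda$ at every step; fixing it is a simplification made here). The names `exp_weights`, `relaxation`, and `run` are hypothetical and chosen for illustration only.

```python
import math
import random


def exp_weights(cum_losses, lam):
    """Distribution q_t(f) proportional to exp(-lam * L_{t-1}(f))."""
    m = min(cum_losses)  # shift by the minimum for numerical stability
    w = [math.exp(-lam * (L - m)) for L in cum_losses]
    z = sum(w)
    return [wi / z for wi in w]


def relaxation(cum_losses, lam, rounds_left):
    """(1/lam) * log(sum_f exp(-lam * L_t(f))) + 2 * lam * (T - t), for a fixed lam."""
    m = min(cum_losses)
    log_sum = -lam * m + math.log(sum(math.exp(-lam * (L - m)) for L in cum_losses))
    return log_sum / lam + 2.0 * lam * rounds_left


def run(loss_rows, lam):
    """Play exponential weights on loss_rows[t][f] = l(f, x_t), assumed in [-1, 1].

    Returns the cumulative expected loss, the per-round admissibility gaps
    Rel(x_1..x_{t-1}) - (E_q l(f, x_t) + Rel(x_1..x_t)), and the final
    cumulative losses L_T(f).
    """
    T = len(loss_rows)
    cum = [0.0] * len(loss_rows[0])
    total, gaps = 0.0, []
    for t, row in enumerate(loss_rows, start=1):
        q = exp_weights(cum, lam)
        expected_loss = sum(qf * lf for qf, lf in zip(q, row))
        before = relaxation(cum, lam, T - t + 1)
        cum = [c + l for c, l in zip(cum, row)]
        after = relaxation(cum, lam, T - t)
        gaps.append(before - (expected_loss + after))
        total += expected_loss
    return total, gaps, cum


if __name__ == "__main__":
    random.seed(0)
    T, N = 50, 8
    rows = [[random.uniform(-1.0, 1.0) for _ in range(N)] for _ in range(T)]
    lam = math.sqrt(math.log(N) / (2 * T))
    total, gaps, cum = run(rows, lam)
    print("regret:", total - min(cum))
    print("bound :", relaxation([0.0] * N, lam, T))
    print("min admissibility gap:", min(gaps))
```

Each gap should be nonnegative, mirroring the inequality chain in the proof, and the regret should not exceed the initial value $\frac{1}{\lambda} \log N + 2\lambda T$ of the fixed-$\lambda$ relaxation.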
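Arriving at the relaxation. We now show that the Exponential Weights relaxation arises naturally as an upper bound on sequential Rademacher complexity of a finite class. For any $\lambda > 0$,

$$\begin{aligned}
\mathbb{E}_{\epsilon} \left[ \sup_{f \in \mathcal{F}} \left\{ 2 \sum_{i=1}^{T-t} \epsilon_i\, l(f, \mathbf{x}_i(\epsilon)) - L_t(f) \right\} \right]
&\le \frac{1}{\lambda} \log \left( \mathbb{E}_{\epsilon} \left[ \sup_{f \in \mathcal{F}} \exp \left( 2\lambda \sum_{i=1}^{T-t} \epsilon_i\, l(f, \mathbf{x}_i(\epsilon)) - \lambda L_t(f) \right) \right] \right) \\
&\le \frac{1}{\lambda} \log \left( \mathbb{E}_{\epsilon} \left[ \sum_{f \in \mathcal{F}} \exp \left( 2\lambda \sum_{i=1}^{T-t} \epsilon_i\, l(f, \mathbf{x}_i(\epsilon)) - \lambda L_t(f) \right) \right] \right) \\
&= \frac{1}{\lambda} \log \left( \sum_{f \in \mathcal{F}} \exp\left(-\lambda L_t(f)\right)\, \mathbb{E}_{\epsilon} \left[ \prod_{i=1}^{T-t} \exp\left( 2\lambda \epsilon_i\, l(f, \mathbf{x}_i(\epsilon)) \right) \right] \right).
\end{aligned}$$

Since, conditioned on $\epsilon_1, \ldots, \epsilon_{i-1}$, the random variable $\epsilon_i\, l(f, \mathbf{x}_i(\epsilon))$ is subgaussian, we can upper bound the expected value of the product, peeling one random variable at a time from the end (see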
