1  Basic Setup and some math notions
This chapter introduces the basic nomenclature: training/test error, generalization error, etc. ≪Tengyu notes: Todos: illustrate with plots a typical training curve and test curve; mention some popular architectures (feedforward, convolutional, pooling, ResNet, DenseNet) in a brief paragraph each.≫
We review the basic notions in statistical learning theory.
• A space of possible data points $\mathcal{X}$.
• A space of possible labels $\mathcal{Y}$.
• A joint probability distribution $\mathcal{D}$ on $\mathcal{X} \times \mathcal{Y}$. We assume that our training data consist of $n$ data points
$$(x^{(1)}, y^{(1)}), \ldots, (x^{(n)}, y^{(n)}) \overset{\text{i.i.d.}}{\sim} \mathcal{D},$$
each drawn independently from $\mathcal{D}$.
• Hypothesis space: $\mathcal{H}$ is a family of hypotheses, or a family of predictors. E.g., $\mathcal{H}$ could be the set of all neural networks with a fixed architecture: $\mathcal{H} = \{h_\theta\}$, where $h_\theta$ is a neural network parameterized by $\theta$.
• Loss function: $\ell : (\mathcal{X} \times \mathcal{Y}) \times \mathcal{H} \to \mathbb{R}$.
  – E.g., in binary classification where $\mathcal{Y} = \{-1, +1\}$, the logistic loss of a hypothesis $h_\theta$ on a data point $(x, y)$ is
  $$\ell((x, y), \theta) = \log\bigl(1 + \exp(-y\, h_\theta(x))\bigr).$$
• Expected loss:
$$L(h) = \mathop{\mathbb{E}}_{(x, y) \sim \mathcal{D}}\bigl[\ell((x, y), h)\bigr].$$
Recall that $\mathcal{D}$ is the data distribution over $\mathcal{X} \times \mathcal{Y}$. A minimal numerical sketch of these definitions appears below.
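To make the definitions concrete, here is a minimal Python sketch; it is not from the text. It assumes a toy distribution $\mathcal{D}$ in which the label $y$ is a uniform random sign and $x$ is a Gaussian centered at $y \cdot (1, 1)$, and a linear hypothesis $h_\theta(x) = \langle \theta, x \rangle$ standing in for a neural network. It draws $n$ i.i.d. training points, evaluates the logistic loss on each, and estimates the expected loss $L(h)$ by Monte Carlo on a large fresh sample.

import numpy as np

rng = np.random.default_rng(0)

def sample_from_D(n):
    """Draw n i.i.d. pairs (x, y) from a toy distribution D on X x Y.

    Here X = R^2 and Y = {-1, +1}: y is a uniform random sign and
    x is a standard Gaussian centered at y * (1, 1).  (Illustrative
    assumption; the text does not fix a particular D.)
    """
    y = rng.choice([-1, 1], size=n)
    x = y[:, None] * np.ones(2) + rng.standard_normal((n, 2))
    return x, y

def h(theta, x):
    """A linear hypothesis h_theta(x) = <theta, x>, standing in for a
    neural network with parameters theta."""
    return x @ theta

def logistic_loss(theta, x, y):
    """l((x, y), theta) = log(1 + exp(-y * h_theta(x))), computed
    stably via logaddexp."""
    return np.logaddexp(0.0, -y * h(theta, x))

theta = np.array([1.0, 1.0])

# Training data: n i.i.d. draws from D, as in the setup above.
x_train, y_train = sample_from_D(n=100)
print("average training loss:", logistic_loss(theta, x_train, y_train).mean())

# Monte Carlo estimate of the expected loss L(h) = E_{(x,y)~D}[l((x,y),h)],
# using a large fresh sample in place of the exact expectation.
x_big, y_big = sample_from_D(n=100_000)
print("estimated L(h):", logistic_loss(theta, x_big, y_big).mean())

As $n$ grows, the average loss on the i.i.d. training sample concentrates around $L(h)$; the gap between the two is the generalization error this chapter goes on to discuss.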