Shared Gaussian Process Latent Variables Models - Oxford Brookes ...

More documents

Recommendations

Info

2.7. GAUSSIAN PROCESSES 41 where ǫ ∼ N(0, β −1 ). We are interested in encoding our prior knowledge about the relationship in a distribution over f. For regression we usually have a preference to functions varying smoothly over X, limxi→xj+ |f(xi) − f(xj)| = = limxi→−xj |f(xi) − f(xj)| = 0, ∀xj ∈ X. This assumption can be encoded by the GP through the choice of covariance function k(x,x ′ ). The covariance function encodes how we expect variables to vary together, k(x,x ′ ) = E ((f(x − µ(x)))(f(x ′ ) − µ(x ′ ))), this means that we can encode the smoothness behavior over X by choosing a covariance function which is smooth over the same domain. The mean function µ(x) = E(f(x)) encodes the expected value of f. By translating the observed data to be centered around zero the mean function can, for simplicity, be chosen as the constant function zero. 2.7.1 Prediction Having specified a prior distribution encoding our knowledge (and preference) about the relationship between X and Y we are interested in inferring the locations y∗ corresponding to a previously unobserved point x∗ ∈ X. The joint distribution of the observed data (y,x) and the unobserved point (y∗,x∗) can be
42 CHAPTER 2. BACKGROUND 2.5 2 1.5 1 0.5 −3 −2 −1 −0.5 1 2 3 −1 −1.5 −2 −2.5 Figure 2.5: Samples from GP Prior using a RBF covariance function and a constant zero mean function. As can be seen, each sample are smooth over the input domain. written as follows, ⎡ ⎢ ⎣ y y∗ ⎤ ⎛ ⎡ ⎤⎞ ⎥ ⎜ ⎢ ⎦ ∼ ⎝0, ⎣ K(X,X) + β−1I K(X,x∗) K(x∗,X) K(x∗,x∗) + β−1 ⎥⎟ ⎦⎠ . Predictions over the unobserved locations are made from the posterior distribution. The posterior is formulated by conditioning the joint distribution on the observed data. Conditioning two <strong>Gaussian</strong>s results in a <strong>Gaussian</strong> distribution, defined by mean and covariance, ¯y(x∗) = k(x∗,X)(K + β −1 I) −1 Y σ 2 (x∗) = (k(x∗,x∗) + β −1 ) − k(x∗,X)(K + β −1 ) −1 k(x∗,X), (2.47) where K = k(X,X). Those equations are the central predictive equations in the GP framework. In Figure 2.6 samples from the posterior distribution of a GP with an RBF covariance function and a constant zero mean function is shown. As can
Page 1 and 2: Shared Gaussian Process Latent Vari
Page 3 and 4: Acknowledgements 2
Page 5 and 6: 4 CONTENTS 2.7.2 Training . . . . .
Page 7 and 8: 6 CONTENTS 5.10 Summary . . . . . .
Page 9 and 10: 8 LIST OF FIGURES 3.11 Toy data3: l
Page 11 and 12: Chapter 1 Introduction Information
Page 13 and 14: 12 CHAPTER 1. INTRODUCTION current
Page 15 and 16: Chapter 2 Background 2.1 Introducti
Page 17 and 18: 16 CHAPTER 2. BACKGROUND 1 0.9 0.8
Page 19 and 20: 18 CHAPTER 2. BACKGROUND X to its o
Page 21 and 22: 20 CHAPTER 2. BACKGROUND dimensions
Page 23 and 24: 22 CHAPTER 2. BACKGROUND approximat
Page 25 and 26: 24 CHAPTER 2. BACKGROUND diagonal c
Page 27 and 28: 26 CHAPTER 2. BACKGROUND unraveled.
Page 29 and 30: 28 CHAPTER 2. BACKGROUND apply an a
Page 31 and 32: 30 CHAPTER 2. BACKGROUND be connect
Page 33 and 34: 32 CHAPTER 2. BACKGROUND Local Line
Page 35 and 36: 34 CHAPTER 2. BACKGROUND 15 10 5 0
Page 37 and 38: 36 CHAPTER 2. BACKGROUND by W, Figu
Page 39 and 40: 38 CHAPTER 2. BACKGROUND Figure 2.4
Page 41: 40 CHAPTER 2. BACKGROUND X ∈ ℜ
Page 45 and 46: 44 CHAPTER 2. BACKGROUND rupted by
Page 47 and 48: 46 CHAPTER 2. BACKGROUND where ˜ C
Page 49 and 50: 48 CHAPTER 2. BACKGROUND prior over
Page 51 and 52: 50 CHAPTER 2. BACKGROUND generative
Page 53 and 54: 52 CHAPTER 2. BACKGROUND the latent
Page 55 and 56: 54 CHAPTER 2. BACKGROUND 2.9 Shared
Page 57 and 58: 56 CHAPTER 2. BACKGROUND Y X Figure
Page 59 and 60: 58 CHAPTER 2. BACKGROUND Gaussian p
Page 61 and 62: 60 CHAPTER 3. SHARED GP-LVM sponden
Page 63 and 64: 62 CHAPTER 3. SHARED GP-LVM makes m
Page 65 and 66: 64 CHAPTER 3. SHARED GP-LVM where t
Page 67 and 68: 66 CHAPTER 3. SHARED GP-LVM 4 3 2 1
Page 69 and 70: 68 CHAPTER 3. SHARED GP-LVM 1 0.8 0
Page 71 and 72: 70 CHAPTER 3. SHARED GP-LVM Example
Page 73 and 74: 72 CHAPTER 3. SHARED GP-LVM 0.3 0.2
Page 75 and 76: 74 CHAPTER 3. SHARED GP-LVM φ Y X
Page 77 and 78: 76 CHAPTER 3. SHARED GP-LVM leading
Page 79 and 80: 78 CHAPTER 3. SHARED GP-LVM servati
Page 81 and 82: Chapter 4 NCCA 4.1 Introduction In
Page 83 and 84: 82 CHAPTER 4. NCCA as u Y i = x S
Page 85 and 86: 84 CHAPTER 4. NCCA X Y Z X X S Y Z
Page 87 and 88: 86 CHAPTER 4. NCCA By pre-multiplyi
Page 89 and 90: 88 CHAPTER 4. NCCA 1 0.8 0.6 0.4 0.
Page 91 and 92: 90 CHAPTER 4. NCCA 4.5 Extensions W
Page 93 and 94:
Chapter 5 Applications 5.1 Introduc
Page 95 and 96:
94 CHAPTER 5. APPLICATIONS as optic
Page 97 and 98:
96 CHAPTER 5. APPLICATIONS of an im
Page 99 and 100:
98 CHAPTER 5. APPLICATIONS histogra
Page 101 and 102:
100 CHAPTER 5. APPLICATIONS the dat
Page 103 and 104:
102 CHAPTER 5. APPLICATIONS Figure
Page 105 and 106:
104 CHAPTER 5. APPLICATIONS the GP
Page 107 and 108:
106 CHAPTER 5. APPLICATIONS varianc
Page 109 and 110:
108 CHAPTER 5. APPLICATIONS locatio
Page 111 and 112:
Page 113 and 114:
112 CHAPTER 5. APPLICATIONS Error (
Page 115 and 116:
Page 117 and 118:
116 CHAPTER 5. APPLICATIONS over th
Page 119 and 120:
118 CHAPTER 5. APPLICATIONS age rep
Page 121 and 122:
120 CHAPTER 6. CONCLUSIONS vated an
Page 123 and 124:
122 CHAPTER 6. CONCLUSIONS a shared
Page 125 and 126:
124 BIBLIOGRAPHY [7] S. Belongie, J
Page 127 and 128:
126 BIBLIOGRAPHY [24] K. Grochow, S
Page 129 and 130:
128 BIBLIOGRAPHY [39] D. MacKay. Ba
Page 131 and 132:
130 BIBLIOGRAPHY [56] H. A. Simon.
Page 133 and 134:
132 BIBLIOGRAPHY [74] S. Wachter an
show all

Shared Gaussian Process Latent Variables Models - Oxford Brookes ...

Create successful ePaper yourself

Delete template?

Save as template?