Shared Gaussian Process Latent Variables Models - Oxford Brookes ...

forcing the model to explain the data using few kernel functions, leading to a sparse model. As noted in [64, 45, 14], the RVM is a special case of a GP with covariance function

$$k(x_i, x_j) = \sum_{l=1}^{N} \frac{1}{\alpha_l}\, c(x_i, x_l)\, c(x_j, x_l), \tag{2.55}$$

where $c$ is the kernel basis function as in Eq. 2.51. This covariance function is different in form from a standard one, as it depends on the training data $x_l$. Further, it corresponds to a degenerate covariance function of rank at most $N$, since it is an expansion around the training data. Training the RVM is the same as optimizing a GP regression model, i.e. finding the hyper-parameters that maximize the marginal likelihood of the model.

However, as noted in [45], the covariance function of the RVM has some undesirable effects. Using a standard RBF kernel for the GP, the predictive variance associated with a point far away from the training data will be large, i.e. the model will be uncertain in regions where it has not previously seen data. The opposite is true for the covariance function specified by the RVM: far from the training data both terms in the predictive variance (Eq. 2.47) will be close to zero, whereas for a standard RBF kernel the first term will be large.

2.8 GP-LVM

Lawrence [33] suggested an alternative Gaussian latent variable model capable of handling non-linear generative mappings while at the same time avoiding the problems associated with the GTM. Both PPCA and the GTM specify a prior over the latent locations and seek a maximum likelihood solution for the parameters of the generative mapping. However, from a Bayesian perspective both the mapping and the latent locations are nuisance parameters and should therefore be marginalized. In Lawrence's formulation, the prior is specified over the mapping instead of the latent locations, and the marginal likelihood is formulated by integrating over the mapping. Using the GP framework, a rich and flexible prior over non-linear mappings can be specified. The algorithm is referred to as the Gaussian Process Latent Variable Model (GP-LVM).

By marginalizing over the mapping $f$, the GP-LVM proceeds by seeking the maximum likelihood solution for the latent locations $X$ and the hyper-parameters $\Phi$ of the GP,

$$\{\hat{X}, \hat{\Phi}\} = \operatorname*{argmax}_{X,\Phi}\, p(Y \mid X, \Phi) = \operatorname*{argmax}_{X,\Phi} \int p(Y \mid X, f, \Phi)\, p(f)\, df, \tag{2.56}$$

where $p(f) = \mathcal{GP}(\mu(x), k(x, x'))$. The posterior distribution over the latent locations and the hyper-parameters given the data can be written as

$$p(X, \Phi \mid Y) \propto p(Y \mid X, \Phi)\, p(X)\, p(\Phi). \tag{2.57}$$

In the standard GP-LVM formulation, uninformative priors [33] are specified over the latent locations and the hyper-parameters. Learning in the GP-LVM framework consists of minimizing the negative log posterior with respect to the locations of the latent variables $X$ and the hyper-parameters $\Phi$ of the process. With a simple spherical Gaussian prior over the latent locations and an uninformative
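
As a rough illustration of the objective implied by Eqs. 2.56 and 2.57, the following sketch evaluates the negative log posterior for a given set of latent locations. It rests on several assumptions that are not stated in the text above: a zero-mean GP, an RBF kernel with additive white noise, output dimensions that are independent and share a single kernel, a unit-variance spherical Gaussian prior on X, and a flat prior on the hyper-parameters. The function names and default values are hypothetical and chosen only for this example.

```python
import numpy as np


def rbf_kernel(X, variance=1.0, lengthscale=1.0, noise=1e-2):
    """RBF covariance between the rows of X, plus white noise on the diagonal.

    The kernel form and the default hyper-parameters are assumptions made
    for illustration only.
    """
    sq_norms = np.sum(X ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * X @ X.T
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against round-off negatives
    K = variance * np.exp(-0.5 * sq_dists / lengthscale ** 2)
    return K + noise * np.eye(X.shape[0])


def gplvm_neg_log_posterior(X, Y, variance=1.0, lengthscale=1.0, noise=1e-2):
    """Negative log of p(Y | X, Phi) p(X), up to the constant of the prior.

    X: (N, q) latent locations; Y: (N, D) observations.
    Assumes a zero-mean GP and a unit spherical Gaussian prior on X.
    """
    N, D = Y.shape
    K = rbf_kernel(X, variance, lengthscale, noise)
    L = np.linalg.cholesky(K)                            # K = L L^T
    K_inv_Y = np.linalg.solve(L.T, np.linalg.solve(L, Y))
    log_det_K = 2.0 * np.sum(np.log(np.diag(L)))
    # Mapping f marginalized analytically: each column of Y is N(0, K).
    neg_log_lik = (0.5 * D * N * np.log(2.0 * np.pi)
                   + 0.5 * D * log_det_K
                   + 0.5 * np.sum(Y * K_inv_Y))          # tr(K^{-1} Y Y^T) / 2
    neg_log_prior = 0.5 * np.sum(X ** 2)                 # spherical Gaussian on X
    return neg_log_lik + neg_log_prior


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    Y = rng.normal(size=(20, 5))          # toy observations
    X0 = 0.1 * rng.normal(size=(20, 2))   # random latent initialization
    print(gplvm_neg_log_posterior(X0, Y))
```

In practice this scalar would be handed to a gradient-based optimizer that updates both the latent locations and the kernel hyper-parameters; the sketch only evaluates the objective and leaves out gradients.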