Online Model Selection Based on the Variational Bayes
Online Model Selection Based on the Variational Bayes
Online Model Selection Based on the Variational Bayes
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<str<strong>on</strong>g>Online</str<strong>on</strong>g> <str<strong>on</strong>g>Model</str<strong>on</strong>g> <str<strong>on</strong>g>Selecti<strong>on</strong></str<strong>on</strong>g> <str<strong>on</strong>g>Based</str<strong>on</strong>g> <strong>on</strong> <strong>the</strong> Variati<strong>on</strong>al <strong>Bayes</strong> 1655<br />
c<strong>on</strong>jugate distributi<strong>on</strong> for <strong>the</strong> EFH model with posterior hyperparameters<br />
(®, c ) (see appendix A):<br />
Q h (µ ) D P a (µ | ®, c ) D exp [c (® ¢ µ ¡ Ã (µ )) ¡ © (®, c )] , (2.16)<br />
c D T C c 0, (2.17)<br />
® D 1 h i<br />
Thr(x, z)i N C ®0 ¢ c 0 ,<br />
c µ<br />
(2.18)<br />
hr(x, z)i N D<br />
µ 1<br />
TX<br />
Z<br />
T<br />
dm (z(t))P(z(t)|x(t), µ N )r(x(t), z(t)). (2.19)<br />
tD1<br />
The effective amount of data c D (T C c 0) represents <strong>the</strong> reliability (or uncertainty)<br />
of <strong>the</strong> estimati<strong>on</strong>. As <strong>the</strong> amount of data T increases, <strong>the</strong> reliability<br />
of <strong>the</strong> estimati<strong>on</strong> increases. The prior hyperparameter c 0 represents <strong>the</strong> reliability<br />
of <strong>the</strong> prior belief <strong>on</strong> <strong>the</strong> prior hyperparameter ®0. The posterior<br />
hyperparameter ® is determined by <strong>the</strong> expectati<strong>on</strong> value of <strong>the</strong> suf�cient<br />
statistics. The prior hyperparameter ®0 gives <strong>the</strong> initial value for ®.<br />
Since <strong>the</strong> posterior parameter distributi<strong>on</strong> Qh (µ ) is given by <strong>the</strong> c<strong>on</strong>jugate<br />
distributi<strong>on</strong> Pa (µ | ®, c ), which is also an exp<strong>on</strong>ential family model, <strong>the</strong><br />
integrati<strong>on</strong> over <strong>the</strong> parameter µ in equati<strong>on</strong> 2.15 can be explicitly calculated<br />
as<br />
Nµ D hµi ® , (2.20)<br />
Z<br />
hµi ® D dm (µ )Pa (µ | ®, c )µ D 1 @©<br />
c @® (®, c ). (2.21)<br />
The natural parameter of <strong>the</strong> c<strong>on</strong>jugate distributi<strong>on</strong> is given by (c ®, c ). The<br />
corresp<strong>on</strong>ding expectati<strong>on</strong> parameters are given by <strong>the</strong> ensemble averages<br />
of <strong>the</strong> model parameters: hµi ® de�ned in equati<strong>on</strong> 2.21 and<br />
hà (µ )i ® D<br />
Z<br />
D 1<br />
c<br />
dm (µ )Pa (µ | ®, c )Ã (µ )<br />
@©<br />
@® (®, c ) ¢ ® ¡ @©<br />
@c (®, c ). (2.22)<br />
2.5 Parameterized Free Energy Functi<strong>on</strong>. Since <strong>the</strong> optimal soluti<strong>on</strong> simultaneously<br />
satis�es equati<strong>on</strong>s 2.14 and 2.16, <strong>the</strong> trial posterior distributi<strong>on</strong>s,<br />
Q h (µ ) and Qz(ZfTg), can be parameterized as<br />
Qh (µ ) D Pa (µ | ®, c ) D exp [c (® ¢ µ ¡ Ã (µ )) ¡ © (®, c )] , (2.23)<br />
TY<br />
Qz(ZfTg) D Qz(z(t)), (2.24)<br />
tD1<br />
Qz(z(t)) D P(z(t)|x(t), N µ ), (2.25)