24.12.2012 Views

Online Model Selection Based on the Variational Bayes

Online Model Selection Based on the Variational Bayes

Online Model Selection Based on the Variational Bayes

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<str<strong>on</strong>g>Online</str<strong>on</strong>g> <str<strong>on</strong>g>Model</str<strong>on</strong>g> <str<strong>on</strong>g>Selecti<strong>on</strong></str<strong>on</strong>g> <str<strong>on</strong>g>Based</str<strong>on</strong>g> <strong>on</strong> <strong>the</strong> Variati<strong>on</strong>al <strong>Bayes</strong> 1655<br />

c<strong>on</strong>jugate distributi<strong>on</strong> for <strong>the</strong> EFH model with posterior hyperparameters<br />

(®, c ) (see appendix A):<br />

Q h (µ ) D P a (µ | ®, c ) D exp [c (® ¢ µ ¡ Ã (µ )) ¡ © (®, c )] , (2.16)<br />

c D T C c 0, (2.17)<br />

® D 1 h i<br />

Thr(x, z)i N C ®0 ¢ c 0 ,<br />

c µ<br />

(2.18)<br />

hr(x, z)i N D<br />

µ 1<br />

TX<br />

Z<br />

T<br />

dm (z(t))P(z(t)|x(t), µ N )r(x(t), z(t)). (2.19)<br />

tD1<br />

The effective amount of data c D (T C c 0) represents <strong>the</strong> reliability (or uncertainty)<br />

of <strong>the</strong> estimati<strong>on</strong>. As <strong>the</strong> amount of data T increases, <strong>the</strong> reliability<br />

of <strong>the</strong> estimati<strong>on</strong> increases. The prior hyperparameter c 0 represents <strong>the</strong> reliability<br />

of <strong>the</strong> prior belief <strong>on</strong> <strong>the</strong> prior hyperparameter ®0. The posterior<br />

hyperparameter ® is determined by <strong>the</strong> expectati<strong>on</strong> value of <strong>the</strong> suf�cient<br />

statistics. The prior hyperparameter ®0 gives <strong>the</strong> initial value for ®.<br />

Since <strong>the</strong> posterior parameter distributi<strong>on</strong> Qh (µ ) is given by <strong>the</strong> c<strong>on</strong>jugate<br />

distributi<strong>on</strong> Pa (µ | ®, c ), which is also an exp<strong>on</strong>ential family model, <strong>the</strong><br />

integrati<strong>on</strong> over <strong>the</strong> parameter µ in equati<strong>on</strong> 2.15 can be explicitly calculated<br />

as<br />

Nµ D hµi ® , (2.20)<br />

Z<br />

hµi ® D dm (µ )Pa (µ | ®, c )µ D 1 @©<br />

c @® (®, c ). (2.21)<br />

The natural parameter of <strong>the</strong> c<strong>on</strong>jugate distributi<strong>on</strong> is given by (c ®, c ). The<br />

corresp<strong>on</strong>ding expectati<strong>on</strong> parameters are given by <strong>the</strong> ensemble averages<br />

of <strong>the</strong> model parameters: hµi ® de�ned in equati<strong>on</strong> 2.21 and<br />

hà (µ )i ® D<br />

Z<br />

D 1<br />

c<br />

dm (µ )Pa (µ | ®, c )Ã (µ )<br />

@©<br />

@® (®, c ) ¢ ® ¡ @©<br />

@c (®, c ). (2.22)<br />

2.5 Parameterized Free Energy Functi<strong>on</strong>. Since <strong>the</strong> optimal soluti<strong>on</strong> simultaneously<br />

satis�es equati<strong>on</strong>s 2.14 and 2.16, <strong>the</strong> trial posterior distributi<strong>on</strong>s,<br />

Q h (µ ) and Qz(ZfTg), can be parameterized as<br />

Qh (µ ) D Pa (µ | ®, c ) D exp [c (® ¢ µ ¡ Ã (µ )) ¡ © (®, c )] , (2.23)<br />

TY<br />

Qz(ZfTg) D Qz(z(t)), (2.24)<br />

tD1<br />

Qz(z(t)) D P(z(t)|x(t), N µ ), (2.25)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!