26.10.2013 Views

Nonparametric Bayesian Discrete Latent Variable Models for ...

Nonparametric Bayesian Discrete Latent Variable Models for ...

Nonparametric Bayesian Discrete Latent Variable Models for ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3 Dirichlet Process Mixture <strong>Models</strong><br />

– For each indicator variable:<br />

– If ci is a singleton, assign its parameters µ ci and Sci to one of the auxiliary<br />

parameter pairs.<br />

– Invent other auxiliary components by sampling values <strong>for</strong> µ j and Sj from their<br />

respective priors.<br />

– Update the indicator variable, conditional on the data, the other indicators and<br />

hyperparameters.<br />

– Discard the empty components.<br />

– Update the DP concentration parameter α.<br />

The integral we want to evaluate is over two parameters, µ j and Sj. Exploiting the<br />

conditional conjugacy, it is possible to integrate over one of these parameters given the<br />

other. Thus, we can pick only one of the parameters randomly from its prior, and integrate<br />

over the other, which might lead to faster mixing. The log likelihood <strong>for</strong> the<br />

SampleMu and the SampleS schemes are as follows:<br />

SampleMu<br />

Sampling µ j from its prior and integrating over Sj gives the conditional log likelihood:<br />

log p(xi|c−i, µ j, β, W ) = − D<br />

β+nj<br />

2 log π + 2 log |W ∗ | − β+nj+1<br />

2 log |W ∗ + xix T i |<br />

+ log Γ( β + nj + 1<br />

2<br />

where W ∗ = βW + <br />

(xl − µ j)(xl − µ j) T .<br />

l:cl=j<br />

) − log Γ( β + nj + 1 − D<br />

),<br />

2<br />

(3.66)<br />

The sampling steps <strong>for</strong> the indicator variables are:<br />

– Remove all precision parameters Sj from the representation.<br />

– If ci is a singleton, assign its mean parameter µ ci to one of the auxiliary parameters.<br />

– Invent other auxiliary components by sampling values <strong>for</strong> the component mean µ j<br />

from its prior.<br />

– Update the indicator variable, conditional on the data, the other indicators, the<br />

component means and hyperparameters using the likelihood given in eq. (3.66).<br />

– Discard the empty components.<br />

SampleS<br />

Sampling Sj from its prior and integrating over µ j, the conditional log likelihood becomes:<br />

where ξ ∗ = Sj<br />

46<br />

log p(xi|c−i, Sj, R, ξ) = − D<br />

1<br />

2 log(2π) −<br />

<br />

l:cl=j<br />

+ 1<br />

2 (ξ∗ <br />

T<br />

+ Sjxi) (nj + 1)Sj + R<br />

− 1<br />

<br />

2ξ∗T njSj + R<br />

xj + Rξ.<br />

−1<br />

2 xT i Sjxi<br />

−1 (ξ ∗ + Sjxi)<br />

ξ ∗ + 1<br />

2 log |Sj||njSj + R|<br />

|(nj + 1)Sj + R| ,<br />

(3.67)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!