Nonparametric Bayesian Discrete Latent Variable Models for ...
Nonparametric Bayesian Discrete Latent Variable Models for ...
Nonparametric Bayesian Discrete Latent Variable Models for ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
A Details of Derivations <strong>for</strong> the<br />
Stick-Breaking Representation of IBP<br />
The stick-breaking construction of the IBP has been summarized in Section 4.1.2. Here,<br />
we give details of derivation <strong>for</strong> obtaining this representation.<br />
A.1 Densities of ordered stick lengths<br />
The density <strong>for</strong> the kth largest feature presence probability is obtained by considering<br />
the decreasing order statistics of K i.i.d. random variables in the limit as K → ∞. We<br />
define µl1 ≥ µl2 ≥ · · · ≥ µlK to be the decreasing ordering of µ1, . . . , µK, and Lk to be<br />
the set of indices other than the k largest µ’s,<br />
Lk = {1, . . . , K}\{l1, . . . , lk}.<br />
Thus, the indices <strong>for</strong> the k largest feature presence probabilities are {l1, . . . , lk}. Denoting<br />
µ (k) = µlk , the k largest values among µ1:K is denoted as µ (1:k) = {µ (1), . . . , µ (k)}<br />
and we have<br />
µl ≤ µ (k) <strong>for</strong> all l ∈ Lk. (A.1)<br />
Thus, the range of the subsequent µl’s given µ (k) is restricted to [0, µ (k)], resulting in<br />
the following pdf:<br />
p(µl | µ (1:k)) =<br />
α<br />
K<br />
µ(k)<br />
0<br />
µ α<br />
K −1<br />
l<br />
α α<br />
t K<br />
K −1 dt<br />
= α<br />
K<br />
µ− α<br />
K<br />
(k)<br />
Integrating, the cdf <strong>for</strong> µl conditioned on µ (1:k) is found to be<br />
F (µl | µ (1:k)) =<br />
= µ<br />
µl<br />
0<br />
α<br />
− K<br />
(k)<br />
α<br />
K<br />
α<br />
K<br />
µ−<br />
(k)<br />
t α<br />
K −1 dt<br />
µ α<br />
K<br />
l I(0 ≤ µl ≤ µ (k)) + I(µ (k) < µl).<br />
µ α<br />
K −1<br />
l . (A.2)<br />
(A.3)<br />
115