Identifying Speculative Bubbles with an Infinite Hidden Markov Model

More documents

Recommendations

Info

∑where n i is the number of occurrences of i in a sample of n = K n i points from the discretedistribution on {1, · · · , K} defined by X. Then,i=1X | β = (n 1 , . . . , n K ) ∼ Dir(α + β).This relationship is used in Bayesian statistics to estimate the hidden parameters X, given acollection of n samples. Intuitively, if the prior is represented as Dir(α), then Dir(α + β) isthe posterior following a sequence of observations with histogram β.The Dirichlet process was introduced by Ferguson (1973) as the extension of the Dirichletdistribution from finite dimensions to infinite dimensions. It is a distribution of distributionsand has two parameters: the shape parameter G 0 is a distribution over a sample space Ω andthe concentration parameter α 0 is a positive scalar. They have similar interpretation as theircounterparts in the Dirichlet distribution. The formal definition is the following:Definition The Dirichlet process over a set Ω is a stochastic process whose sample path is aprobability distribution over Ω. For a random distribution F distributed according to a Dirichletprocess DP(α 0 , G 0 ), given any finite measurable partition A 1 , A 2 , · · · , A K of the samplespace Ω, the random vector (F (A 1 ), · · · , F (A K )) is distributed as a Dirichlet distribution withparameters (α 0 G 0 (A 1 ), · · · , α 0 G 0 (A K )).Use the results form the Dirichlet distribution, for any measurable set A, the randomvariable F (A) has mean G 0 (A) and variance G 0(A)(1−G 0 (A))α 0 +1. The mean implies the shapeparameter G 0 represents the center of a random distribution F drawn from a Dirichlet processDP(α 0 , G 0 ). Define a i ∼ F as an observation drawn from the distribution F . Because bydefinition P (a i ∈ A | F ) = F (A), we can derive P (a i ∈ A | G 0 ) = E(P (a i ∈ A | F ) | G 0 ) =E(F (A) | G 0 ) = G 0 (A). Hence, the shape parameter G 0 is also the marginal distribution ofan observation a i . The variance implies the concentration parameter α 0 controls how close therandom distribution F is to the shape parameter G 0 . The larger α 0 is, the more likely F isclose to G 0 , and vice versa.26
Suppose there are n observations, a = (a 1 , · · · , a n ), drawn from the distribution F . Usen∑δ ai (A j ) toi=1represent the number of a i in set A j , where A 1 , · · · , A K is a measurable partition of thesample space Ω and δ ai (A j ) is the Dirac measure, where⎧⎪⎨ 1 if a i ∈ A jδ ai (A j ) =.⎪⎩ 0 if a i /∈ A j( n∑)n∑Conditional on (F (A 1 ), · · · , F (A K )), the vector δ ai (A 1 ), · · · , δ ai (A K ) has a multinominaldistribution. By the conjugacy of Dirichlet distribution to the multi-nominali=1i=1distribution,the posterior distribution of (F (A 1 ), · · · , F (A K )) is still a Dirichlet distribution(F (A 1 ), · · · , F (A K )) | a ∼ Dir(α 0 G 0 (A 1 ) +n∑δ ai (A 1 ), · · · , α 0 G 0 (A K ) +i=1)n∑δ ai (A K )Because this result is valid for any finite measurable partition, the posterior of F is still Dirichletprocess by definition, with new parameters α ∗ 0 and G∗ 0 , wherei=1α ∗ 0 = α 0 + nG ∗ 0 = α 0α 0 + n G 0 +nα 0 + nThe posterior shape parameter, G ∗ 0 , is the mixture of the prior and the empirical distributionimplied by observations. As n → ∞, the shape parameter of the posterior converges tothe empirical distribution. The concentration parameter α ∗ 0n∑i=1δ ain→ ∞ implies the posterior of Fconverges to the empirical distribution with probability one. Ferguson (1973) showed that arandom distribution drawn from a Dirichlet process is almost sure discrete, although the shapeparameter G 0 can be continuous.27
Page 2 and 3: 1 IntroductionBubbles, which are re
Page 4 and 5: involves nonstationary (especially
Page 6 and 7: 2 Infinite Hidden Markov ModelThe i
Page 8 and 9: This methodology is less subjective
Page 10 and 11: efficient than the individual sampl
Page 12: 3.2 Dating Algorithm of BubblesThe
Page 15 and 16: (money growth) is consistent with t
Page 18 and 19: Raftery’s (1995) criterion.5 Empi
Page 20 and 21: mean E (β st |Y ) from the MS2 mod
Page 22 and 23: Ferguson. A bayesian analysis of so
Page 24 and 25: Y.W. Teh, M.I. Jordan, M.J. Beal, a
Page 28 and 29: A.2 Stick breaking processFor a ran
Page 30 and 31: where n ji is the number of {τ | s

Identifying Speculative Bubbles with an Infinite Hidden Markov Model

Create successful ePaper yourself

Delete template?

Save as template?