12.07.2015 Views

Stat 5101 Lecture Notes - School of Statistics

Stat 5101 Lecture Notes - School of Statistics

Stat 5101 Lecture Notes - School of Statistics

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

154 <strong>Stat</strong> <strong>5101</strong> (Geyer) Course <strong>Notes</strong>because X i X j = 0 with probability one.Hence var(X) =Mhas components{p i (1 − p i ),m ij =−p i p ji = ji ≠ j(5.37)We can also write this using more matrixy notation by introducing the diagonalmatrix P having diagonal elements p i and noting that the “outer product” pp ′has elements p i p j , hencevar(X) =P−pp ′Question: Is var(X) positive definite? This is <strong>of</strong> course related to the question<strong>of</strong> whether X is degenerate. We haven’t said anything explicit about either,but the information needed to answer these questions is in the text above. Itshould be obvious if you know what to look for (a good exercise testing yourunderstanding <strong>of</strong> degenerate random vectors).5.4 The Multinomial DistributionThe multinomial distribution is the multivariate analog <strong>of</strong> the binomial distribution.It is sort <strong>of</strong>, but not quite, the multivariate generalization, that is, thebinomial distribution is sort <strong>of</strong>, but not precisely, a special case <strong>of</strong> the multinomialdistribution. Thus is unlike the normal, where the univariate normaldistribution is precisely the one-dimensional case <strong>of</strong> the multivariate normal.Suppose X 1 , X 2 are an i. i. d. sequence <strong>of</strong> Ber k (p) random vectors (caution:the subscripts on the X i indicate elements <strong>of</strong> an infinite sequence <strong>of</strong> i. i. d.random vectors, not components <strong>of</strong> one vector). ThenY = X 1 + ···+X nhas the multinomial distribution with sample size n and dimension k, abbreviatedY ∼ Multi k (n, p)if we want to indicate the dimension in the notation or just Y ∼ Multi(n, p) ifthe dimension is clear from the context.Note the dimension is k, not n, that is, both Y and p are vectors <strong>of</strong> dimensionk.5.4.1 Categorical Random VariablesRecall that a multinomial random vector is the sum <strong>of</strong> i. i. d. BernoullisY = X 1 + · + X nand that each Bernoulli is related to a categorical random variable: X i,j =1ifand only if the i-th observation fell in the j-th category. Thus Y j = ∑ i X i,j isthe number <strong>of</strong> individuals that fell in the j-th category.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!