27.01.2014 Views

Entropy Inference and the James-Stein Estimator, with Application to ...

Entropy Inference and the James-Stein Estimator, with Application to ...

Entropy Inference and the James-Stein Estimator, with Application to ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

ENTROPY INFERENCE AND THE JAMES-STEIN ESTIMATOR<br />

is a weighted average of <strong>the</strong>se two estima<strong>to</strong>rs, where <strong>the</strong> weight is chosen in a data-driven fashion<br />

such that ˆθ Shrink is improved in terms of mean squared error relative <strong>to</strong> both ˆθ <strong>and</strong> ˆθ Target .<br />

A key advantage of <strong>James</strong>-<strong>Stein</strong>-type shrinkage is that <strong>the</strong> optimal shrinkage intensity λ ⋆ can be<br />

calculated analytically <strong>and</strong> <strong>with</strong>out knowing <strong>the</strong> true value θ, via<br />

λ ⋆ = ∑p k=1 Var(ˆθ k ) − Cov(ˆθ k , ˆθ Target<br />

k<br />

)+Bias(ˆθ k )E(ˆθ k − ˆθ Target<br />

∑ p k=1 E[(ˆθ k − ˆθ Target<br />

k<br />

) 2 ]<br />

k<br />

)<br />

. (10)<br />

A simple estimate of λ ⋆ is obtained by replacing all variances <strong>and</strong> covariances in Equation 10 <strong>with</strong><br />

<strong>the</strong>ir empirical counterparts, followed by truncation of ˆλ ⋆ at 1 (so that ˆλ ⋆ ≤ 1 always holds).<br />

Equation 10 is discussed in detail in Schäfer <strong>and</strong> Strimmer (2005b) <strong>and</strong> Opgen-Rhein <strong>and</strong> Strimmer<br />

(2007a). More specialized versions of it are treated, for example, in Ledoit <strong>and</strong> Wolf (2003) for<br />

unbiased ˆθ <strong>and</strong> in Thompson (1968) (unbiased, univariate case <strong>with</strong> deterministic target). A very<br />

early version (univariate <strong>with</strong> zero target) even predates <strong>the</strong> estima<strong>to</strong>r of <strong>James</strong> <strong>and</strong> <strong>Stein</strong>, see Goodman<br />

(1953). For <strong>the</strong> multinormal setting of <strong>James</strong> <strong>and</strong> <strong>Stein</strong> (1961), Equation 9 <strong>and</strong> Equation 10<br />

reduce <strong>to</strong> <strong>the</strong> shrinkage estima<strong>to</strong>r described in Stigler (1990).<br />

<strong>James</strong>-<strong>Stein</strong> shrinkage has an empirical Bayes interpretation (Efron <strong>and</strong> Morris, 1973). Note,<br />

however, that only <strong>the</strong> first two moments of <strong>the</strong> distributions of ˆθ Target <strong>and</strong> ˆθ need <strong>to</strong> be specified in<br />

Equation 10. Hence, <strong>James</strong>-<strong>Stein</strong> estimation may be viewed as a quasi-empirical Bayes approach<br />

(in <strong>the</strong> same sense as in quasi-likelihood, which also requires only <strong>the</strong> first two moments).<br />

Appendix B. Computer Implementation<br />

The proposed shrinkage estima<strong>to</strong>rs of entropy <strong>and</strong> mutual information, as well as all o<strong>the</strong>r investigated<br />

entropy estima<strong>to</strong>rs, have been implemented in R (R Development Core Team, 2008). A<br />

corresponding R package “entropy” was deposited in <strong>the</strong> R archive CRAN <strong>and</strong> is accessible at <strong>the</strong><br />

URL http://cran.r-project.org/web/packages/entropy/ under <strong>the</strong> GNU General Public<br />

License.<br />

References<br />

A. Agresti <strong>and</strong> D. B. Hitchcock. Bayesian inference for categorical data analysis. Statist. Meth.<br />

Appl., 14:297–330, 2005.<br />

A. J. Butte, P. Tamayo, D. Slonim, T. R. Golub, <strong>and</strong> I. S. Kohane. Discovering functional relationships<br />

between RNA expression <strong>and</strong> chemo<strong>the</strong>rapeutic susceptibility using relevance networks.<br />

Proc. Natl. Acad. Sci. USA, 97:12182–12186, 2000.<br />

A. Chao <strong>and</strong> T.-J. Shen. Nonparametric estimation of Shannon’s index of diversity when <strong>the</strong>re are<br />

unseen species. Environ. Ecol. Stat., 10:429–443, 2003.<br />

A. Dobra, C. Hans, B. Jones, J. R. Nevins, G. Yao, <strong>and</strong> M. West. Sparse graphical models for<br />

exploring gene expression data. J. Multiv. Anal., 90:196–212, 2004.<br />

B. Efron <strong>and</strong> C. N. Morris. <strong>Stein</strong>’s estimation rule <strong>and</strong> its competi<strong>to</strong>rs–an empirical Bayes approach.<br />

J. Amer. Statist. Assoc., 68:117–130, 1973.<br />

1481

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!