To the Graduate Council: I am submitting herewith a thesis written by ...
To the Graduate Council: I am submitting herewith a thesis written by ...
To the Graduate Council: I am submitting herewith a thesis written by ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Chapter 4: Algorithm Overview 54smoothing as applied to uni-variate and multi-variate datasets. Papers that discuss <strong>the</strong>information bound bandwidth selection methods are [Wu and Lin, 1996] and [Jones etal., 1996].The bandwidth selection for <strong>the</strong> process of density estimation is important to assert <strong>the</strong>accuracy of <strong>the</strong> density estimate. The choice of <strong>the</strong> bandwidth at least <strong>the</strong>oretically canbe derived to minimize <strong>the</strong> mean integrated square error between <strong>the</strong> actual densityand <strong>the</strong> computed density. Some methods that are used for this purpose are Distribution Scale Methods, Cross Validation Methods, Plug-In Methods, and Bootstrap methods.In <strong>the</strong> next few paragraphs, we will very briefly discuss <strong>the</strong> rationale behind <strong>the</strong>seobjective methods for bandwidth selection. Assume that f is <strong>the</strong> actual density of <strong>the</strong>data and fˆ is <strong>the</strong> estimated density. The process of bandwidth selection is aimed atminimizing <strong>the</strong> integrated mean square error between <strong>the</strong> actual and <strong>the</strong> estimateddensity. The integrated mean square error is defined as <strong>the</strong> expected value of <strong>the</strong>integrated square error and is given <strong>by</strong> Equation 4.13.MSE{f ( x;h )}122= n− {( K h * f )( x ) − ( K h * f ) ( x )} + {( K h * f )( x ) − f ( x )} . (4.13)The Mean Integrated Square Error ( MISE ) is <strong>the</strong> integral of <strong>the</strong> mean squared errorthat can be simplified as shown below.2MISE{MISE{ fˆ ( x,h )}12=MSE{ fˆ ( x;h )}dx = E{fˆ ( x;h ) − f ( x )} dx22{(Kh* f )( x ) −(Kh* f ) ( x )}dx +{(Kh* f )( x ) −fˆ (.;h )} = n− f ( x )} dx = −− + − − + 2 (4.14)MISE is <strong>the</strong> sum of <strong>the</strong> integrated square bias and <strong>the</strong> integrated variance and henceminimization of that error is effectively <strong>the</strong> tradeoff between <strong>the</strong> bias and variance.The closed form solution that is derived for <strong>the</strong> optimal bandwidth <strong>by</strong> minimizing <strong>the</strong>MISE is <strong>the</strong> h opt in Equation 4.15.h opt= n(2K(z)dz 222z K(z)dz)( f "( x)dx)1/ 5(4.15)