10.07.2015 Views

To the Graduate Council: I am submitting herewith a thesis written by ...

To the Graduate Council: I am submitting herewith a thesis written by ...

To the Graduate Council: I am submitting herewith a thesis written by ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 4: Algorithm Overview 54smoothing as applied to uni-variate and multi-variate datasets. Papers that discuss <strong>the</strong>information bound bandwidth selection methods are [Wu and Lin, 1996] and [Jones etal., 1996].The bandwidth selection for <strong>the</strong> process of density estimation is important to assert <strong>the</strong>accuracy of <strong>the</strong> density estimate. The choice of <strong>the</strong> bandwidth at least <strong>the</strong>oretically canbe derived to minimize <strong>the</strong> mean integrated square error between <strong>the</strong> actual densityand <strong>the</strong> computed density. Some methods that are used for this purpose are Distribution Scale Methods, Cross Validation Methods, Plug-In Methods, and Bootstrap methods.In <strong>the</strong> next few paragraphs, we will very briefly discuss <strong>the</strong> rationale behind <strong>the</strong>seobjective methods for bandwidth selection. Assume that f is <strong>the</strong> actual density of <strong>the</strong>data and fˆ is <strong>the</strong> estimated density. The process of bandwidth selection is aimed atminimizing <strong>the</strong> integrated mean square error between <strong>the</strong> actual and <strong>the</strong> estimateddensity. The integrated mean square error is defined as <strong>the</strong> expected value of <strong>the</strong>integrated square error and is given <strong>by</strong> Equation 4.13.MSE{f ( x;h )}122= n− {( K h * f )( x ) − ( K h * f ) ( x )} + {( K h * f )( x ) − f ( x )} . (4.13)The Mean Integrated Square Error ( MISE ) is <strong>the</strong> integral of <strong>the</strong> mean squared errorthat can be simplified as shown below.2MISE{MISE{ fˆ ( x,h )}12=MSE{ fˆ ( x;h )}dx = E{fˆ ( x;h ) − f ( x )} dx22{(Kh* f )( x ) −(Kh* f ) ( x )}dx +{(Kh* f )( x ) −fˆ (.;h )} = n− f ( x )} dx = −− + − − + 2 (4.14)MISE is <strong>the</strong> sum of <strong>the</strong> integrated square bias and <strong>the</strong> integrated variance and henceminimization of that error is effectively <strong>the</strong> tradeoff between <strong>the</strong> bias and variance.The closed form solution that is derived for <strong>the</strong> optimal bandwidth <strong>by</strong> minimizing <strong>the</strong>MISE is <strong>the</strong> h opt in Equation 4.15.h opt= n(2K(z)dz 222z K(z)dz)( f "( x)dx)1/ 5(4.15)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!