30.01.2014 Views

Cloud Statistics from Calipso Lidar Data for the ... - espace-tum.de

Cloud Statistics from Calipso Lidar Data for the ... - espace-tum.de

Cloud Statistics from Calipso Lidar Data for the ... - espace-tum.de

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 5. <strong>Cloud</strong> free statistics 48<br />

estimates ˆα in <strong>the</strong> limit of large sample sizes n and is expressed by:<br />

<strong>for</strong> <strong>the</strong> continuous case, and by:<br />

ˆα = 1 + n[<br />

ˆα ⋍ 1 + n[<br />

n∑<br />

i=1<br />

ln x i<br />

x min<br />

] −1 (19)<br />

n∑ x i<br />

ln<br />

x min − 1/2 ]−1 (20)<br />

i=1<br />

<strong>for</strong> <strong>the</strong> discrete case, where x i , i = 1...n are <strong>the</strong> observed values of x <strong>for</strong> which x i ≥ x min .<br />

The standard error of <strong>the</strong> estimated power law exponent ˆα, is <strong>the</strong>reby given by:<br />

σ = ˆα − 1 √ n<br />

+ = O(1/n). (21)<br />

Note that all hated parameters are estimates of <strong>the</strong> respective true parameters. [Clauset<br />

et al., 2009, White et al., 2008] per<strong>for</strong>med several analyses and tested <strong>the</strong> MLE against<br />

least-square fitting routines with <strong>the</strong> finding that <strong>the</strong> MLE result in <strong>the</strong> best power law<br />

fit in <strong>the</strong> discrete and continuous case. This is based on <strong>the</strong> low statistical error of<br />

<strong>the</strong> estimator, which <strong>de</strong>cays as O(n 1/2 ). Clauset also found out that <strong>the</strong> initial x min<br />

value is crucial and in <strong>the</strong> case of an un<strong>de</strong>restimation of x < x min ˆα <strong>de</strong>viates quickly<br />

<strong>from</strong> its true value which can also be seen in Table 8. As <strong>the</strong> cloud gap data used <strong>for</strong><br />

this study is a discrete version of <strong>the</strong> probability distribution it cannot be accurately<br />

fitted with its continuous counterpart. There<strong>for</strong>e <strong>the</strong> MLE fit <strong>for</strong> a discrete power law<br />

is found by numerical optimization in which <strong>the</strong> continuous distribution is summed up<br />

to <strong>the</strong> nearest integer. To ensure <strong>the</strong> best power law fit possible, an alternative to <strong>the</strong><br />

maximum likelihood method is <strong>the</strong> Kolmogorov-Smirnov statistic. It finds <strong>the</strong> x min that<br />

best fits <strong>the</strong> starting point of <strong>the</strong> power law, by minimizing <strong>the</strong> distance between <strong>the</strong><br />

data and <strong>the</strong> fit.<br />

Kolmogorov-Smirnov distance:<br />

The Kolmogorov-Smirnov statistic simply calculates <strong>the</strong> distance between <strong>the</strong> cumulative<br />

distribution function of <strong>the</strong> estimated data with ˆx min and compares it with <strong>the</strong><br />

cumulative distribution function of <strong>the</strong> fitted mo<strong>de</strong>l.<br />

D = max<br />

x≥x min<br />

|M(x) − P (x)| (22)<br />

where D is <strong>the</strong> distance, M(x) is <strong>the</strong> CDF of <strong>the</strong> observation data with at least one<br />

value x min and P (x) is <strong>the</strong> respective CDF of <strong>the</strong> power law best fitting <strong>the</strong> data in <strong>the</strong><br />

region x ≥ x min . The optimal ˆx min minimizes D.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!