29.01.2015 Views

Analysing spatial point patterns in R - CSIRO

Analysing spatial point patterns in R - CSIRO

Analysing spatial point patterns in R - CSIRO

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

150 Adjust<strong>in</strong>g for <strong>in</strong>homogeneity<br />

The denom<strong>in</strong>ator D <strong>in</strong> (30) is either the area of the w<strong>in</strong>dow D 1 = area(W), or<br />

D 2 = ∑ i<br />

1<br />

̂λ(x i )<br />

which is an unbiased estimator of area(W) if the <strong>in</strong>tensity is correctly estimated. The denom<strong>in</strong>ator<br />

D 2 is often preferred on grounds of statistical performance, because it <strong>in</strong>troduces a<br />

data-dependent normalisation.<br />

The <strong>in</strong>homogeneous K function is computed by the command K<strong>in</strong>hom(X, lambda) where<br />

X is the <strong>po<strong>in</strong>t</strong> pattern and lambda is the estimated <strong>in</strong>tensity function. Here lambda may be a<br />

pixel image, a function(x,y) <strong>in</strong> the R language, a numeric vector giv<strong>in</strong>g the values ̂λ(x i ) at<br />

the data <strong>po<strong>in</strong>t</strong>s x i only, or it may be omitted (and will then be estimated from X). By default,<br />

the data-dependent denom<strong>in</strong>ator D 2 is used.<br />

There rema<strong>in</strong>s the question of how to estimate the <strong>in</strong>tensity function λ(u). It is usually<br />

advisable to obta<strong>in</strong> the <strong>in</strong>tensity estimate ̂λ(u) by fitt<strong>in</strong>g a parametric model, to avoid overfitt<strong>in</strong>g.<br />

Here is an example for the tropical ra<strong>in</strong>forest data, us<strong>in</strong>g the covariate data to suggest a model<br />

for the <strong>in</strong>tensity.<br />

> data(bei)<br />

> fit lam Ki plot(Ki, ma<strong>in</strong> = "Inhomogeneous K function")<br />

Inhomogeneous K function<br />

K(r)<br />

0 10000 20000 30000 40000 50000<br />

bord.modif<br />

border<br />

theo<br />

0 20 40 60 80 100 120<br />

The plot suggests that, even after account<strong>in</strong>g for dependence on altitude and slope, the trees<br />

still appear to be clustered.<br />

The <strong>in</strong>tensity function λ(u) could also be estimated by kernel smooth<strong>in</strong>g the <strong>po<strong>in</strong>t</strong> pattern<br />

data. However, notice that the estimator (30) of the <strong>in</strong>homogeneous K function depends on<br />

the estimated <strong>in</strong>tensity values at the data <strong>po<strong>in</strong>t</strong>s, ̂λ(x i ). These are positively biased estimates<br />

of the true values λ(x i ). In order to avoid bias, the value ̂λ(x i ) should be estimated by kernel<br />

smooth<strong>in</strong>g of the <strong>po<strong>in</strong>t</strong> pattern with the <strong>po<strong>in</strong>t</strong> x i deleted. This “leave-one-out” estimator is<br />

implemented <strong>in</strong> K<strong>in</strong>hom and is <strong>in</strong>voked when the argument lambda is not given:<br />

> Ki2 plot(Ki2, ma<strong>in</strong> = "K<strong>in</strong>hom us<strong>in</strong>g leave-one-out")<br />

Copyright<strong>CSIRO</strong> 2010

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!