22.03.2013 Views

Final Technical Report: - Southwest Fisheries Science Center - NOAA

Final Technical Report: - Southwest Fisheries Science Center - NOAA

Final Technical Report: - Southwest Fisheries Science Center - NOAA

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

“number of individuals” as the response variable and 2) deriving density from a two-step process<br />

in which the probability of a species being present in a given habitat is multiplied by the<br />

expected number of individuals given favorable habitat. The primary reason we decided to use<br />

separate models to predict encounter rate and group size is that this approach breaks the process<br />

down into ecologically meaningful quanta: differences in distribution may arise from variability<br />

in group size or number of groups in a given region, with potentially different environmental<br />

factors affecting the variability in each model. The two-step process of computing the<br />

probability of presence and then multiplying by the expected number of individuals does not<br />

have this flexibility because environmental effects on encounter rate and group size are<br />

confounded in a single model.<br />

GAMs are commonly used to relate characteristics of a species, such as distribution or<br />

abundance, to environmental characteristics. A GAM may be represented as<br />

(Hastie and Tibshirani 1990). The function g(μ) is known as the link function, and it relates the<br />

mean of the response variable given the predictor variables =E(Y|X1,…,Xp) to the additive<br />

predictor jfj(Xj). GAMs are nonparametric extensions of generalized linear models (GLMs).<br />

The components fj(Xj) in the additive predictor of a GAM may include nonparametric smooth<br />

functions of the predictor variables, whereas a GLM is composed of a linear predictor, jjXj,<br />

in which the terms j are constants. This difference between the additive and linear predictor<br />

allows GAMs to be more flexible than GLMs.<br />

Model Comparison Analysis<br />

When working with ecological data, it is often difficult to distinguish meaningful signals<br />

from noise arising from the unexplainable variability and complex interactions inherent in<br />

ecological systems. Even in the absence of noise, relationships among ecological variables<br />

rarely can be explained by simple mathematical equations. Working within the framework of<br />

generalized additive models may be useful for analyzing ecological data because the<br />

nonparametric model structure of GAMs provides flexibility in model building and fitting, often<br />

allowing GAMs to exhibit more fidelity to the data than alternative model structures.<br />

Nevertheless, there are disadvantages to GAMs. For example, if appropriate model building and<br />

selection methods are not used, the resulting GAM may overfit the data, reliably reproducing the<br />

data upon which the model was built at the cost of sacrificing accuracy when predicting on novel<br />

data. In addition, GAMs may be difficult to interpret because they cannot always be defined by<br />

a simple formula comprised of a constant coefficient tied to each explanatory variable that<br />

indicates the strength, magnitude, and direction of the covariate’s effect on the response variable.<br />

16

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!