13.07.2015 Views

View - Statistics - University of Washington

View - Statistics - University of Washington

View - Statistics - University of Washington

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

13ness <strong>of</strong> a feature) considered is viewed as specifying a possible model for the data,and the competing models are compared using Bayes factors (Kass and Raftery.1995). We approximate the Bayes factor using the Bayesian Information Criterion(BIC; Schwarz, 1978); the difference between the BIC values for two models isapproximately equal to twice the log Bayes factor when unit information priorsfor the model parameters are used (Kass and Wasserman, 1995). These are priorsthat contain about the same amount <strong>of</strong> information as a single typical observation.This approach has been found to work well for mixture models (Roeder andWasserman, 1997).The BIC for a model with K features and background noise is defined by:BIC = 2 log(L(X|θ)) − M · log(N),where M = K(DF + 2) + K + 1 is the number <strong>of</strong> parameters. The number <strong>of</strong>feature clusters is K; for each feature cluster we estimate ν j and σ j , and we fit acurve using DF degrees <strong>of</strong> freedom. The mixing proportions add K parameters,and the estimate <strong>of</strong> the image area used in the noise density is one more parameter.The larger the BIC, the more the model is favored by the data. Conventionally,differences <strong>of</strong> 2–6 between BIC values for models represent positive evidence,differences <strong>of</strong> 6–10 correspond to strong evidence, while differences greater than10 indicate very strong evidence (Kass and Raftery, 1995).2.3 Initialization2.3.1 Denoising and Initial ClusteringThe performance <strong>of</strong> the CEM-PCC algorithm can be sensitive to the startingvalue, so it is important to have a good starting value. The initial clustering usedto obtain this should accomplish two objectives: separate the feature points frombackground noise, and provide an initial clustering <strong>of</strong> the feature points. The

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!