25.12.2013 Views

CRANFIELD UNIVERSITY Eleni Anthippi Chatzimichali ...

CRANFIELD UNIVERSITY Eleni Anthippi Chatzimichali ...

CRANFIELD UNIVERSITY Eleni Anthippi Chatzimichali ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

TABLE OF FIGURES<br />

Figure 1-1 Evolution from molecular biology to systems biology ................................. 2<br />

Figure 1-2 Electromagnetic spectrum ............................................................................ 4<br />

Figure 1-3 Representation of possible correlations and redundancies in highdimensional<br />

data ................................................................................................... 10<br />

Figure 1-4 Extracting the Principal Components ......................................................... 11<br />

Figure 1-5 Nonlinear SVM classifier ........................................................................... 18<br />

Figure 1-6 The effect of the hyperparameter on the SVM boundaries ...................... 21<br />

Figure 1-7 k-fold Cross-Validation............................................................................... 25<br />

Figure 1-8 Leave-One-Out Cross Validation (LOOCV) .............................................. 26<br />

Figure 1-9 Model complexity and overfitting; the bias-variance trade-off .................. 28<br />

Figure 1-10 Permutation tests and the P-value ............................................................. 29<br />

Figure 2-1 Mean FTIR spectra for case study 1 in the fingerprint region (1500-1000<br />

cm -1 ) ...................................................................................................................... 35<br />

Figure 2-2 Sampling with Libra e-nose ........................................................................ 36<br />

Figure 2-3 Data intersection ......................................................................................... 38<br />

Figure 2-4 The process of constructing an ensemble of RBF SVMs optimised via<br />

bootstrapping ........................................................................................................ 42<br />

Figure 2-5 PCA score plots with 95% confidence ellipses for case study 1 ................ 46<br />

Figure 2-6 Overall accuracies (%CC) for the standalone datasets of case study 1 ...... 49<br />

Figure 2-7 Class prediction rates of the standalone (prior to PCA) datasets for case<br />

study 1 .................................................................................................................. 52<br />

Figure 2-8 Comparison of the optimisation of the hyperparameters of RBF SVMs via<br />

bootstrapping, 10-fold cross-validation and LOOCV respectively ...................... 55<br />

Figure 2-9 Three-dimensional error surface plots for the optimisation of the RBF<br />

parameters ............................................................................................................. 56<br />

Figure 3-1 Master/Slave architecture ........................................................................... 60<br />

Figure 3-2 Embarrassingly parallel problems in the analysis pipeline ......................... 61<br />

Figure 3-3 The steps of the Nelder-Mead algorithm .................................................... 64<br />

Figure 3-4 The relationship between the number of slave processors (master/slave<br />

model) and the execution times of an ensemble of PLS-DA with bootstrapping 67<br />

Figure 3-5 Step-by-step representation of the Box complex algorithm towards<br />

identifying the optimal hyperparameters and the minimum bootstrapping error<br />

(HPLC data) .......................................................................................................... 70<br />

Figure 3-6 Contour plots of the density estimation of the optimal hyperparameters as<br />

defined by the Box complex algorithm ................................................................ 71<br />

Figure 3-7 Density, filled-contour, grid and contour plots for the HPLC optimisation 72<br />

Figure 3-8 Comparison of the prediction accuracies between the grid-Search and the<br />

Box complex algorithm (HPLC data) ................................................................... 73<br />

Figure 3-9 Histograms of the number of iterations and function evaluations<br />

respectively for an ensemble of nonlinear (RBF) SVMs optimised using the Box<br />

complex algorithm. ............................................................................................... 74<br />

Figure 3-10 Comparison of the execution times for the tuning of a single RBF SVM<br />

ensemble, when optimised with different techniques, via bootstrapping ............. 76<br />

Figure 3-11 Comparison of the execution times of 100 permutation tests for the (RBF)<br />

SVMs when optimised with different techniques, via bootstrapping ................... 77<br />

viii

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!