05.08.2013 Views

New Approaches to in silico Design of Epitope-Based Vaccines

New Approaches to in silico Design of Epitope-Based Vaccines

New Approaches to in silico Design of Epitope-Based Vaccines

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

4.1. INTRODUCTION 31<br />

Figure 4.2: Schematic representation <strong>of</strong> the modular structure <strong>of</strong> an MHC b<strong>in</strong>d<strong>in</strong>g<br />

groove. A number <strong>of</strong> sequence compositions exist for each pocket. These pocket variants, represented<br />

by different shapes, are shared across different allelic variants <strong>of</strong> MHC molecules.<br />

variants constitut<strong>in</strong>g the b<strong>in</strong>d<strong>in</strong>g groove. In 2007, DeLuca et al. [79] generalized this<br />

approach and applied it <strong>to</strong> MHC-I. Us<strong>in</strong>g the pocket def<strong>in</strong>ition <strong>of</strong> Chelvanayagam [80] <strong>in</strong><br />

comb<strong>in</strong>ation with a modular matrix approach, DeLuca et al. were able <strong>to</strong> <strong>in</strong>crease the<br />

number <strong>of</strong> predictable MHC-I alleles by a fac<strong>to</strong>r <strong>of</strong> five. Nevertheless, for the vast majority<br />

<strong>of</strong> known MHC-I alleles the modular matrices were <strong>in</strong>complete. These alleles could not be<br />

covered by this approach. Also <strong>in</strong>spired by the work <strong>of</strong> Sturniolo et al., <strong>in</strong> Section 4.3 we<br />

propose a more general approach, which for the first time allows predictions for all known<br />

MHC-I alleles, <strong>in</strong>dependent <strong>of</strong> the availability <strong>of</strong> experimental b<strong>in</strong>d<strong>in</strong>g data. Our approach<br />

performs better than exist<strong>in</strong>g allele-specific methods on alleles <strong>in</strong>cluded <strong>in</strong> the tra<strong>in</strong><strong>in</strong>g<br />

set. On alleles not <strong>in</strong>cluded <strong>in</strong> the tra<strong>in</strong><strong>in</strong>g set it achieves remarkable results, which are<br />

comparable <strong>to</strong> those <strong>of</strong> allele-specific methods. However, NetMHCpan, a related approach<br />

developed <strong>in</strong>dependently by Nielsen et al. [20], yields even better results.<br />

As discussed above, MHC b<strong>in</strong>d<strong>in</strong>g is only a necessary prerequisite for T-cell reactivity.<br />

The ability <strong>of</strong> a peptide <strong>to</strong> trigger a T-cell response, i.e., the peptide’s immunogenicity,<br />

depends on the host immune system, more specifically on the MHC alleles expressed and<br />

the T-cell reper<strong>to</strong>ire. Given a suitable MHC molecule that presents the peptide <strong>of</strong> <strong>in</strong>terest<br />

on the cell surface, <strong>in</strong>duction <strong>of</strong> an immune response requires recognition <strong>of</strong> the pMHC<br />

complex by a host T cell. The mechanisms govern<strong>in</strong>g T-cell recognition <strong>of</strong> pMHC complexes<br />

are not yet fully unders<strong>to</strong>od and therefore difficult <strong>to</strong> model. Furthermore, the<br />

availability <strong>of</strong> a suitable TCR is determ<strong>in</strong>ed by positive and negative selection <strong>of</strong> T cells<br />

and thus by the host’s proteome. These complex dependencies and the <strong>in</strong>complete biological<br />

knowledge make the prediction <strong>of</strong> T-cell epi<strong>to</strong>pes a very challeng<strong>in</strong>g problem. Only<br />

few classification approaches have been published. They range from early methods based<br />

on the amphipathic helix model <strong>of</strong> T-cell epi<strong>to</strong>pes [81] <strong>to</strong> sequence-based mach<strong>in</strong>e learn<strong>in</strong>g<br />

methods [21, 22]. Bhas<strong>in</strong> and Raghava [21] proposed a consensus approach us<strong>in</strong>g artificial<br />

neural networks and SVMs on sparse-encoded peptides. Tung et al. [22] skillfully selected<br />

23 physicochemical AA properties from the Am<strong>in</strong>o Acid Index database (AAIndex) [82].<br />

<strong>Based</strong> on this 23-dimensional AA encod<strong>in</strong>g, the authors tra<strong>in</strong>ed an SVC with Gaussian<br />

RBF kernel. Recently, Tung et al. have proposed a str<strong>in</strong>g-kernel-based SVM approach <strong>to</strong>

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!