New Approaches to in silico Design of Epitope-Based Vaccines
New Approaches to in silico Design of Epitope-Based Vaccines
New Approaches to in silico Design of Epitope-Based Vaccines
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Chapter 8<br />
Discussion & Conclusion<br />
Parts <strong>of</strong> this chapter have previously been published [8].<br />
Traditional trial-and-error based approaches <strong>to</strong> vacc<strong>in</strong>e design have been remarkably<br />
successful. One <strong>of</strong> the major successes was the eradication <strong>of</strong> smallpox <strong>in</strong> the 1970s.<br />
However, there are still many diseases for which no viable vacc<strong>in</strong>e could be found, HIV<br />
<strong>in</strong>fection and cancer be<strong>in</strong>g among the most prom<strong>in</strong>ent examples. Here, new, rationally<br />
designed types <strong>of</strong> vacc<strong>in</strong>es, as, e.g., EVs, are promis<strong>in</strong>g alternatives.<br />
The process <strong>of</strong> design<strong>in</strong>g an EV can roughly be divided <strong>in</strong><strong>to</strong> three steps: epi<strong>to</strong>pe discovery,<br />
epi<strong>to</strong>pe selection and epi<strong>to</strong>pe assembly. In <strong>silico</strong> epi<strong>to</strong>pe discovery has the potential<br />
<strong>to</strong> drastically reduce the number <strong>of</strong> biological experiments that have <strong>to</strong> be performed. A<br />
key step <strong>in</strong> <strong>in</strong> <strong>silico</strong> approaches <strong>to</strong> epi<strong>to</strong>pe discovery is the prediction <strong>of</strong> MHC-b<strong>in</strong>d<strong>in</strong>g peptides.<br />
This problem has been tackled with the full range <strong>of</strong> possible mathematical methods.<br />
Mostly mach<strong>in</strong>e learn<strong>in</strong>g methods have proven useful. However, the prediction <strong>of</strong> b<strong>in</strong>ders<br />
for MHC alleles with little or no experimental data is problematic. In Sections 4.2 and 4.3<br />
we present two approaches <strong>to</strong> overcome the problem <strong>of</strong> scarcity <strong>of</strong> data. Our first approach<br />
improves the predictive power <strong>of</strong> SVMs for alleles with little experimental b<strong>in</strong>d<strong>in</strong>g data by<br />
comb<strong>in</strong><strong>in</strong>g the benefits <strong>of</strong> str<strong>in</strong>g kernels with the ones <strong>of</strong> physicochemical descrip<strong>to</strong>rs for<br />
AAs. Our second approach, UniTope, for the first time allowed predictions for all MHC-I<br />
alleles. This was achieved by employ<strong>in</strong>g all available MHC-I b<strong>in</strong>d<strong>in</strong>g data for the prediction<br />
<strong>of</strong> an <strong>in</strong>dividual allele’s b<strong>in</strong>d<strong>in</strong>g specificity. The kernels developed <strong>in</strong> our first approach<br />
are particularly useful when data is scarce. Next <strong>to</strong> allele-specific MHC b<strong>in</strong>d<strong>in</strong>g prediction<br />
they promise <strong>to</strong> be beneficial <strong>in</strong>, e.g., the prediction <strong>of</strong> substrate specificities with<strong>in</strong> enzyme<br />
families [151]. Use <strong>of</strong> these kernels <strong>in</strong> UniTope (Section 4.3), however, will not yield further<br />
improvements: Due <strong>to</strong> the pool<strong>in</strong>g <strong>of</strong> all available b<strong>in</strong>d<strong>in</strong>g data, tra<strong>in</strong><strong>in</strong>g data is far from<br />
be<strong>in</strong>g scarce <strong>in</strong> this sett<strong>in</strong>g.<br />
Pan-specific approaches like UniTope have contributed significantly <strong>to</strong> the advancement<br />
<strong>of</strong> <strong>in</strong> <strong>silico</strong> epi<strong>to</strong>pe discovery. Nevertheless, it has <strong>to</strong> be noted that there is a drawback<br />
<strong>to</strong> current approaches: for a considerable fraction <strong>of</strong> alleles pan-specific models perform<br />
worse than allele-specific models, i.e., <strong>in</strong>clusion <strong>of</strong> knowledge on other alleles’ b<strong>in</strong>d<strong>in</strong>g<br />
specificities is disadvantageous. Here, models tra<strong>in</strong>ed for a specific allele <strong>in</strong>corporat<strong>in</strong>g a<br />
limited number <strong>of</strong> examples from highly related other alleles [152] show promise. However,<br />
this approach br<strong>in</strong>gs about another drawback: when confronted with a new MHC allele a<br />
93