03.12.2015 Views

bbc 2015

BBC2015_booklet

BBC2015_booklet

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

BeNeLux Bioinformatics Conference – Antwerp, December 7-8 <strong>2015</strong><br />

Abstract ID: P<br />

Poster<br />

10th Benelux Bioinformatics Conference <strong>bbc</strong> <strong>2015</strong><br />

P1. KNN-MDR APPROACH FOR DETECTING GENE-GENE<br />

INTERACTIONS<br />

Sinan Abo alchamlat 1 & Frédéric Farnir 1,* .<br />

Fundamental and Applied Research for Animals & Health (FARAH), Sustainable Animal Production, University of<br />

Liège 1 . * f.farnir@ulg.ac.be<br />

These last years have seen the emergence of a wealth of biological information. Facilitated access to the genome<br />

sequence, along with massive data on genes expression and on proteins have revolutionized the research in many fields<br />

of biology. For example, the identification of up to several millions SNPs in many species and the development of chips<br />

allowing for an effective genotyping of these SNPs in large cohorts have triggered the need for statistical models able to<br />

identify the effects of individual and of interacting SNPs on phenotypic traits in this new high-dimensional landscape.<br />

Our work is a contribution to this field...............................................................................................................<br />

INTRODUCTION<br />

GWAS has allowed the identification of hundreds of<br />

genetic variants associated to complex diseases and traits,<br />

and provided valuable information into their genetic<br />

architecture (Wu M et al., 2010). Nevertheless, most<br />

variants identified so far have been found to confer<br />

relatively small information about the relationship<br />

between changes at the genomic level and phenotypes<br />

because of the lack of reproducibility of the findings, or<br />

because these variants most of the time explain only a<br />

small proportion of the underlying genetic variation (Fang<br />

G et al., 2012). This observation, quoted as the ‘missing<br />

heritability’ problem (Manolio T et al., 2009) of course<br />

raises the question: where does the unexplained genetic<br />

variation come from? A tentative explanation is that genes<br />

do not work in isolation, leading to the idea that sets of<br />

genes (or genes networks) could have a major effect on the<br />

tested traits while almost no marginal – i.e. individual<br />

gene – effect is detectable. Consequently, an important<br />

question concerns the exact relationship between the<br />

genomic configuration, including the interactions between<br />

the involved genes, and the phenotypic expression.<br />

METHODS<br />

To tackle this subject, different statistical methods such as<br />

MDR (Multi Dimensional Reduction) have been proposed<br />

for detecting gene-gene interaction (Ritchie, D., et al.,<br />

2001); their relative performances remain largely unclear,<br />

and their extension to situations combining many variants<br />

turns out to be challenging. So we propose a novel MDR<br />

approach using K-Nearest Neighbors (KNN) methodology<br />

(KNN-MDR) for detecting gene-gene interaction as a<br />

possible alternative, especially when the number of<br />

involved determinants is potentially high. The idea behind<br />

our method is to replace the status allocation used in<br />

classical MDR methods by a KNN approach: the majority<br />

vote occurs in the k (a parameter that must be tuned and<br />

depends on the various possible scenarios) nearest<br />

neighbors instead of within the (potentially empty) cell<br />

determined by the tested attributes of the individual to be<br />

classified. The steps other than classification are identical<br />

in both methods (i.e. cross-validation, attributes selection,<br />

training and tests balanced accuracy computations, best<br />

model selection procedure).<br />

RESULTS & DISCUSSION<br />

Experimental results on both simulated data and real<br />

genome-wide data from Wellcome Trust Case Control<br />

Consortium (WTCCC) (Wellcome Trust Case Control C.,<br />

2007) show that KNN-MDR has interesting properties in<br />

terms of accuracy and power, and that, in many cases, it<br />

significantly outperforms its recent competitors.<br />

FIGURE 1. Comparison of the inter-chromosomal interactions detected<br />

on the WTCCC dataset by KNN-MDR and other interaction methods<br />

using this same dataset as example (Shchetynsky et al. (<strong>2015</strong>); Zhang et<br />

al. (2012))<br />

The results of this study allow us to draw some<br />

conclusions about the performance of KNN-MDR: on the<br />

one hand, the performance of the KNN-MDR method to<br />

detect gene-gene interactions are similar to the<br />

performance of MDR for small problems. On the other<br />

hand, KNN-MDR has significant advantages in large<br />

samples and large number of markers (such as GWAS) to<br />

detect the existence of genes effect. So KNN-MDR can be<br />

seen as a new and more comprehensive method than MDR<br />

and other competitors for detecting gene-gene interaction.<br />

REFERENCES<br />

Wu M et al. American journal of human genetics 86, 929-942 (2010).<br />

Fang G et al. PloS one 7, 1932-6203 (2012).<br />

Manolio T et al. Nature 461, 747-753 (2009).<br />

Ritchie, D., et al. Am J Hum Genet,69, 138-147 (2001).<br />

Wellcome Trust Case Control C. Nature, 447(7145):661-678 (2007).<br />

Shchetynsky K et al. Clinical immunology 158(1):19-28 (<strong>2015</strong>).<br />

Zhang J et al. American Medical Journal 3(1) (<strong>2015</strong>).<br />

45

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!