04.04.2013 Views

Transcriptional Characterization of Glioma Neural Stem Cells Diva ...

Transcriptional Characterization of Glioma Neural Stem Cells Diva ...

Transcriptional Characterization of Glioma Neural Stem Cells Diva ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

6.2 Tag mapping Results<br />

Figure 6.2: Step by step diagram <strong>of</strong> the extraction, filtering and mapping phases<br />

for reads and tags. The extracted tag population is coloured with different shades <strong>of</strong><br />

grey that represent a different origin (adapter, mitochondrial or ribosomal tag). In<br />

the mapping phase the diagram to the right represents the human reference genome<br />

and some <strong>of</strong> the filtered tags (light grey) are mapping onto some known transcripts<br />

(gene A, gene B) and non-coding regions.<br />

the reference genome (adapter tags formed by the ligation <strong>of</strong> two sequencing<br />

primers, and mitochondrial RNA), as well as rRNA sequences, were filtered<br />

out. As shown in figure 6.3, on average more than 90% <strong>of</strong> tags remained un-<br />

filtered, meaning that most <strong>of</strong> the sequenced data were available for us to use<br />

in subsequent analyses, the first one being alignment to the reference genome.<br />

Finally, the tag sequences contained within the recounted and filtered reads<br />

were mapped in two complementary ways. Notice that, in order to maximise<br />

the ability <strong>of</strong> the aligner to effectively map our short tag sequences to the ref-<br />

erence genome, we included the CATG recognition site <strong>of</strong> the MmeI anchoring<br />

enzyme, at the 5’ <strong>of</strong> every tag sequence; this generated 21nt tag sequences that<br />

were used for alignment to the reference genome. For each library, we mapped<br />

the so-generated pool <strong>of</strong> 21nt tags to the human reference genome, as well as<br />

to a virtual tag-to-gene library that we assembled from already existing tag-to-<br />

gene libraries and a complementary one that we generated programmatically.<br />

The tag mapping strategy we adopted was a hierarchical one, as summarised in<br />

figure 6.4. This allowed us to generate sets <strong>of</strong> decreasing stringency - regarding<br />

the number <strong>of</strong> tags mapping to known transcripts - that were best fit for differ-<br />

122

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!