04.04.2013 Views

Transcriptional Characterization of Glioma Neural Stem Cells Diva ...

Transcriptional Characterization of Glioma Neural Stem Cells Diva ...

Transcriptional Characterization of Glioma Neural Stem Cells Diva ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

6.2 Tag mapping Results<br />

Table 6.1: Summary <strong>of</strong> the available clinical data for our GNS cell lines (M=Male,<br />

F=Female, n.a=not applicable).<br />

Type <strong>of</strong> Name <strong>of</strong> Tissue Type Sex IDH1 IDH2 Patient<br />

cell line cell line mutation mutation age (years)<br />

GNS G144 GBM Primary M No No 51<br />

GNS G144ED GBM Primary M No No 51<br />

GNS G166 GBM Primary F No No 74<br />

GNS G179 Giant cell GBM Primary M No No 56<br />

NS CB541 Fetal forebrain n.a. n.a. n.a. n.a. n.a.<br />

NS CB660 Fetal forebrain n.a. n.a. n.a. n.a. n.a.<br />

6.2 Tag mapping<br />

We created one Tag-seq library per cell line and obtained between 6 and 13<br />

million sequence reads from each (Table 6.2). Every read was formed by a first<br />

sequencing primer, the 17nt tag, and a second sequencing primer in this order<br />

(Fig 6.1).<br />

Table 6.2: Summary <strong>of</strong> reads per cell line library.<br />

Type <strong>of</strong> cell line Cell line Number <strong>of</strong> reads<br />

GNS G144 7,133,520<br />

GNS G144ED 6,383,175<br />

GNS G166 13,415,402<br />

GNS G179 11,610,415<br />

NS CB541 12,103,066<br />

NS CB660 10,043,561<br />

Figure 6.1: Diagram <strong>of</strong> the construct generated by the longSAGE protocol sent to<br />

the Illumina sequencer for the final step <strong>of</strong> Tag-seq.<br />

Once the read sequences were received as FASTA files, we first extracted the<br />

17nt tags they contained, then filtered these tags and, finally, aligned them<br />

to the genome. The entire process is summarised as a diagram in figure 6.2.<br />

Firstly, the 17nt tags were extracted out <strong>of</strong> each read, separating the primer<br />

sequences from the tag sequences that actually interested us. Secondly, each<br />

tag sequence was counted and the resulting counts adjusted for sequencing and<br />

library preparation errors that might have caused highly expressed transcripts<br />

to give rise to a significant number <strong>of</strong> tags differing from the expected tag se-<br />

quence by just one base. Secondly, the reads that were not going to map onto<br />

121

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!