28.02.2014 Views

An Integrated Data Analysis Suite and Programming ... - TOBIAS-lib

An Integrated Data Analysis Suite and Programming ... - TOBIAS-lib

An Integrated Data Analysis Suite and Programming ... - TOBIAS-lib

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

1.3. APPLICATIONS OF HIGH-THROUGHPUT SEQUENCING 5<br />

sequencing further enables recovery <strong>and</strong> quantication of unexpected transcripts <strong>and</strong> splice<br />

forms.<br />

A variety of dierent transcriptome sequencing protocols are available, distinguished by their<br />

suitability for assessment of transcript structure <strong>and</strong> quantication as well as capabilities to<br />

detect certain types of transcript <strong>and</strong> to recover transcript directionality. As with all examples<br />

of quantication by sequencing, data interpretation may be intricate (section 1.5.5).<br />

A special class of transcript is given by small RNA, short cellular RNA molecules ∼2025<br />

nucleotides in length ubiquitous in almost all well-studied eukaryotes. These have been identied<br />

as another vital factor for transcript regulation, <strong>and</strong> are further implicated in various important<br />

cellular processes such as DNA methylation <strong>and</strong> pathogen response [51, 52]. Transcript levels<br />

are aected by small RNA through RNA interference (RNAi). As the specicity-determining<br />

component of RNA induced silencing complexes (RISC), they direct the degrading enzymatic<br />

activity of ARGONAUTE proteins at the targeted, roughly complementary transcripts.<br />

Small RNA molecules are attributed to multiple cellular pathways in plants <strong>and</strong> animals generally<br />

involving double str<strong>and</strong>ed RNA or RNA stem-loop structures. According to the generating<br />

biochemical pathway, small RNA can be categorized into dierent classes, notably microRNA<br />

(miRNA) <strong>and</strong> small interfering RNA (siRNA), linked with overlapping, but distinct cellular roles.<br />

High-throughput small RNA proling by sequencing facilitates genome-wide discovery, categorization<br />

<strong>and</strong> relative quantication for the various small RNA species. However, small RNA<br />

quantication is sensitive to sequence specic bias [53], with their short length likely introducing<br />

increased variance to base composition bias (section 1.4) <strong>and</strong> emphasizing the importance of ligation<br />

biases. Furthermore, relative quantication with reference to <strong>lib</strong>rary size is often deemed<br />

unsuitable for an investigation, <strong>and</strong> consensus on appropriate normalizing transformations is currently<br />

lacking (cf. [54]). Nonetheless, small RNA sequencing data prove a valuable quantitative<br />

resource on the genome scale, e. g. as a proxy for RNA-directed DNA methylation [55] or in<br />

correlation with further transcriptome or epigenetic data.<br />

1.3.2 ChIP-Seq <strong>and</strong> further Immunoprecipitation Protocols<br />

Expression of genes is thought to be connected in complex regulatory networks of positive <strong>and</strong><br />

negative feedback. A key role in these networks is occupied by transcription factors, DNA-binding<br />

proteins involved with initiation of gene transcription at the promoter sequences. ChIP-Seq, the<br />

successor of the microarray-based ChIP-chip technology featuring improved signal-to-noise ratio<br />

<strong>and</strong> sharper signal bounds, facilitates the identication of putative transcription factor targets<br />

in a relevant in-vivo context. By elucidation of individual transcription factors' targets, our<br />

underst<strong>and</strong>ing of the complex transcriptional interplay can be gradually improved.<br />

Immunoprecipitation (IP) methods exploit the antibody-antigen anity to enrich specic<br />

molecules in a solution. In the course of the procedure, antibodies for the antigen to be concentrated<br />

are incubated with the solution to promote formation of antibody-antigen complexes.<br />

<strong>An</strong>tibodies are captured on solid-phase beads, thus enabling precipitation to enrich for the formed<br />

complexes.<br />

Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-Seq) employs<br />

the IP technique for in-vivo detection of regions of co-localization of a specic molecule, usually a<br />

protein, with the genomic DNA. During the procedure, live tissue is prepared with formaldehyde<br />

or an appropriate alternative cross-linking reagent. Formaldehyde undergoes a reaction that<br />

induces cross-linking of amino groups in close vicinity, thereby promoting formation of DNAprotein<br />

complexes (DPC) (cf. Barker et al. [56]). Following lysis of the tissue, the DNA is<br />

sonicated, yielding DPC associated with only short fragments of genomic DNA. The DPC are<br />

immunoprecipitated <strong>and</strong>, by purication <strong>and</strong> amplication of the contained DNA, prepared as

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!