24.08.2013 Views

2009 Vienna - European Society of Human Genetics

2009 Vienna - European Society of Human Genetics

2009 Vienna - European Society of Human Genetics

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Genomics, Genomic technology and Epigenetics<br />

AB’s next generation sequencing platform. We have determined several<br />

conditions that result in high coverage and low bias libraries. These<br />

libraries enable the ability to read 50 bases on each tag, allowing rapid<br />

and precise mapping <strong>of</strong> structural variations, including translocations,<br />

across the entire genome <strong>of</strong> complex organisms.<br />

P11.110<br />

improving the quality <strong>of</strong> DNA libraries in the Next Generation<br />

Sequencing workflow<br />

R. Salowsky1 , K. Gromadski1 , S. Glueck1 , N. Bontoux2 ;<br />

1 2 Agilent Technologies, Waldbronn, Germany, Agilent Technologies, Massy,<br />

France.<br />

Next-generation sequencing technologies play an important role in investigating<br />

complete cancer genomes and transcriptomes. To further<br />

increase productivity <strong>of</strong> this compelling technique, the quality <strong>of</strong> DNA<br />

libraries plays an important role. One important step in the Illumina sequencing<br />

workflow is the amplification <strong>of</strong> the generated libraries for determining<br />

the exact sequencing cluster concentration. The drawback<br />

<strong>of</strong> this step is that amplification artefacts and errors will be introduced<br />

into the target sequence.<br />

An on-chip electrophoresis instrument has become a standard tool for<br />

implementing DNA library quality control and quantification in the Illumina<br />

workflow. The micr<strong>of</strong>luidic device monitors the size and quantification<br />

<strong>of</strong> the amplified libraries and also helps to detect contaminating<br />

artefacts. With an optimized protocol and newly developed electrophoresis<br />

chemistry, the sensitivity could be increased by a factor <strong>of</strong> 20-30,<br />

down to the pg/µl concentration range. This improved detection sensitivity<br />

allows for the significant reduction <strong>of</strong> amplification cycles thereby<br />

reducing the target sequence error rate.<br />

P11.111<br />

mapreads: a tool that rapidly maps short reads to a genome<br />

E. N. Spier1 , S. Katzman2 , N. Mullaken1 , Y. Sun1 , J. Ni1 , Z. Zhang1 ;<br />

1 2 Applied Biosystems, Foster City, CA, United States, University <strong>of</strong> California,<br />

Santa Cruz, Santa Cruz, CA, United States.<br />

Mapreads is a bioinformatics tool that is optimized to rapidly map short<br />

SOLiD (color space) or sequence reads to the genome. Mapreads<br />

indexes the genome for a specific word size (W=14/15 is default for<br />

mapreads), but unlike BLAST can use both continuous and discontinuous<br />

word patterns called “schemas”. An example <strong>of</strong> a discontinuous<br />

schema for a 25-mer will be 7 matches in the beginning <strong>of</strong> the<br />

read followed by 11 charcters followed by 7 matches (effective W=14).<br />

Only seven W=14 schemas are required to map a 25-mer with up to<br />

two mismatches (25-2) enabling much quicker searchers than BLAST<br />

W=7. A new feature <strong>of</strong> mapreads allows to specify the number <strong>of</strong> mismatches<br />

for each schema and report “k-best” hits in the genome. A<br />

single base difference between a read and the reference leads to two<br />

color differences - so called valid-adjacent (VA) mismatches. Mapreads<br />

can count these as a single (VA=1) or two (VA=2) mismatches. In<br />

addition mapreads supports IUB codes in the reference sequence enabling<br />

to avoid “non-reference allele bias” when mapping short reads<br />

to known SNPs. We present results how mapping and false-positive /<br />

false-negative SNP-calling rates for human paired 25-mers depend on<br />

mapping parameters. We use both artificially generated color-reads<br />

with “spiked errors” and SOLiD sequences from a HapMap individual<br />

with known genotypes. Performance for mapping to human genome<br />

ispresented. The source code <strong>of</strong> mapreads is available under GPL<br />

from http://solids<strong>of</strong>twaretools.com/gf/project/mapreads/<br />

P11.112<br />

multiplex sequencing on the sOLiD Platform with 10, 16, or 96<br />

Barcodes<br />

L. Zhang1 , G. Silfwerbrand2 , A. Rico2 , J. Stuart1 , J. Bodeau2 , C. Hendrickson1 ,<br />

E. Dimalanta1 , J. Manning1 , H. Peckham1 , A. Blanchard1 , G. Costa1 , T. Sokolsky1<br />

, K. McKernan1 ;<br />

1 2 Applied Biosystems, Beverly, MA, United States, Applied Biosystems, Foster<br />

City, CA, United States.<br />

The SOLIDTM DNA sequencing system utilizes stepwise ligation <strong>of</strong><br />

oligonucleotide probes and enables high fidelity, high throughput sequencing.<br />

In order to maximize sequencing capacity and reduce workflow<br />

<strong>of</strong> sample preparation, a single sequencing run containing multiple<br />

biological samples is sometimes preferred. To this end, a multiplexing<br />

method with barcodes has been developed for the SOLIDTM platform.<br />

Barcodes are unique 5-7 base sequences that are added at the 3’ end<br />

<strong>of</strong> the template along with a barcode priming region. Sets <strong>of</strong> 10, 16,<br />

and 96 barcodes have been designed and can be assigned to up to<br />

96 individual samples. Data presented shows the sequencing results<br />

<strong>of</strong> all three sets <strong>of</strong> barcodes, as well as the results <strong>of</strong> an alternative<br />

design <strong>of</strong> 20 barcodes. For all sets <strong>of</strong> barcode analysis, over 96% <strong>of</strong><br />

matching sequenced tags contain a barcode. The multiplexing system<br />

coupled with SOLID TM ’s ability to process two slides, accommodating<br />

two to sixteen depositions, enables researchers to sequence over a<br />

thousand patients or unique biological samples within a single run.<br />

P11.113<br />

the Detectable Genome: How much <strong>of</strong> the human genome is<br />

accessible to variant discovery by next-generation sequencing?<br />

L. He 1 , S. Thoraval 2 , H. E. Peckham 3 , Y. Fu 3 , S. F. McLaughlin 3 , E. F. Tsung 3 , S.<br />

S. Ranade 4 , C. C. Lee 3 , C. R. Clouser 3 , J. M. Manning 3 , C. L. Hendrickson 3 , L.<br />

Zhang 3 , E. T. Dimalanta 3 , T. D. Sokolsky 3 , J. K. Ichikawa 3 , J. B. Warner 3 , M. W.<br />

Laptewicz 3 , B. E. Coleman 3 , B. Li 4 , A. P. Blanchard 3 , J. A. Malek 5 , G. L. Costa 3 ,<br />

K. J. McKernan 3 , J. Mangion 6 ;<br />

1 Applied Biosystems, Sweden, 2 Applied Biosystems, France, 3 Applied Biosystems,<br />

MA, United States, 4 Applied Biosystems, CA, United States, 5 Weill Cornell<br />

Medical College in Qatar, Qatar, 6 Applied Biosystems, United Kingdom.<br />

The human genome is being vigorously sequenced in an effort to<br />

understand the extent <strong>of</strong> normal human variation as well as disease<br />

causing variants. This initiative brings with it the challenge <strong>of</strong> assessing<br />

the areas <strong>of</strong> the human genome that are accessible to variant detection.<br />

We illustrate the amount <strong>of</strong> the human genome that is covered<br />

with uniquely placed single tags and uniquely placed mate pairs and<br />

demonstrate how both larger insert sizes and read lengths increase<br />

the portion <strong>of</strong> the genome that is uniquely mappable by paired-end<br />

tags. We use various human genomes (NA18507 - 10x Yoruban male,<br />

NA19240 - 26x Yoruban female) sequenced with SOLiD TM sequencing<br />

to illustrate the amount <strong>of</strong> SNPs and indels that are detected at various<br />

levels <strong>of</strong> average sequence coverage. We also demonstrate the<br />

sequence and clone coverage needed to identify indels <strong>of</strong> any size<br />

between paired-end reads. We use libraries with an assortment <strong>of</strong><br />

insert sizes to show that larger libraries increase the accessibility <strong>of</strong><br />

the genome by spanning larger insertions. We show that the bisulfite<br />

converted human genome is less uniquely mappable than the normal<br />

human genome but significantly less signature is lost in color space<br />

than in base space. We also illustrate that a significant portion <strong>of</strong> large<br />

segmental duplications are accessible to sequence and clone coverage<br />

by paired-end reads. These principles are applicable to all nextgeneration<br />

sequencing platforms and are essential to comprehend the<br />

amount and location <strong>of</strong> variability in the human genome.<br />

P11.114<br />

Tissue-specific forkhead protein FOXA2 regulates SOX gene<br />

expression<br />

J. Popovic, M. Stevanovic;<br />

Institute <strong>of</strong> molecular genetics and genetic engeenering, Belgrade, Serbia.<br />

Sox14/SOX14 is a member <strong>of</strong> B2 sub-group <strong>of</strong> Sox/SOX gene super-family<br />

that functions as transcriptional repressor. Its expression is<br />

restricted to a limited population <strong>of</strong> neurons in the developing brain and<br />

spinal cord. In spinal cord explants, expression <strong>of</strong> Sox14 was found to<br />

be regulated by Sonic hedgehog (SHH). Foxa2 (previously named Hepatic<br />

nuclear factor-3-beta: HNF3β) displays a remarkable functional<br />

diversity and is involved in a wide variety <strong>of</strong> biological processes during<br />

development and adulthood. In the developing nervous system,<br />

Foxa2 can be detected in the floor plate <strong>of</strong> the spinal cord and in periventricular<br />

areas <strong>of</strong> the midbrain and diencephalons.<br />

We have proceeded with investigation <strong>of</strong> transcriptional regulation <strong>of</strong> human<br />

SOX14 gene expression. Our gel-shift and super-shift experiments<br />

demonstrated that FOXA2 interacts directly with predicted binding site<br />

within the SOX14 enhancer region. Using mutated oligonucleotide probe<br />

we further confirmed specificity <strong>of</strong> the FOXA2 binding to the predicted<br />

site. Results obtained with Foxa2 over-expression in co-transfection experiments<br />

confirmed that SOX14 enhancer region possesses regulatory<br />

element activated by this protein in HepG2 and U87MG cells.<br />

In conclusion, here we present the first evidence that transcription factor<br />

FOXA2 is involved in the up-regulation <strong>of</strong> human SOX14 expression<br />

in HepG2 and U87MG cells by direct interaction with the binding<br />

site within its enhancer region.<br />

0

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!