2009 Vienna - European Society of Human Genetics
2009 Vienna - European Society of Human Genetics
2009 Vienna - European Society of Human Genetics
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Genomics, Genomic technology and Epigenetics<br />
AB’s next generation sequencing platform. We have determined several<br />
conditions that result in high coverage and low bias libraries. These<br />
libraries enable the ability to read 50 bases on each tag, allowing rapid<br />
and precise mapping <strong>of</strong> structural variations, including translocations,<br />
across the entire genome <strong>of</strong> complex organisms.<br />
P11.110<br />
improving the quality <strong>of</strong> DNA libraries in the Next Generation<br />
Sequencing workflow<br />
R. Salowsky1 , K. Gromadski1 , S. Glueck1 , N. Bontoux2 ;<br />
1 2 Agilent Technologies, Waldbronn, Germany, Agilent Technologies, Massy,<br />
France.<br />
Next-generation sequencing technologies play an important role in investigating<br />
complete cancer genomes and transcriptomes. To further<br />
increase productivity <strong>of</strong> this compelling technique, the quality <strong>of</strong> DNA<br />
libraries plays an important role. One important step in the Illumina sequencing<br />
workflow is the amplification <strong>of</strong> the generated libraries for determining<br />
the exact sequencing cluster concentration. The drawback<br />
<strong>of</strong> this step is that amplification artefacts and errors will be introduced<br />
into the target sequence.<br />
An on-chip electrophoresis instrument has become a standard tool for<br />
implementing DNA library quality control and quantification in the Illumina<br />
workflow. The micr<strong>of</strong>luidic device monitors the size and quantification<br />
<strong>of</strong> the amplified libraries and also helps to detect contaminating<br />
artefacts. With an optimized protocol and newly developed electrophoresis<br />
chemistry, the sensitivity could be increased by a factor <strong>of</strong> 20-30,<br />
down to the pg/µl concentration range. This improved detection sensitivity<br />
allows for the significant reduction <strong>of</strong> amplification cycles thereby<br />
reducing the target sequence error rate.<br />
P11.111<br />
mapreads: a tool that rapidly maps short reads to a genome<br />
E. N. Spier1 , S. Katzman2 , N. Mullaken1 , Y. Sun1 , J. Ni1 , Z. Zhang1 ;<br />
1 2 Applied Biosystems, Foster City, CA, United States, University <strong>of</strong> California,<br />
Santa Cruz, Santa Cruz, CA, United States.<br />
Mapreads is a bioinformatics tool that is optimized to rapidly map short<br />
SOLiD (color space) or sequence reads to the genome. Mapreads<br />
indexes the genome for a specific word size (W=14/15 is default for<br />
mapreads), but unlike BLAST can use both continuous and discontinuous<br />
word patterns called “schemas”. An example <strong>of</strong> a discontinuous<br />
schema for a 25-mer will be 7 matches in the beginning <strong>of</strong> the<br />
read followed by 11 charcters followed by 7 matches (effective W=14).<br />
Only seven W=14 schemas are required to map a 25-mer with up to<br />
two mismatches (25-2) enabling much quicker searchers than BLAST<br />
W=7. A new feature <strong>of</strong> mapreads allows to specify the number <strong>of</strong> mismatches<br />
for each schema and report “k-best” hits in the genome. A<br />
single base difference between a read and the reference leads to two<br />
color differences - so called valid-adjacent (VA) mismatches. Mapreads<br />
can count these as a single (VA=1) or two (VA=2) mismatches. In<br />
addition mapreads supports IUB codes in the reference sequence enabling<br />
to avoid “non-reference allele bias” when mapping short reads<br />
to known SNPs. We present results how mapping and false-positive /<br />
false-negative SNP-calling rates for human paired 25-mers depend on<br />
mapping parameters. We use both artificially generated color-reads<br />
with “spiked errors” and SOLiD sequences from a HapMap individual<br />
with known genotypes. Performance for mapping to human genome<br />
ispresented. The source code <strong>of</strong> mapreads is available under GPL<br />
from http://solids<strong>of</strong>twaretools.com/gf/project/mapreads/<br />
P11.112<br />
multiplex sequencing on the sOLiD Platform with 10, 16, or 96<br />
Barcodes<br />
L. Zhang1 , G. Silfwerbrand2 , A. Rico2 , J. Stuart1 , J. Bodeau2 , C. Hendrickson1 ,<br />
E. Dimalanta1 , J. Manning1 , H. Peckham1 , A. Blanchard1 , G. Costa1 , T. Sokolsky1<br />
, K. McKernan1 ;<br />
1 2 Applied Biosystems, Beverly, MA, United States, Applied Biosystems, Foster<br />
City, CA, United States.<br />
The SOLIDTM DNA sequencing system utilizes stepwise ligation <strong>of</strong><br />
oligonucleotide probes and enables high fidelity, high throughput sequencing.<br />
In order to maximize sequencing capacity and reduce workflow<br />
<strong>of</strong> sample preparation, a single sequencing run containing multiple<br />
biological samples is sometimes preferred. To this end, a multiplexing<br />
method with barcodes has been developed for the SOLIDTM platform.<br />
Barcodes are unique 5-7 base sequences that are added at the 3’ end<br />
<strong>of</strong> the template along with a barcode priming region. Sets <strong>of</strong> 10, 16,<br />
and 96 barcodes have been designed and can be assigned to up to<br />
96 individual samples. Data presented shows the sequencing results<br />
<strong>of</strong> all three sets <strong>of</strong> barcodes, as well as the results <strong>of</strong> an alternative<br />
design <strong>of</strong> 20 barcodes. For all sets <strong>of</strong> barcode analysis, over 96% <strong>of</strong><br />
matching sequenced tags contain a barcode. The multiplexing system<br />
coupled with SOLID TM ’s ability to process two slides, accommodating<br />
two to sixteen depositions, enables researchers to sequence over a<br />
thousand patients or unique biological samples within a single run.<br />
P11.113<br />
the Detectable Genome: How much <strong>of</strong> the human genome is<br />
accessible to variant discovery by next-generation sequencing?<br />
L. He 1 , S. Thoraval 2 , H. E. Peckham 3 , Y. Fu 3 , S. F. McLaughlin 3 , E. F. Tsung 3 , S.<br />
S. Ranade 4 , C. C. Lee 3 , C. R. Clouser 3 , J. M. Manning 3 , C. L. Hendrickson 3 , L.<br />
Zhang 3 , E. T. Dimalanta 3 , T. D. Sokolsky 3 , J. K. Ichikawa 3 , J. B. Warner 3 , M. W.<br />
Laptewicz 3 , B. E. Coleman 3 , B. Li 4 , A. P. Blanchard 3 , J. A. Malek 5 , G. L. Costa 3 ,<br />
K. J. McKernan 3 , J. Mangion 6 ;<br />
1 Applied Biosystems, Sweden, 2 Applied Biosystems, France, 3 Applied Biosystems,<br />
MA, United States, 4 Applied Biosystems, CA, United States, 5 Weill Cornell<br />
Medical College in Qatar, Qatar, 6 Applied Biosystems, United Kingdom.<br />
The human genome is being vigorously sequenced in an effort to<br />
understand the extent <strong>of</strong> normal human variation as well as disease<br />
causing variants. This initiative brings with it the challenge <strong>of</strong> assessing<br />
the areas <strong>of</strong> the human genome that are accessible to variant detection.<br />
We illustrate the amount <strong>of</strong> the human genome that is covered<br />
with uniquely placed single tags and uniquely placed mate pairs and<br />
demonstrate how both larger insert sizes and read lengths increase<br />
the portion <strong>of</strong> the genome that is uniquely mappable by paired-end<br />
tags. We use various human genomes (NA18507 - 10x Yoruban male,<br />
NA19240 - 26x Yoruban female) sequenced with SOLiD TM sequencing<br />
to illustrate the amount <strong>of</strong> SNPs and indels that are detected at various<br />
levels <strong>of</strong> average sequence coverage. We also demonstrate the<br />
sequence and clone coverage needed to identify indels <strong>of</strong> any size<br />
between paired-end reads. We use libraries with an assortment <strong>of</strong><br />
insert sizes to show that larger libraries increase the accessibility <strong>of</strong><br />
the genome by spanning larger insertions. We show that the bisulfite<br />
converted human genome is less uniquely mappable than the normal<br />
human genome but significantly less signature is lost in color space<br />
than in base space. We also illustrate that a significant portion <strong>of</strong> large<br />
segmental duplications are accessible to sequence and clone coverage<br />
by paired-end reads. These principles are applicable to all nextgeneration<br />
sequencing platforms and are essential to comprehend the<br />
amount and location <strong>of</strong> variability in the human genome.<br />
P11.114<br />
Tissue-specific forkhead protein FOXA2 regulates SOX gene<br />
expression<br />
J. Popovic, M. Stevanovic;<br />
Institute <strong>of</strong> molecular genetics and genetic engeenering, Belgrade, Serbia.<br />
Sox14/SOX14 is a member <strong>of</strong> B2 sub-group <strong>of</strong> Sox/SOX gene super-family<br />
that functions as transcriptional repressor. Its expression is<br />
restricted to a limited population <strong>of</strong> neurons in the developing brain and<br />
spinal cord. In spinal cord explants, expression <strong>of</strong> Sox14 was found to<br />
be regulated by Sonic hedgehog (SHH). Foxa2 (previously named Hepatic<br />
nuclear factor-3-beta: HNF3β) displays a remarkable functional<br />
diversity and is involved in a wide variety <strong>of</strong> biological processes during<br />
development and adulthood. In the developing nervous system,<br />
Foxa2 can be detected in the floor plate <strong>of</strong> the spinal cord and in periventricular<br />
areas <strong>of</strong> the midbrain and diencephalons.<br />
We have proceeded with investigation <strong>of</strong> transcriptional regulation <strong>of</strong> human<br />
SOX14 gene expression. Our gel-shift and super-shift experiments<br />
demonstrated that FOXA2 interacts directly with predicted binding site<br />
within the SOX14 enhancer region. Using mutated oligonucleotide probe<br />
we further confirmed specificity <strong>of</strong> the FOXA2 binding to the predicted<br />
site. Results obtained with Foxa2 over-expression in co-transfection experiments<br />
confirmed that SOX14 enhancer region possesses regulatory<br />
element activated by this protein in HepG2 and U87MG cells.<br />
In conclusion, here we present the first evidence that transcription factor<br />
FOXA2 is involved in the up-regulation <strong>of</strong> human SOX14 expression<br />
in HepG2 and U87MG cells by direct interaction with the binding<br />
site within its enhancer region.<br />
0