An Integrated Data Analysis Suite and Programming ... - TOBIAS-lib
BIBLIOGRAPHY
[110] P. J. Cock, C. J. Fields, N. Goto, M. L. Heuer, and P. M. Rice. The Sanger FASTQ file
format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. In:
Nucleic Acids Res. 38.6 (Apr. 2010), pp. 1767–1771. doi: 10.1093/nar/gkp1137. pmid:
20015970.
[111] The SAM Format Specification Working Group. The SAM Format Specification. Sept. 2011.
WebCite: 6FqvzGZJ8. url: http://samtools.sourceforge.net/SAM1.pdf (accessed Feb. 27, 2013).
[112] L. Stein. Generic Feature Format Version 3. Feb. 2013. WebCite: 5R00Wxobq.
url: http://www.sequenceontology.org/gff3.shtml (accessed Mar. 4, 2013).
[113] M. G. Reese, B. Moore, C. Batchelor, F. Salas, F. Cunningham, G. T. Marth, L. Stein, P.
Flicek, M. Yandell, and K. Eilbeck. A standard variation file format for human genome
sequences. In: Genome Biol. 11.8 (2010), R88. doi: 10.1186/gb-2010-11-8-r88. pmid:
20796305.
[114] P. Danecek, A. Auton, G. Abecasis, C. A. Albers, E. Banks, M. A. DePristo, R. E.
Handsaker, G. Lunter, G. T. Marth, S. T. Sherry, G. McVean, R. Durbin, R. Durbin,
D. Altshuler, G. Abecasis, D. Bentley, A. Chakravarti, A. Clark, F. De La Vega, P.
Donnelly, M. Dunn, P. Flicek, S. Gabriel, E. Green, R. Gibbs, B. Knoppers, E. Lander,
H. Lehrach, E. Mardis, G. Marth, et al. The variant call format and VCFtools. In:
Bioinformatics 27.15 (Aug. 2011), pp. 2156–2158. doi: 10.1093/bioinformatics/btr330.
pmid: 21653522.
[115] P. Deutsch. DEFLATE Compressed Data Format Specification version 1.3. RFC 1951
(Informational). Internet Engineering Task Force, May 1996. WebCite: 6Fr6mTscx. url:
http://www.ietf.org/rfc/rfc1951.txt.
[116] P. Deutsch. GZIP file format specification version 4.3. RFC 1952 (Informational). Internet
Engineering Task Force, May 1996. WebCite: 6Fr6csgTT.
url: http://www.ietf.org/rfc/rfc1952.txt.
[117] X. Chen, M. Li, B. Ma, and J. Tromp. DNACompress: fast and effective DNA sequence
compression. In: Bioinformatics 18.12 (Dec. 2002), pp. 1696–1698.
doi: 10.1093/bioinformatics/18.12.1696. pmid: 12490460.
[118] F. Hach, I. Numanagic, C. Alkan, and S. C. Sahinalp. SCALCE: boosting sequence
compression algorithms using locally consistent encoding. In: Bioinformatics 28.23 (Dec.
2012), pp. 3051–3057. doi: 10.1093/bioinformatics/bts593. pmid: 23047557.
[119] D. C. Jones, W. L. Ruzzo, X. Peng, and M. G. Katze. Compression of next-generation
sequencing reads aided by highly efficient de novo assembly. In: Nucleic Acids Res. 40.22
(Dec. 2012), e171. doi: 10.1093/nar/gks754. pmid: 22904078.
[120] M. Hsi-Yang Fritz, R. Leinonen, G. Cochrane, and E. Birney. Efficient storage of high
throughput DNA sequencing data using reference-based compression. In: Genome Res.
21.5 (May 2011), pp. 734–740. doi: 10.1101/gr.114819.110. pmid: 21245279.
[121] S. Golomb. Run-length encodings (Corresp.). In: Information Theory, IEEE Transactions
on 12.3 (1966), pp. 399–401. issn: 0018-9448. doi: 10.1109/TIT.1966.1053907.
[122] D. Huffman. A Method for the Construction of Minimum-Redundancy Codes. In: Proceedings
of the IRE 40.9 (1952), pp. 1098–1101. issn: 0096-8390.
doi: 10.1109/JRPROC.1952.273898.
[123] W. J. Kent, A. S. Zweig, G. Barber, A. S. Hinrichs, and D. Karolchik. BigWig and
BigBed: enabling browsing of large distributed datasets. In: Bioinformatics 26.17 (Sept.
2010), pp. 2204–2207. doi: 10.1093/bioinformatics/btq351. pmid: 20639541.