28.02.2014 Views

An Integrated Data Analysis Suite and Programming ... - TOBIAS-lib

An Integrated Data Analysis Suite and Programming ... - TOBIAS-lib

An Integrated Data Analysis Suite and Programming ... - TOBIAS-lib

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

BIBLIOGRAPHY 91<br />

[110] P. J. Cock, C. J. Fields, N. Goto, M. L. Heuer, <strong>and</strong> P. M. Rice. The Sanger FASTQ le<br />

format for sequences with quality scores, <strong>and</strong> the Solexa/Illumina FASTQ variants. In:<br />

Nucleic Acids Res. 38.6 (Apr. 2010), pp. 17671771. doi: 10.1093/nar/gkp1137. pmid:<br />

20015970.<br />

[111] T. S. F. S. W. Group. The SAM Format Specication. Sept. 2011. WebCite: 6FqvzGZJ8.<br />

url: http://samtools.sourceforge.net/SAM1.pdf Feb. 27, 2013.<br />

[112] L. Stein. Generic Feature Format Version 3. Feb. 2013. WebCite: 5R00Wxobq. url: http:<br />

//www.sequenceontology.org/gff3.shtml Mar. 4, 2013.<br />

[113] M. G. Reese, B. Moore, C. Batchelor, F. Salas, F. Cunningham, G. T. Marth, L. Stein, P.<br />

Flicek, M. Y<strong>and</strong>ell, <strong>and</strong> K. Eilbeck. A st<strong>and</strong>ard variation le format for human genome<br />

sequences. In: Genome Biol. 11.8 (2010), R88. doi: 10.1186/gb-2010-11-8-r88. pmid:<br />

20796305.<br />

[114] P. Danecek, A. Auton, G. Abecasis, C. A. Albers, E. Banks, M. A. DePristo, R. E.<br />

H<strong>and</strong>saker, G. Lunter, G. T. Marth, S. T. Sherry, G. McVean, R. Durbin, R. Durbin,<br />

D. Altshuler, G. Abecasis, D. Bentley, A. Chakravarti, A. Clark, F. De La Vega, P.<br />

Donnelly, M. Dunn, P. Flicek, S. Gabriel, E. Green, R. Gibbs, B. Knoppers, E. L<strong>and</strong>er,<br />

H. Lehrach, E. Mardis, G. Marth, et al. The variant call format <strong>and</strong> VCFtools. In:<br />

Bioinformatics 27.15 (Aug. 2011), pp. 21562158. doi: 10.1093/bioinformatics/btr330.<br />

pmid: 21653522.<br />

[115] P. Deutsch. DEFLATE Compressed <strong>Data</strong> Format Specication version 1.3. RFC 1951<br />

(Informational). Internet Engineering Task Force, May 1996. WebCite: 6Fr6mTscx. url:<br />

http://www.ietf.org/rfc/rfc1951.txt.<br />

[116] P. Deutsch. GZIP le format specication version 4.3. RFC 1952 (Informational). Internet<br />

Engineering Task Force, May 1996. WebCite: 6Fr6csgTT. url: http://www.ietf.org/rfc/<br />

rfc1952.txt.<br />

[117] X. Chen, M. Li, B. Ma, <strong>and</strong> J. Tromp. DNACompress: fast <strong>and</strong> eective DNA sequence<br />

compression. In: Bioinformatics 18.12 (Dec. 2002), pp. 16961698. doi: 10.1093/<br />

bioinformatics/18.12.1696. pmid: 12490460.<br />

[118] F. Hach, I. Numanagic, C. Alkan, <strong>and</strong> S. C. Sahinalp. SCALCE: boosting sequence<br />

compression algorithms using locally consistent encoding. In: Bioinformatics 28.23 (Dec.<br />

2012), pp. 30513057. doi: 10.1093/bioinformatics/bts593. pmid: 23047557.<br />

[119] D. C. Jones, W. L. Ruzzo, X. Peng, <strong>and</strong> M. G. Katze. Compression of next-generation<br />

sequencing reads aided by highly ecient de novo assembly. In: Nucleic Acids Res. 40.22<br />

(Dec. 2012), e171. doi: 10.1093/nar/gks754. pmid: 22904078.<br />

[120] M. Hsi-Yang Fritz, R. Leinonen, G. Cochrane, <strong>and</strong> E. Birney. Ecient storage of high<br />

throughput DNA sequencing data using reference-based compression. In: Genome Res.<br />

21.5 (May 2011), pp. 734740. doi: 10.1101/gr.114819.110. pmid: 21245279.<br />

[121] S. Golomb. Run-length encodings (Corresp.) In: Information Theory, IEEE Transactions<br />

on 12.3 (1966), pp. 399401. issn: 0018-9448. doi: 10.1109/TIT.1966.1053907.<br />

[122] D. Human. A Method for the Construction of Minimum-Redundancy Codes. In: Proceedings<br />

of the IRE 40.9 (1952), pp. 10981101. issn: 0096-8390. doi: 10.1109/JRPROC.<br />

1952.273898.<br />

[123] W. J. Kent, A. S. Zweig, G. Barber, A. S. Hinrichs, <strong>and</strong> D. Karolchik. BigWig <strong>and</strong><br />

BigBed: enabling browsing of large distributed datasets. In: Bioinformatics 26.17 (Sept.<br />

2010), pp. 22042207. doi: 10.1093/bioinformatics/btq351. pmid: 20639541.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!