Publications <strong>and</strong> manuscripts Publications <strong>in</strong>cluded <strong>in</strong> this thesis are listed <strong>in</strong> the order they appear. All other articles are sorted by publication date, descend<strong>in</strong>g. For papers with five <strong>and</strong> more citations this number is <strong>in</strong>dicated. Paper I Hall<strong>in</strong> PF, B<strong>in</strong>newies TT, Ussery DW. The genome BLASTatlas - a GeneWiz extension for visualization of whole-genome homology. Mol Biosyst 4:363-71 (2008). Paper II B<strong>in</strong>newies TT, Motro Y, Hall<strong>in</strong> PF, Lund O, Dunn D. La T, Hampson DJ, Bellgard M, Wassenaar TM, Ussery DW. Ten years of bacterial genome sequenc<strong>in</strong>g: comparative– genomics–based discoveries. Funct Integr Genomics 6:165-85 (2006) - 56 citations. Paper III Reva ON, Hall<strong>in</strong> PF, Willenbrock H, Sicheritz-Ponten T, Tummler B, Ussery DW Global features of the Alcanivorax borkumensis SK2 genome. Environ Microbiol 10:614- 25 (2008). Paper IV Vesth T, Hall<strong>in</strong> PF, Snipen L, Lagesen K, Wassenaar TM, Ussery DW. The orig<strong>in</strong>s of Vibrio species. Microbial Ecology (2009) doi:10.1007/s00248-009-9596-7 Paper V Wassenaar TM, B<strong>in</strong>newies TT, Hall<strong>in</strong> PF, <strong>and</strong> Ussery DW Tools for comparison of bacterial genomes. Book chapter, Microbiology of Hydrocarbons, Oils, Lipids, <strong>and</strong> Derived Compounds, Spr<strong>in</strong>ger-Verlag, Heidelberg, Germany, 2009. ix
Paper VI [Lagesen K, Hall<strong>in</strong> P] 1 , Rodl<strong>and</strong> EA, Stærfeldt HH, Rognes T, Ussery DW. RNAmmer: consistent <strong>and</strong> rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35:3100-8 (2007) - 8 citations 2 Paper VII Hall<strong>in</strong> PF, Stærfeldt H, Rotenberg E, B<strong>in</strong>newies TT, Benham CJ, <strong>and</strong> Ussery DW. GeneWiz browser: An Interactive Tool for Visualiz<strong>in</strong>g Sequenced Chromosomes. St<strong>and</strong>ards <strong>in</strong> Genomic Sciences 1:204-215 (2009) doi:10.4056/sigs.28177. Papers not <strong>in</strong>cluded Contributions have been made to the follow<strong>in</strong>g papers dur<strong>in</strong>g my PhD project. • Miller WG, Parker CT, Rubenfield M, Mendz GL, Wosten MM, Ussery DW, Stolz JF, B<strong>in</strong>newies TT, Hall<strong>in</strong> PF, Wang G, Malek JA, Rogos<strong>in</strong> A, Stanker LH, M<strong>and</strong>rell RE. The complete genome sequence <strong>and</strong> analysis of the human pathogen Arcobacter butzleri. PLoS ONE 2:e1358 (2007) • Willenbrock H, Hall<strong>in</strong> PF, Wassenaar TM, Ussery DW Characterization of probiotic Escherichia coli isolates with a novel pan-genome microarray. Genome Biol 8:R267 (2007) Earlier papers, 2004–2006 • Worn<strong>in</strong>g P, Jensen LJ, Hall<strong>in</strong> PF, Stærfeldt HH, Ussery DW Orig<strong>in</strong> of replication <strong>in</strong> circular prokaryotic chromosomes. Environ Microbiol 8:353-61 (2006) - 28 citations • Kill K, B<strong>in</strong>newies TT, Sicheritz-Ponten T, Willenbrock H, Hall<strong>in</strong> PF, Wassenaar TM, Ussery DW Genome update: sigma factors <strong>in</strong> 240 bacterial genomes. Microbiology 151:3147-50 (2005) • Bendtsen JD, B<strong>in</strong>newies TT, Hall<strong>in</strong> PF, Ussery DW Genome update: prediction of membrane prote<strong>in</strong>s <strong>in</strong> prokaryotic genomes. Microbiology 151:2119-21 (2005) • Bendtsen JD, B<strong>in</strong>newies TT, Hall<strong>in</strong> PF, Sicheritz-Ponten T, Ussery DW Genome update: prediction of secreted prote<strong>in</strong>s <strong>in</strong> 225 bacterial proteomes. Microbiology 151:1725-7 (2005) • B<strong>in</strong>newies TT, Bendtsen JD, Hall<strong>in</strong> PF, Nielsen N, Wassenaar TM, Pedersen MB, Klemm P, Ussery DW Genome Update: Prote<strong>in</strong> secretion systems <strong>in</strong> 225 bacterial genomes. Microbiology 151:1013-6 (2005) • Hall<strong>in</strong> PF, Nielsen N, Dev<strong>in</strong>e KM, B<strong>in</strong>newies TT, Willenbrock H, Ussery DW Genome update: base skews <strong>in</strong> 200+ bacterial chromosomes. Microbiology 151:633-7 (2005) 1 Both authors contributed equally 2 Additionally 8 citations for the first 8 GEBA genomes published <strong>in</strong> SIGS journal; be<strong>in</strong>g part of a st<strong>and</strong>ard pipel<strong>in</strong>e, RNAmmer will be cited for future GEBA articles. x
- Page 1 and 2: Peter Fischer Hallin | 2009 Peter F
- Page 4: Preface This Ph.D. thesis is writte
- Page 7 and 8: thesis, the work is just being publ
- Page 9 and 10: ved at blive publiceret i Standards
- Page 11: viii
- Page 15 and 16: xii
- Page 17 and 18: 3.3.3 Refining E. coli and Shigella
- Page 19 and 20: xvi
- Page 21 and 22: xviii 2.17 Pan- and core-genome plo
- Page 24 and 25: Chapter 1 Introduction Introduction
- Page 26 and 27: Chapter 2 Comparative Genomics 2.1
- Page 28 and 29: Comparative Genomics the publicly a
- Page 30 and 31: Comparative Genomics source CDS tot
- Page 32 and 33: Comparative Genomics 1 mysql -N -B
- Page 34 and 35: Listing 2.8: R code to generate a 2
- Page 36 and 37: 1st U C A G U 2nd position C A G 3r
- Page 38 and 39: 1st U C A G U 2nd position C A G 3r
- Page 40 and 41: Escherichia coli strain K-12, subst
- Page 42 and 43: Comparative Genomics
- Page 44 and 45: 3M 2.5M 3.5M 2.5M 2M 0M 2M 0.5M B.
- Page 46 and 47: Streptococcus Escherichia Bacillus
- Page 48 and 49: 2.4 Summary Comparative Genomics Th
- Page 50 and 51: Comparative Genomics 2.5 Instant in
- Page 52 and 53: ‘ReSourCe is he best online submi
- Page 54 and 55: up to a total of 41 different E. co
- Page 56 and 57: Fig. 2 Genes (or segments) from eac
- Page 58 and 59: Fig. 5 BLASTatlas of Clostridium bo
- Page 60 and 61: different applications, such as ide
- Page 62 and 63:
1 Comparative Genomics 2.7 Paper II
- Page 64 and 65:
166 literally millions of bacterial
- Page 66 and 67:
168
- Page 68 and 69:
170 resistance genes on mobile gene
- Page 70 and 71:
172 involved in generating diversit
- Page 72 and 73:
174 recipient DNA. A feature observ
- Page 74 and 75:
176 Fig. 5 Genome length distributi
- Page 76 and 77:
178
- Page 78 and 79:
180 reasons why organisms remain un
- Page 80 and 81:
182 A final problem has to do with
- Page 82 and 83:
184 Middendorf B, Hochhut B, Leipol
- Page 84 and 85:
1 Comparative Genomics 2.8 Paper II
- Page 86 and 87:
2 O. N. Reva et al. Fig. 1. Genome
- Page 88 and 89:
4 O. N. Reva et al. decrease of the
- Page 90 and 91:
6 O. N. Reva et al. of which are kn
- Page 92 and 93:
8 O. N. Reva et al. compiled into a
- Page 94 and 95:
10 O. N. Reva et al. encoded by a c
- Page 96 and 97:
12 O. N. Reva et al. systems and ef
- Page 98 and 99:
1 2.9 Paper IV: The origins of Vibr
- Page 100 and 101:
phylogenies based on alternative ho
- Page 102 and 103:
Figure 1 Phylogenetic tree of the 1
- Page 104 and 105:
25000 20000 15000 10000 5000 0 Pan
- Page 106 and 107:
Gap F 2M 2.5M Gap E 875k 750k 625k
- Page 108 and 109:
Table 2 A selection of genes locate
- Page 110 and 111:
Open Access This article is distrib
- Page 112 and 113:
1 Comparative Genomics 2.10 Paper V
- Page 114 and 115:
4314 74 Tools Abstract: Of the plet
- Page 116 and 117:
4316 74 Tools Size distribution of
- Page 118 and 119:
4318 74 Tools Genome atlas Intrinsi
- Page 120 and 121:
4320 74 Tools Genome atlas Intrinsi
- Page 122 and 123:
4322 74 Tools for Comparison of Bac
- Page 124 and 125:
4324 74 Tools for Comparison of Bac
- Page 126 and 127:
4326 74 Tools information, as genet
- Page 128 and 129:
Chapter 3 rRNA operons and promoter
- Page 130 and 131:
tuB murI Fis III Fis II Fis I UP -3
- Page 132 and 133:
Bits 2.0 1.5 1.0 0.5 0.0 Bits 2.0 1
- Page 134 and 135:
RNA operons and promoter analysis O
- Page 136 and 137:
Bits 2.0 1.5 1.0 0.5 0.0 Bits T A T
- Page 138 and 139:
Code Meaning Example C Coding CCCCC
- Page 140 and 141:
RNA operons and promoter analysis 3
- Page 142 and 143:
P2 -10 -35 UP P1 -10 -35 UP FIS FIS
- Page 144 and 145:
1 rRNA operons and promoter analysi
- Page 146 and 147:
Using HMMs also simplifies the use
- Page 148 and 149:
Information content Information con
- Page 150 and 151:
of the annotation. Some of the majo
- Page 152 and 153:
where match states stop around 10 c
- Page 154 and 155:
1 rRNA operons and promoter analysi
- Page 156 and 157:
synthesis in flow cells to simultan
- Page 158 and 159:
Read absence. A boolean where ‘on
- Page 160 and 161:
Hallin, et al. Figure 4 | The dataf
- Page 162 and 163:
Genome homology: Comparing multiple
- Page 164 and 165:
ing platform-‐independent Java
- Page 166 and 167:
34. Wang H, Noordewier M, Benham CJ
- Page 168 and 169:
Chapter 4 Web Services and Interope
- Page 170 and 171:
Web Services and Interoperability i
- Page 172 and 173:
Web Services and Interoperability i
- Page 174 and 175:
Web Services and Interoperability i
- Page 176 and 177:
Web Services and Interoperability i
- Page 178 and 179:
Chapter 5 Conclusion and perspectiv
- Page 180 and 181:
Appendix A Appendix: Workshops, tea
- Page 182 and 183:
Appendix B Appendix: Ph.D. study pl
- Page 184 and 185:
Danmarks Tekniske Universitet AFI,
- Page 186 and 187:
Danmarks Tekniske Universitet AFI,
- Page 188 and 189:
Appendix C Appendix: Courses C.1 Gl
- Page 190 and 191:
D.2 Sample output from queryGenomes
- Page 192 and 193:
Appendix: Software 13 w a r n " $ o
- Page 194 and 195:
Appendix: Software 109 m y ( $ m i
- Page 196 and 197:
Appendix: Software 25 [ ] A l t e r
- Page 198 and 199:
BIBLIOGRAPHY J. Rogers, P. F. Stadl
- Page 200 and 201:
BIBLIOGRAPHY Q. Jin, Z. Yuan, J. Xu
- Page 202 and 203:
BIBLIOGRAPHY Velicer, F.-J. Vorholt