Research Report 2003 - Max-Planck-Institut für molekulare Genetik

More documents

Recommendations

Info

Department of Vertebrate Genomics 20 b) DNA expression profiling Our group has carried out data analysis service for more than 20 projects from 10 different groups within and outside the department. Different technologies (Affymetrix, glass-chips, nylon-arrays) have been incorporated in the data analysis modules. Research covers the full pipeline from image analysis, normalisation, cluster analysis, reverse engineering and modelling and simulation. c) “Meta-clustering” We set up a framework to compare and validate various clustering methods for gene expression profiles. Since the concept of co-regulation is fundamental to large gene expression screenings and since each individual method has a certain bias we focus on validation methods for gene expression clusters by robust normalisation of the data, by the comparison of different methods, by the definition and implementation of numerical cluster validation methods and the comparison of gene expression data to alternative data sources. A graphical user interface to a comprehensive data-mining tool, BioMiner, has been set up in close collaboration with the bioinformatics start-up MicroDiscovery GmbH Berlin. d) Data integration We define methods and strategies to correlate and integrate data from various sources into a unifying gene-based concept. As an example we correlated data from EST-mining, RT- PCR and whole-mount in-situ hybridisations from mouse orthologues to human chromosome 21 genes. Further attempts are ongoing with the AG Yaspo to validate EST-mining data and gene expression data. In order to take into account transcriptional regulation we work on a joined project with the Department of Computational Biology (Prof. Vingron) in order to evaluate gene expression data and transcriptional profiling data coming from comparative cross-species sequence analysis. e) Statistical service The bioinformatics group has contributed to the statistical evaluation of other data sets such as the comparative analysis of the 2R hypothesis, the analysis of peptide peak-lists derived from 2D-gels (AG Gobom) and the large-scale sequence analysis of bacteria genomes (AG Russo). f) Simulation and Modelling Modelling and simulation systems are a valuable tool for the understanding of complex systems. We developed an object-oriented environment, PyBioS, that is able to process such systems. In the context of array hybridization experiments we used this system for the simulation of artificial gene expression data in order to identify and validate important experimental parameters. In collaboration with the Kinetic modeling group (AG Klipp) we established a modeling platform for in-silico experiments in the context of target validation. g) Visualisation We develop tools for data quality control and visualisation. The program Xdigitise was designed for the visualisation and manipulation of TIF-images in order to check image analysis results. The automated image analysis program, FA, for hybridisation images has been developed and successfully applied to various projects. Furthermore, we are developing the Java tool A-Cgen for the purpose of array data quality control. This tool is implemented in the chip-processing pipeline that we set up with the automation group (AG Hultschig). We currently implement modules for data quality control, normalisation and the detection of differentially expressed genes. Furthermore, the tool allows webconnections to data resources such as GenomeMatrix, Entrez server and Ensembl. Sequence Analysis (Hennig) The main research areas and activities of our group are evolutionary sequence analysis, generalized sequence annotation based on GeneOntology, clustering of redundant cDNA sequences (ESTs), species-species comparison in terms of orthology, and the development of automatic high-throughput procedures for genome annotation.
a) Evolutionary studies In a recent publication we have tested the 2R hypothesis, which postulates 2 rounds of genome duplications at the origin of vertebrates, and found high significance that there was at least one genome duplication around 600 myr ago. The study was based on a set of genes (proteins) orthologous between 4 invertebrate (yeast, C.elegans, drosophila, amphioxus) and 2 vertebrate species (human, mouse), which we constructed from public and inhouse (amphioxus) sources. Based on ~3000 groups of orthologous genes we found an almost 3-fold increase in gene numbers from invertebrates to vertebrates. The majority of those extra duplicates could be dated to a time interval around 600 myr ago, i.e. shortly after the emergence of amphioxus. Moreover, we identified a significant number of segments in the human genome sequence, which can be clearly shown to be derivatives of large-scale ancient duplication events. b) Generalised sequence annotation by GeneOntology (GO) The absolute need for a unified vocabulary for description of genes and their products led to the foundation of the GeneOntology consortium, which currently provides a hierarchy of more than 11.000 terms. In order to facilitate the task of annotating anonymous sequence data, e.g. from an in house EST project, we have developed an automated system (http://goblet.molgen.mpg.de) able to perform GO annotations on any kind of coding sequences (cDNA, protein). We also demonstrated that import of GO-terms from existing species annotations is meaningful even in cases of significant evolutionary distance and in the majority of cases gives correct results. c) High-throughput sequence clustering and genome analysis Our group was involved in many sequencing projects carried out mainly at the MPIMG. For various model organisms (zebrafish, sea urchin, amphioxus, medaka ) gene catalogs were developed by large scale EST analysis. An important step in the analysis was the clustering of EST sequences into unique contigs. We have developed an automated system that performs clustering of several 100.000 ESTs typically within a day generating one unique sequence (contig) per cluster. For the genomic sequencing projects carried out in house (human chromosomes 17, 21, X) we designed an automated pipeline of analysis steps called GenscanX. Parts of our analysis entered the final annotation of the human chromosomes 17, 21 and X. We also participated in several smaller projects focusing on the analysis of single disease genes. Recently, we started to develop a system for identification of non-mRNA genes in mammalian genomes. Future perspectives a) NGFN-2 (National Genome Research Network) - Our group is well established within the NGFN and we plan to maintain these efforts by applying for funding in the 2nd round of NGFN. Main focus here will be data integration aspects, tools for functional genomics analysis and detection and analysis of non-mRNA genes. Furthermore, our group is integrated in the disease-oriented networks “Infection and Inflammation” and “Cardiovascular Diseases” carrying out statistical analysis of clinical data and detection of disease relevant genes. b) EU-Framework 6 - A further direction will be Systems Biology, as a methodological concept to integrate bioinformatics with modelling and simulation. Here, we successfully applied for a grant within the EU-Framework 6 program. The project will start 2004 in collaboration with the EBI-Ensembl group, the School of Computer Science Tel-Aviv University, LION Bioscience Ltd. Cambridge and MicroDiscovery GmbH Berlin. We will construct a software platform for the modelling of disease processes. c) BioRegio - Our group applied (together with MicroDiscovery GmbH, Scienion GmbH and the German Institute for Human Nutrition) for a BMBF BioRegio grant on the large-scale screening of a mouse model for diabetes and obesity in various relevant tissues and through time points of disease progression. This passed the first evaluation round and is currently under revision. MPI for Molecular Genetics Research Report 2003 21
Page 1 and 2: Max-Planck-Institut für molekulare
Page 3 and 4: Table of Contents The Max Planck In
Page 5 and 6: MPI for Molecular Genetics Research
Page 7 and 8: The Max Planck Institute for Molecu
Page 9 and 10: Department of Vertebrate Genomics I
Page 11 and 12: To generate high resolution express
Page 13 and 14: Competitive position Since a long t
Page 15 and 16: the Protein Structure Factory. Fina
Page 17 and 18: Mass Spectrometry Group Head: Dr. J
Page 19 and 20: different tissues of A. thaliana, w
Page 21: Bioinformatics Group Heads: Ralf He
Page 25 and 26: Fuchs T, Malecova B, Linhart C, Sha
Page 27 and 28: Mouse, Medaka & MHC Group Head: Dr.
Page 29 and 30: General information Publications 19
Page 31 and 32: Jochen Wittbrodt, EMBL, Heidelberg
Page 35 and 36: THETA - Two Hybrid Transfected-cell
Page 37 and 38: Academical co-operations Andreas Ra
Page 39 and 40: The investigation of signaling path
Page 41 and 42: In vitro Ligand Screening Group Hea
Page 45 and 46: Neurodegenerative Disorders Group H
Page 47 and 48: Automation Group Graduate students:
Page 49 and 50: In addition, novel methods for the
Page 51 and 52: Sittler A, Walter S, Wedemeyer N, H
Page 53 and 54: Prof. Dr. Joachim Klose, Institut f
Page 55 and 56: eral years. This ambiguity is partl
Page 57 and 58: Clark MD, Panopoulou GD, Cahill DJ,
Page 59 and 60: integrations were passaged to the g
Page 63 and 64: Protein Group Scientists: Sabine Ba
Page 65 and 66: Kersten B, Bürkle L, Kuhn EJ, Giav
Page 67 and 68: Cardiovascular Genetics Group Head:
Page 69 and 70: protein interaction partners and at
Page 73 and 74:
Chromosome 21 Group Head: Marie-Lau
Page 75 and 76:
General information Publications 12
Page 77 and 78:
Department of Human Molecular Genet
Page 79 and 80:
Network (NGFN). Recently, DNA from
Page 81 and 82:
Finally, several of these groups ar
Page 83 and 84:
Neurochemistry Group & Mouse Lab Gr
Page 85 and 86:
studies. Furthermore, these mice wi
Page 87 and 88:
Noonan syndrome and related disorde
Page 89 and 90:
Chromosome Rearrangements & Disease
Page 91 and 92:
General information Selected Public
Page 93 and 94:
DNA Microarrays Scientists: Dr. Fik
Page 95 and 96:
General information Publications He
Page 97 and 98:
that in addition to reduced prolife
Page 99 and 100:
Christoph Redies, Prof. Dr., Instit
Page 101 and 102:
Since my move to the MPIMG in 11/20
Page 103 and 104:
Biochemistry of Inherited Brain Dis
Page 105 and 106:
Schweiger S, Chaoui R, Tennstedt C,
Page 107 and 108:
Until 2002, 13 genes have been impl
Page 109 and 110:
General information Selected public
Page 111 and 112:
Department of Computational Molecul
Page 113 and 114:
EU: The European Molecular Biology
Page 115 and 116:
EST tissue distribution Besides the
Page 117 and 118:
Protein Families & Evolution Group
Page 119 and 120:
General information Publications 20
Page 121 and 122:
We make use of the well-known HMMer
Page 123 and 124:
Theses Benjamin Georgi: A graph-bas
Page 125 and 126:
wide analysis of tyrosine as well a
Page 127 and 128:
Further projects include multiple t
Page 129 and 130:
Co-operations Identification and fu
Page 131 and 132:
upstream of the translational start
Page 133 and 134:
Department of Developmental Genetic
Page 135 and 136:
2000; see figure 1). So far, we hav
Page 137 and 138:
partners are. We also want to find
Page 139 and 140:
Emeritus Group General Molecular Ge
Page 141 and 142:
Sethmann S, Ceglowski P, Willert J,
Page 143 and 144:
Research Group Development & Diseas
Page 145 and 146:
MPI for Molecular Genetics Research
Page 147 and 148:
Goals An important future goal is t
Page 149 and 150:
Independent Junior Research Groups
Page 151 and 152:
over, daf-9 is tightly regulated by
Page 153 and 154:
European C.elegans Meeting, Blanken
Page 155 and 156:
and at the ribosomal DNA locus. One
Page 157 and 158:
lencer, has so far remained unchara
Page 159 and 160:
Molecular control of skeletal devel
Page 161 and 162:
BMPs, into a common control network
Page 163 and 164:
this end I plan to analyze mouse mu
Page 165 and 166:
Ribosome Group The Ribosome group c
Page 167 and 168:
General information Selected Public
Page 169 and 170:
group during the same period. In ad
Page 171 and 172:
11) Agmon I, Auerbach T, Baram D, B
Page 173 and 174:
Reconstitution & function The ribos
Page 175 and 176:
Wendrich TM, Blaha G, Wilson DN, Ma
Page 177 and 178:
Miscellaneous Research Groups There
Page 179 and 180:
vealed that the extraordinary stabi
Page 181 and 182:
34. Thorsted PB, Macartney DP, Akht
Page 183 and 184:
esis and DNA packaging of this phag
Page 185 and 186:
Other launched projects concern con
Page 187 and 188:
BMBF 01GR0105, Plattform 1.1: NGFN
Page 189 and 190:
DNA synthesis, lipid biosynthesis,
Page 191 and 192:
Administration & Research Support H
Page 193 and 194:
Technical Management and Workshops
Page 195 and 196:
tion based on the simplified GOOD a
Page 197 and 198:
Library Librarians: Dipl.-Fachinf.
Page 199:
How to get to the Institute Max Pla
show all

Research Report 2003 - Max-Planck-Institut für molekulare Genetik

Create successful ePaper yourself

Delete template?

Save as template?