FY2010 - Oak Ridge National Laboratory

More documents

Recommendations

Info

Director’s R&D Fund— Systems Biology and the Environment derived from the switchgrass cultivar “Alamo” using high-performance liquid chromatography (HPLC) and gas chromatography–mass spectrometry (GCMS) methods; (3) set up microcosms to identify exudates from three different switchgrass cultivars exhibiting distinct root architectures to look at genotypic variation; (4) designed a method and apparatus for labeling of plant material using an airtight, closed circulation CO 2 chamber for producing 13 C labeled plant biomass and exudates for soil incubations; (5) conducted soil incubations using synthetic exudate analogs; and (6) developed the initial simple compartment model of microbial community dynamics. During FY 2010, we will finish publications from data collected in previous years and complete follow-on experiments under the new BER Scientific Focus Area funded last year. Information Shared Castro, H. F., A. T. Classen, E. E. Austin, R. J. Norby, and C. W.Schadt. 2010. “Soil microbial community responses to multiple climate change drivers.” Appl. Environ. Microbiol. 76, 999–1007. De Graaff, M. A., A. T. Classen, H. F. Castro, and C. W. Schadt. 2010. “Labile soil carbon inputs mediate soil microbial community composition and plant residue decomposition rates.” New Phytol. 188, 1055–1064. 05199 Next-Generation Computational System for Biological Annotation R. W. Cottingham, S. D. Brown, A. A. Gorin, L. J. Hauser, and D. J. Quest Project Description Most of the genome sequences that have been annotated are rapidly becoming outdated because the annotation process is static and cannot easily integrate new types of data, novel algorithms, or emerging biological concepts (e.g., microRNA regulation from “junk” DNA). Fundamentally the existing annotation systems are capturing only a small fraction of available knowledge both in terms ofvolume of the linked experimental information and in terms of biological understanding of the targeted organisms. We propose a next-generation system to address a number of crucial existing bottlenecks. Our primary goal is to create easy and intuitive access to the annotation process for a wide experimental community, so specialists beyond the core annotation team can contribute their expertise and experimental results directly to the system. The proposed system is designed to (1) support complex system biology concepts, (2) allow both manual curation and fully automatic updates, and (3) provide for evolution of new concepts with minimal implementation effort. The developed framework will be applied to construct a working model of Gene Regulatory Networks (GRNs) in Clostridium thermocellum that will generate experimentally testable predictions of expression levels of a selected set of genes under specified conditions. ORNL has been a leading center for genome annotation. This project will develop a prototype system for the future to enhance ORNL’s position as a center of excellence in biological annotation and provide a long-needed transition toward data management for systems biology. Mission Relevance The capabilities developed by this research will be relevant to the DOE/BER Systems Biology Knowledgebase program and areas of data management and analysis in support of annotation for key DOE Office of Biological and Environmental Research (BER) programs such as Genomic Sciences 94
Director’s R&D Fund— Systems Biology and the Environment Systems Biology, the Joint Genome Institute (JGI) partnership, Bioremediation, Microbial Sequencing, and Genome Annotation that will be critically dependent on appropriate computational capabilities for data management, annotation, and support of experimental direction. The prototype developed in this project will demonstrate a new approach that could be expanded for these programs and the larger research community. Results and Accomplishments A number of whole genome transcriptome data sets for C. thermocellum were collected under normal and ethanol stress conditions using older standard gene expression array technology, newer high-density tiling array technology, and high-throughput sequencing. Analysis methods were developed or integrated and improved to determine differential expression and quantify genes and other features such as operons. Tools were also developed for visualization of both tiling array data and gene expression as determined by sequencing (RNAseq) data in conjunction with genome annotation. Analysis using these tools shows that transcription in the bacterial cell is much more complicated than previously known. We were able to detect the presence of several previously unknown features such as 5ꞌ regulatory RNAs, small regulatory RNAs, and alternative transcription start sites including ones in the middle of annotated genes. Inference of genetic regulatory circuits depends on many things including accurate genome annotation, correct quantification of the genes in the cell, precise structure of operons including transcription start sites, quantification of the transcription factors in the cell, and accurate models for the association of transcription factors to binding sites. Conventional data models (e.g., Genbank files) are inadequate for assembling data of different types into a computational model of the genetic regulatory network because they do not adequately describe concepts associated with regulatory networks and the underlying assumptions of the data used to generate these models. Over the course of this project, we investigated multiple data representation alternatives including SMBL, XML, Chado, and RDF/XML. The goal was to find a model that was appropriate for representing a genetic regulatory network. We were able to determine that BioPAX (built on RDF/XML) is a community adopted data standard that is capable of representing Genetic Regulatory networks. We developed two alternative methods for linking traditional genome annotations to the BioPAX standard. First, we implemented a proof-of-concept annotation representation based on the Semantic Web (RDF/XML). This approach allows one to merge the BioPAX annotation directly with annotation data using the SPARQL query language. Second, we explored the approach of directly importing all of the concepts stored in raw ontologies and the raw data from expression experiments into a Chado relational database schema. The relational database approach is currently better suited to production use, whereas the Semantic Web-based representation is better suited for sharing of scientific data and transparency. Future advancements in Semantic Web technologies may also make it suitable for production. Program Development Since this project began it has become clear that transcriptomics will be the “next big wave” in systems biology research based on rapid advancements in sequencing technology and RNAseq. With the new Illumina technology using 100× sample multiplexing, RNAseq data will be more cheaply generated than alternative technologies so it was fortuitous that we focused on this area. We have discussed incorporating RNAseq transcriptomics as part of the JGI sequencing and annotation pipeline with the JGI management. As a preliminary test they have agreed to sequence 12 samples from some of the Caldicellulosiruptor genomes being studied by ORNL’s BioEnergy Science Center using their Illumina sequencing machines. We will process this data using the RNAseq analysis pipeline created as part of this project. Successful results in this project could include additional Caldicellulosiruptor transcriptome RNAseq samples sequenced by both JGI and ORNL and eventually could include an expanded ORNL annotation pipeline which would include transcriptome analysis as a new product for all researchers. Newmodifications to the 95
Page 1 and 2:
ORNL/PPA-2011/1 Laboratory Directed
Page 3:
ORNL/PPA-2011/1 Oak Ridge National
Page 6 and 7:
NEUTRON SCIENCES ..................
Page 8 and 9:
NATIONAL SECURITY SCIENCE AND TECHN
Page 10 and 11:
05887 Controlling the Catalytic Pro
Page 13 and 14:
Introduction INTRODUCTION The Labor
Page 15 and 16:
Introduction projects for next-gene
Page 17 and 18:
Introduction ― Develop new mode
Page 19 and 20:
Introduction To select the best and
Page 21 and 22:
Introduction Fig. 2. Distribution o
Page 23:
SUMMARIES OF PROJECTS SUPPORTED THR
Page 26 and 27:
Director’s R&D Fund— Science fo
Page 28 and 29:
Page 30 and 31:
Page 32 and 33:
Page 34 and 35:
Page 36 and 37:
Page 38 and 39:
Page 40 and 41:
Page 42 and 43:
Page 44 and 45:
Page 46 and 47:
Page 48 and 49:
Page 50 and 51:
Page 53 and 54:
Director’s R&D Fund— Neutron Sc
Page 55 and 56: Director’s R&D Fund— Neutron Sc
Page 65 and 66: 05306 Structure and Structure Evolu
Page 67 and 68: 05404 Asynchronous In Situ Neutron
Page 77 and 78: Director’s R&D Fund— Ultrascale
Page 105: Director’s R&D Fund— Systems Bi
Page 109 and 110: Director’s R&D Fund— Systems Bi
Page 117: Director’s R&D Fund— Systems Bi
Page 120 and 121: Director’s R&D Fund— Advanced E
Page 135 and 136: Director’s R&D Fund— Emerging S
Page 137 and 138: Director’s R&D Fund— Emerging S
Page 139: Director’s R&D Fund— Emerging S
Page 142 and 143: Director’s R&D Fund— Understand
Page 153 and 154: Director’s R&D Fund— National S
Page 155 and 156: Director’s R&D Fund— National S
Page 157 and 158:
Director’s R&D Fund— National S
Page 159 and 160:
Page 161 and 162:
Page 163:
05573 Rapid Radiochemistry Applicat
Page 166 and 167:
Director’s R&D Fund— Energy Sto
Page 168 and 169:
Page 170 and 171:
Page 172 and 173:
Page 174 and 175:
Page 176 and 177:
Director’s R&D Fund— General Re
Page 178 and 179:
Director’s R&D Fund— General de
Page 180 and 181:
Director’s R&D Fund— General su
Page 182 and 183:
Director’s R&D Fund— General 05
Page 184 and 185:
Director’s R&D Fund— General 20
Page 187 and 188:
Seed Money Fund— Biosciences Divi
Page 189 and 190:
Page 191 and 192:
Page 193:
Page 196 and 197:
Seed Money Fund— Center for Nanop
Page 199 and 200:
Seed Money Fund— Chemical Science
Page 201 and 202:
Page 203 and 204:
Page 205:
Page 208 and 209:
Seed Money Fund— Computational Sc
Page 210 and 211:
Seed Money Fund— Computational Sc
Page 213 and 214:
Seed Money Fund— Computer Science
Page 215 and 216:
Seed Money Fund— Energy and Trans
Page 217 and 218:
Page 219:
Page 222 and 223:
Seed Money Fund— Environmental Sc
Page 224 and 225:
Seed Money Fund— Environmental Sc
Page 227 and 228:
Seed Money Fund— Fusion Energy Di
Page 229 and 230:
Seed Money Fund— Materials Scienc
Page 231 and 232:
Page 233 and 234:
Page 235 and 236:
Page 237 and 238:
Page 239 and 240:
Page 241 and 242:
Seed Money Fund— Measurement Scie
Page 243 and 244:
Page 245 and 246:
Page 247 and 248:
05858 Fabrication of Ultrathin Grap
Page 249 and 250:
Page 251:
Page 254 and 255:
Seed Money Fund— Global Nuclear S
Page 256 and 257:
Seed Money Fund— Neutron Scatteri
Page 259 and 260:
Seed Money Fund— Reactor and Nucl
Page 261 and 262:
Page 263 and 264:
Page 265:
Page 268 and 269:
Seed Money Fund— Physics Division
Page 270 and 271:
Seed Money Fund— Physics Division
Page 272 and 273:
Seed Money Fund— Research Acceler
Page 275 and 276:
Laboratory-Wide Fellowships— Wein
Page 277 and 278:
Page 279 and 280:
Page 281:
Page 284 and 285:
Laboratory-Wide Fellowships— Wign
Page 286 and 287:
Laboratory-Wide Fellowships— Wign
Page 288 and 289:
Index of Project Contributors Coope
Page 290 and 291:
Index of Project Contributors Mille
Page 292 and 293:
Index of Project Contributors Yang,
Page 294:
Index of Project Numbers 05501 ....
show all

FY2010 - Oak Ridge National Laboratory

You also want an ePaper? Increase the reach of your titles

Delete template?

Save as template?