28.12.2014 Views

TGQR 2010Q2 Report.pdf - Teragridforum.org

TGQR 2010Q2 Report.pdf - Teragridforum.org

TGQR 2010Q2 Report.pdf - Teragridforum.org

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

SDSC<br />

Dash Early Users<br />

Project Title<br />

Principle<br />

Investigat<br />

or<br />

Institution<br />

(s)<br />

Project Overview<br />

Status/Outcomes<br />

Protein<br />

Databank<br />

Phil<br />

Bourne<br />

University<br />

of<br />

California,<br />

San Diego<br />

Protein Data Bank (PDB) is a<br />

worldwide repository of<br />

information about the 3D<br />

structures of large biological<br />

molecules including proteins<br />

and nucleic acids. Alignment-<br />

DB, from the PDB group,<br />

performs predictive science<br />

with queries on pair-wise<br />

correlations and alignments<br />

of protein structures that<br />

predict protein fold space and<br />

other properties.<br />

Initial tests show<br />

speedup of up to<br />

69x on a limited<br />

set of queries<br />

when using flash<br />

drives. Tests are<br />

ongoing to<br />

evaluate the<br />

possibility of<br />

running test for<br />

the full PDB. This<br />

represents an<br />

important use case<br />

for Gordon, i.e., a<br />

semi-persistent<br />

database installed<br />

on SSD’s. This<br />

project is<br />

providing a<br />

mechanism to<br />

assess this<br />

architecture.<br />

Biological<br />

Networks<br />

Mihail<br />

Baitaliuc<br />

University<br />

of<br />

California,<br />

San Diego<br />

Biological Networks<br />

integrates over 100 public<br />

databases for thousands of<br />

eukaryotic, prokaryotic and<br />

viral genomes and provides<br />

software tools needed to<br />

decipher gene regulatory<br />

networks, sequence and<br />

experimental data, functional<br />

annotation, transcriptional<br />

regulatory region analysis.<br />

The software platform<br />

supports biological pathways<br />

analysis, querying and<br />

visualization of gene<br />

regulation and protein<br />

interaction networks,<br />

metabolic and signaling<br />

pathways.<br />

Generated a<br />

synthetic<br />

workload to<br />

observe query<br />

plans using<br />

indexes and<br />

sequential scans.<br />

Initial tests show<br />

speedup of up to<br />

186x for typical<br />

queries when the<br />

database is<br />

installed on<br />

SSD’s.<br />

Palomar<br />

Transient<br />

Peter<br />

Nugent<br />

Caltech,<br />

UC<br />

This project’s goal is to<br />

support the PTF Image<br />

PTF transient<br />

search tested on<br />

134

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!