TGQR 2010Q2 Report.pdf - Teragridforum.org
TGQR 2010Q2 Report.pdf - Teragridforum.org
TGQR 2010Q2 Report.pdf - Teragridforum.org
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
SDSC<br />
Dash Early Users<br />
Project Title<br />
Principle<br />
Investigat<br />
or<br />
Institution<br />
(s)<br />
Project Overview<br />
Status/Outcomes<br />
Protein<br />
Databank<br />
Phil<br />
Bourne<br />
University<br />
of<br />
California,<br />
San Diego<br />
Protein Data Bank (PDB) is a<br />
worldwide repository of<br />
information about the 3D<br />
structures of large biological<br />
molecules including proteins<br />
and nucleic acids. Alignment-<br />
DB, from the PDB group,<br />
performs predictive science<br />
with queries on pair-wise<br />
correlations and alignments<br />
of protein structures that<br />
predict protein fold space and<br />
other properties.<br />
Initial tests show<br />
speedup of up to<br />
69x on a limited<br />
set of queries<br />
when using flash<br />
drives. Tests are<br />
ongoing to<br />
evaluate the<br />
possibility of<br />
running test for<br />
the full PDB. This<br />
represents an<br />
important use case<br />
for Gordon, i.e., a<br />
semi-persistent<br />
database installed<br />
on SSD’s. This<br />
project is<br />
providing a<br />
mechanism to<br />
assess this<br />
architecture.<br />
Biological<br />
Networks<br />
Mihail<br />
Baitaliuc<br />
University<br />
of<br />
California,<br />
San Diego<br />
Biological Networks<br />
integrates over 100 public<br />
databases for thousands of<br />
eukaryotic, prokaryotic and<br />
viral genomes and provides<br />
software tools needed to<br />
decipher gene regulatory<br />
networks, sequence and<br />
experimental data, functional<br />
annotation, transcriptional<br />
regulatory region analysis.<br />
The software platform<br />
supports biological pathways<br />
analysis, querying and<br />
visualization of gene<br />
regulation and protein<br />
interaction networks,<br />
metabolic and signaling<br />
pathways.<br />
Generated a<br />
synthetic<br />
workload to<br />
observe query<br />
plans using<br />
indexes and<br />
sequential scans.<br />
Initial tests show<br />
speedup of up to<br />
186x for typical<br />
queries when the<br />
database is<br />
installed on<br />
SSD’s.<br />
Palomar<br />
Transient<br />
Peter<br />
Nugent<br />
Caltech,<br />
UC<br />
This project’s goal is to<br />
support the PTF Image<br />
PTF transient<br />
search tested on<br />
134