2010 Best Practices Competition IT & Informatics HPC
IT Informatics - Cambridge Healthtech Institute
IT Informatics - Cambridge Healthtech Institute
- No tags were found...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Figure 3 Scientific data workflow (Feb. <strong>2010</strong>)<br />
Scalable storage<br />
The dedicated storage capacity for NextGen projects has increased from ~80TB to over 200TB with a<br />
new scalable Isilon storage system with a single name space file system. This system provides robust<br />
performance, redundancy and scalability. Being able to manage the very large amounts of storage<br />
required to support biomedical research using the minimal of <strong>IT</strong> support allows researchers to<br />
concentrate on their research and <strong>IT</strong> to concentrate on building better <strong>IT</strong> infrastructure in support of<br />
scientific programs. The Isilon system uses a modular architecture and symmetric clustered file system so<br />
that tasks such as adding additional storage to the storage cluster is as simple as plugging in additional<br />
storage arrays. This helps to minimize costs while providing a solution that can grow as the data storage<br />
requirements continue to increase.<br />
Backup optimization<br />
In addition to the Isilon storage system, TGen used Ocarina storage optimization appliances to compress<br />
data before backup, saving considerable overhead on the backup systems. This makes it feasible to<br />
backup more of the sequencing data.<br />
File sharing<br />
File sharing with external collaborators and other partners is accomplished using the Aspera FASP file<br />
transfer technology. This technology allows optimal use of the network bandwidth to achieve high<br />
throughput file transfer across the Internet.