10.07.2015 Views

Developments in Tape Storage and Suitability for HPC Environments


Tape in High Performance Computing Environments
Steve Mackey, SpectraLogic


Rethinking Disk & Tape Strategies…
Organisations using disk-only storage:
• Decreased last year: 13%
• Plan to start using tape for interim storage: 58%
• Plan to start using tape for long-term archiving: 68%

• Media & Entertainment
• Healthcare
• High Performance Computing
• Life Sciences
• Surveillance

• Cost
• Density
• Energy Consumption
• Performance
• Security
• Persistence/lifespan
• Portability
• Reliability

Source: Fleishman-Hillard Research for the Linear Tape Open (LTO) Program


2011 INSIC Tape Applications System Report
“Tape has been shifting from its historical role of serving as a medium dedicated primarily to short-term backup, to a medium that addresses a much broader set of data storage goals, including:
– active archive (the most promising segment of market growth),
– regulatory compliance (approximately 20% - 25% of all business data created must be retained to meet compliance requirements for a specified and often lengthy period), and
– disaster recovery, which continues in its traditional requirements as a significant use of tape.”


Active Archive
Active Archive provides an affordable, online solution to access and store all created data. It is an archive of production data that, no matter how old or infrequently accessed, can still be retrieved online. It may exist on both disk and tape.


Active Archive
Extended File System


Disk Will Be Squeezed
• Speed of access
• Cost
• Energy consumption

• Cost
• Energy consumption
• Long term archive
• Security

[Figure: Memory Systems, Disk Arrays, and Tape, 2011 to 2020]


Tape Requirements for HPC environments
• Extreme scalability for single system active archive
• Performance
– Scalable drive counts for very high throughputs
– Multiple robots to support high daily cartridge exchanges
– Optional 'enterprise' drive support
• Reliability
– Tape read reliability
– Media lifecycle management
– Media replication, RAIT
• System Redundancy & Serviceability
– Robot, robotic interface, robotic controller, PSUs, dual port drives, global hot spare drives, assisted self maintenance
• Storage Density
• Energy Consumption
• Cost per TB


Specifications – Drive Models

Feature | IBM TS1140 | Oracle T10000 C | LTO-5
Capacity (Native) | 4.0 TB | 5.0 TB | 1.5 TB
Transfer Rate (Native) | 250 MB/s | 240 MB/s | 140 MB/s
R/W Compatibility | R/W TS1130; Reformatted TS1120 | R only T10000B; R only T10000A | R/W LTO-4; R only LTO-3
Power Consumption | 51 Watts | 67 Watts | 27 Watts (No sled)
Interfaces | 8 Gb FC; FICON* | 4 Gb FC; FICON | 8 Gb FC; 4 Gb FC
Library Compatibility | IBM TS3500; IBM TS3400; IBM TS3494; Spectra T-Finity | SL8500; SL3000 | Spectra Libraries; IBM Libraries; Oracle Libraries; Quantum Libraries
Media Sources | Two | One | Multiple
MTBF | 237,000 | N / A | 250,000
Load / Unload Cycles | 300,000 | >150,000 | 100,000
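As a rough illustration of what the native capacity and transfer rate figures above imply, the following Python sketch estimates how long each drive would take to fill one cartridge. It assumes decimal units (1 TB = 1,000,000 MB) and uninterrupted streaming at the native rate; the function name `hours_to_fill` is made up for this example.

```python
# Native capacity (TB) and native transfer rate (MB/s) from the drive specifications.
DRIVES = {
    "IBM TS1140": (4.0, 250),
    "Oracle T10000 C": (5.0, 240),
    "LTO-5": (1.5, 140),
}


def hours_to_fill(capacity_tb: float, rate_mb_s: float) -> float:
    """Hours to write one full cartridge at the native streaming rate.

    Assumes decimal units and no stops, retries, or compression.
    """
    megabytes = capacity_tb * 1_000_000  # 1 TB = 1,000,000 MB
    return megabytes / rate_mb_s / 3600


for name, (capacity_tb, rate_mb_s) in DRIVES.items():
    print(f"{name}: {hours_to_fill(capacity_tb, rate_mb_s):.1f} h per cartridge")
```

Real fill times would likely be longer, since mounts, repositioning and retries are ignored here.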


Tape Drive Technology In Use
• ≈75% of HPC Market Utilizes Open Tape Technology
• ≈25% of HPC Market Utilizes Proprietary Tape Technology


LTO Tape Roadmap Today
http://ultrium.com/technology/roadmap.html
Announced April 14, 2010


Future of Tape Capability Secured
January 22, 2010:
• “The scientists at IBM Research – Zurich, in cooperation with the FUJIFILM Corporation of Japan, recorded data onto an advanced prototype tape, at a density of 29.5 billion bits per square inch, about 39 times the areal data density of today's most popular industry-standard magnetic tape product.”*
• “These new technologies are estimated to enable cartridge capacities that could hold up to 35 trillion bytes (terabytes) of uncompressed data.”*
* http://www.zurich.ibm.com/news/10/storage.html


Density / Capacity
• Tape - 4.3 PB+ per sq. m (Terapack design, TS1140 technology)
• High Density NAS - 1.5 PB per sq. m
• 4x – 5x increase in tape density in 5 years (20 TB tapes).
[Figure: Terapack design]
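To put the two density figures side by side, here is a small Python sketch that converts them into floor area for a given archive size. The 100 PB archive size and the helper name `floor_area_m2` are assumptions chosen for illustration; only the densities come from the slide.

```python
# Storage density figures quoted on the slide, in PB per square metre.
TAPE_PB_PER_M2 = 4.3  # Terapack design, TS1140 technology
NAS_PB_PER_M2 = 1.5   # high density NAS


def floor_area_m2(capacity_pb: float, density_pb_per_m2: float) -> float:
    """Floor area needed to hold capacity_pb at the given density."""
    return capacity_pb / density_pb_per_m2


archive_pb = 100  # hypothetical archive size, not from the source
tape_area = floor_area_m2(archive_pb, TAPE_PB_PER_M2)
nas_area = floor_area_m2(archive_pb, NAS_PB_PER_M2)
print(f"Tape: {tape_area:.1f} sq. m, NAS: {nas_area:.1f} sq. m")
print(f"NAS needs {nas_area / tape_area:.1f}x the floor area of tape")
```

The ratio is independent of the archive size, since it is just the ratio of the two densities.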


Energy
“The disk system costs over 25 times more money to power and cool than a similar tape system.”
- Clipper Group, 2007 (5-year cost comparison to SATA disk), Tape and Disk Costs – What it Really Costs to Power the Devices

“The energy cost ratio for a terabyte stored long-term on SATA disk versus LTO-4 is about 290:1.”
- Clipper Group, 2008 (5-year cost comparison to D2D backup), Disk and Tape Square off Again – Tape Remains King of the Hill with LTO4

“The cost of energy alone for the average disk based solution exceeds the entire TCO of the average tape based solution.” “…disk consumes 238 times as much energy as tape under assumptions that lean toward favoring disk.”
- Clipper Group, 2010 (12-year cost comparison to average disk solution), In Search of the Long-Term Archiving Solution – Tape Delivers Significant TCO Over Disk


Scalability - Exascale Storage
• Up to 400,000 slots in a library complex
• Up to 8 Libraries / 16 Robots
• 40 Frames per library
[Figure: T-Finity library architecture supports multi-library complex. Skyway Pass-Through connecting libraries in complex.]
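A back-of-the-envelope check of the "exascale" label can be sketched in Python by multiplying the slot count by a native cartridge capacity. This assumes every one of the 400,000 slots holds a data cartridge and uses the native capacities quoted in the drive specifications (1.5 TB for LTO-5, 4.0 TB for TS1140); the function name is hypothetical.

```python
SLOTS_PER_COMPLEX = 400_000  # maximum slots in a library complex, from the slide


def complex_capacity_pb(slots: int, cartridge_tb: float) -> float:
    """Native capacity in PB if every slot holds one cartridge (1 PB = 1,000 TB)."""
    return slots * cartridge_tb / 1000


for media, cartridge_tb in [("LTO-5", 1.5), ("TS1140", 4.0)]:
    capacity_pb = complex_capacity_pb(SLOTS_PER_COMPLEX, cartridge_tb)
    print(f"{media}: {capacity_pb:,.0f} PB ({capacity_pb / 1000:.1f} EB) native")
```

Under these assumptions only the 4.0 TB cartridge takes a full complex past one exabyte of native capacity.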


Tape Technologies Are Reliable…
Reliability has increased 700% over the technology available a decade earlier
• Advances in the coating of tape film
• Read-after-write data verification
• Error correction codes
• Drive technology features simplified tape paths and servo tracking systems
• Spectra tape libraries offer data integrity verification
Beech, Debbie. “Best Practices for backup and long-term data retention” Sylvatica Whitepaper. The evolving role of disk and tape in the data center. June 2009


Reliability - Orders of Magnitude Greater
• Tape has the best bit hard error rate of any storage medium


What is Tape Debris?
• “Green” tapes have debris
• Debris is typically the result of the manufacturing process
– Oxide shedding
– Fractured base film (mylar)
– Slitting debris
[Figure: Enlarged area of tape section. The large circular contamination is about 25 μm in diameter, roughly twice the width of an LTO4 track.]


Known Problems Caused by Tape Debris
• New tapes are abrasive to the tape drive head surface and can cause excessive wear and reduce its lifespan
• Debris contamination is a common cause of performance problems:
– Poor signal performance can cause temporary data errors leading to read/write retries *
– Debris can migrate into data bands, obstructing reading and writing of data, causing retries or permanent data loss *
– Debris accumulation can cause tape to wind unevenly on the spool, leading to tension control problems that cause temporary errors and retries, or may even cause tape to break
* Source: IBM LTO Media: Optimized for IBM Drives and Automation. The difference is performance


CarbideClean: the Spectra Solution
• Tape burnishing removes loose and embedded particles and smooths asperities in the tape
• Improves the performance level and prolongs the life of the tape head (verifies the tape surface)


Complete Media Lifecycle Management


Terapack Optimised RAIT
– Very fast tape mounts of RAIT sets (up to over 2x the speed of non-RAIT sets or non-Terapack libraries)
– Protection against damaged, dropped or worn media
– Streaming rate scales with the number of data tapes in a RAIT set (up to 8x a non-RAIT stream speed)
– Available in November from HPSS and Spectra
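The two RAIT properties listed above, protection against a lost tape and a streaming rate that scales with the number of data tapes, can be illustrated with a toy Python sketch. It assumes a simple single-parity XOR layout, which is a common RAID-style scheme and not necessarily what HPSS or Spectra implement; all function names are invented for this example.

```python
from functools import reduce


def xor_blocks(blocks):
    """Bytewise XOR of equal-length byte strings."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def stripe(data: bytes, data_tapes: int):
    """Split data into data_tapes equal stripes (zero-padded) plus one parity stripe."""
    size = -(-len(data) // data_tapes)  # ceiling division
    padded = data.ljust(size * data_tapes, b"\0")
    stripes = [padded[i * size:(i + 1) * size] for i in range(data_tapes)]
    return stripes + [xor_blocks(stripes)]


def rebuild(stripes, lost: int) -> bytes:
    """Recover the stripe at index `lost` by XOR-ing all surviving stripes."""
    survivors = [s for i, s in enumerate(stripes) if i != lost]
    return xor_blocks(survivors)


def streaming_rate_mb_s(data_tapes: int, drive_mb_s: float) -> float:
    """Idealised aggregate rate: every data tape streams in parallel."""
    return data_tapes * drive_mb_s


tapes = stripe(b"archive payload for RAIT demo", data_tapes=4)
assert rebuild(tapes, lost=2) == tapes[2]  # any single lost tape is recoverable
print(streaming_rate_mb_s(8, 140), "MB/s for 8 LTO-5 data tapes")
```

With single parity, only one missing tape per set can be rebuilt; the aggregate rate is an upper bound that ignores host and network limits.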


High Availability Architecture
• Dual Active-Active Robotic Transporters
• Dual Robotics Interface Modules (RIM)
• Dual Robotics Control Module (RCM) Architecture
• Global Spare™ tape drive
• Redundant Power Paths
• Data Integrity Verification
– PreScan
– QuickScan
– PostScan


Cost
Acquisition Cost
• Tape's purchase cost remains unbeatable for large systems
– 10 to 15 times less expensive to purchase than disk.
• A 2.7 PB tape system will have a cost of around $0.05 to $0.10 per GB. Larger systems are even lower.
• IT grade disk costs about $1.00 per GB street price; more for enterprise class disk
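The per-GB prices above can be turned into total acquisition costs with a short Python sketch. It assumes decimal units (1 PB = 1,000,000 GB) and that the disk system is sized to the same 2.7 PB as the tape system; the function name `acquisition_cost_usd` is made up for this example.

```python
GB_PER_PB = 1_000_000  # decimal units


def acquisition_cost_usd(capacity_pb: float, usd_per_gb: float) -> float:
    """Purchase cost of a system of capacity_pb at a flat price per GB."""
    return capacity_pb * GB_PER_PB * usd_per_gb


capacity_pb = 2.7
tape_low = acquisition_cost_usd(capacity_pb, 0.05)
tape_high = acquisition_cost_usd(capacity_pb, 0.10)
disk = acquisition_cost_usd(capacity_pb, 1.00)

print(f"Tape: ${tape_low:,.0f} to ${tape_high:,.0f}")
print(f"Disk: ${disk:,.0f}")
print(f"Disk/tape ratio: {disk / tape_high:.0f}x to {disk / tape_low:.0f}x")
```

For these inputs the tape system comes to about $135,000 to $270,000, against about $2.7 million for IT grade disk.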


Sample HPC Customer list
• Argonne National Labs
• FMI - Switzerland
• Honeywell
• NASA Ames
• NASA Goddard
• Fermi National Labs
• Sandia National Labs
• Los Alamos National Labs
• Honda Research
• Colsa
• NCSA
• Howard Hughes Medical Center Research
• CERN


30 Years of Success
• Proven Innovator and History of Success
– Founded in 1979, self-funded, profitable, debt-free growth
– Intelligent integration of complete data protection solutions
– Highest customer satisfaction & support ratings in the industry
• Gold Standard Manufacturer in Tape Innovation
• Manufacturing in Boulder, Colorado


Worldwide Markets
• Leader in data intensive verticals:
– HPC, M&E, Federal and Financial
– Require true enterprise solutions
• Enterprise R&D flows over to mid-market and department level products

[Chart: market share by sector]
• Media and Entertainment: 25%
• Federal: 23%
• High Performance Computing: 17%
• Financial: 14%
• Other: 12%
• Education: 9%


Awards and Recognition
• Ranked #1 in 14 out of 14 categories!
– Overall Product Ranking
– Initial Product Quality
– Product Reliability
– Product Features
– Technical Support
– Sales-Force Competence
– I Would Buy This Product Again
• Ranked #1 in all 7 categories for Mid-Level and Enterprise!


Thank You
