FeaturesectionSolutions for the virtual eraHow deduplication helpsreduce the cost of backup anddisaster recoveryBy Kay Benaroch and Shane Jackson<strong>Dell</strong> and EMC have teamed up to deliver efficientdisk-based backup and recovery with integrateddeduplication technology designed to make disk-basedbackups as cost-effective as tape-based backups.Deduplication pays off<strong>Dell</strong>/EMC DD Series appliancesincorporate deduplication and other keytechnologies to help meet storage andbackup challenges.• Up to 99 percent bandwidth reductionfor replication• 90–97 percent data reduction forbackups• Leading defense against data integrityissues with the Data Domain DataInvulnerability ArchitectureToday’s explosive data growth is prompting manyorganizations to look for ways to increase the efficiencyof their storage, backup, and disaster recovery processes.The growth of data is straining capacity, but at the sametime, cost and complexity frustrate many IT managers seeking tomake improvements. For example, tape-based approaches thatworked well when originally implemented can prove to be slow,expensive, and unreliable as the organization grows and as tapesmust be physically transported from remote sites to a centralstorage location. Disk-based backup is inherently faster and morereliable than tape, but has traditionally been more costly, and largedisk arrays can be difficult to manage.<strong>Dell</strong> and EMC are addressing these challenges by leveraging thedata reduction and cost-saving advantages of deduplication storagesystems from Data Domain, which was purchased by EMC in 2009.Rather than adding deduplication to an existing storage platform, <strong>Dell</strong>and EMC have introduced the <strong>Dell</strong>/EMC DD Series—purpose-builtappliances that incorporate disk-based storage and deduplication.These appliances are designed to tame data growth as effectively aspossible, helping organizations to reduce complexity and costs.Overcoming challenges to effective data protectionCost pressures can keep many organizations from making neededstorage and backup improvements to cover the variety of operatingsystems, applications, and geographic locations that produce critical28 <strong>2010</strong> <strong>Issue</strong> 02 | dell.com/powersolutionsReprinted from <strong>Dell</strong> Power Solutions, <strong>2010</strong> <strong>Issue</strong> 2. Copyright © <strong>2010</strong> <strong>Dell</strong> Inc. All rights reserved.
BackupData center applications(Microsoft, Oracle, SAP, VMware)ArchiveNAS, SAN, DASPrimarystorageFigure 1. <strong>Dell</strong>/EMC DD Seriesinfrastructure and ecosystemBackup applications(EMC, CommVault,Symantec, Vizioncore, CA,IBM, Hewlett-Packard,Atempo, BakBone)Archive applications(EMC, CommVault,Symantec, F5)NetworkReplicationover WANWANDisaster recovery<strong>Dell</strong>/EMC DD Seriesappliance<strong>Dell</strong>/EMC DD Seriesapplianceinformation. IT managers may be forcedto deploy systems piecemeal, requiringadditional management by IT departmentsthat are already stretched, or they mayhave resources for headquarters or centralsites but lack the budget to create acomprehensive plan that encompasses alltypes of servers and applications at local andremote sites.Organizations may also be exposed torisk by doing without essential protectionsuch as off-site disaster recovery. Off-siteprocesses for disaster recovery havehistorically induced latency, promptingmany organizations to simply createlocal copies to speed local recovery—anapproach that exposes the organizationto data loss in the event of accidentaldeletion or site disasters. In addition, thecost of bandwidth to replicate the growingvolume of critical information over a widearea network (WAN) is often prohibitive,especially for remote or branch offices.Complexity in the backup environmentis another factor that can delay storage andbackup improvements. Many organizationshave multiple backup software instancesor heterogeneous environments. Thiscomplexity can increase operational andprocedural errors, leading to unprotected,corrupted, or lost backup data. As a result, ITmanagers may not be able to meet businessneeds for data restoration and recovery.To address these challenges withoutrequiring a costly redesign of backupprocedures, IT managers are exploring waysto remove duplicate data from the backupprocess, reducing data volume to the pointthat they can use fewer hard drives thanthey had been. Finding an approach thatworks with a flexible range of backup andarchive software is also desirable. <strong>Dell</strong> andEMC are working collaboratively to createstorage and backup systems that includeintegrated deduplication functionality.Examining technical requirementsBuilding a storage system that delivers the fullpotential of deduplication requires a flexibleapproach that addresses variable-lengthduplicates, multiple formats and protocols,and other technical considerations.Variable-length duplicates. Conventionalstorage systems, whether network attachedstorage (NAS) or storage area networks(SANs), store data in fixed-size blocks.Some deduplication systems simply lookfor identical fixed-size data blocks, anapproach that is inadequate for maximizingdeduplication results; to be effective, asystem must be able to identify variablelengthsegments as well. This capabilityis necessary to maximize the amountof redundant data that can be found inincoming data blocks, regardless of smallchanges in those blocks compared withprevious backups.Format parsing. Data comes in manyformats generated by many differentapplications, and the same duplicate data isoften embedded in those different formats. Thesheer number of these formats and the speedat which they change make it impractical fora storage vendor to support them all. Parsingthe formats requires substantial overhead. AReprinted from <strong>Dell</strong> Power Solutions, <strong>2010</strong> <strong>Issue</strong> 2. Copyright © <strong>2010</strong> <strong>Dell</strong> Inc. All rights reserved.dell.com/powersolutions | <strong>2010</strong> <strong>Issue</strong> 02 29