IT Jobs

Did you know? Techworld now offers an IT Jobs section with hundreds of jobs! Current job listings are now available for Software Developers, Web Developers, Application Engineers, Project Managers, Graduate opportunities and more. Apply for your new IT job today!

Turning back the hands of time

Why continuous data protection means never having to say you're sorry

Every major company, and a few startups as well, seems to be offering continuous data protection. That's to say, instead of backup tapes being partial and unreliable backup copies of your data you can recover every last byte of information, specified down to an exact time; 3.32pm on Thursday 1t October, 2005 for example, if you wish.

Why would you want to? Isn’t this just supplier waffle; mere marketing puff to sell products nobody really needs? Haven't people relied on tape back-up for long enough?

But, according to the continuous data protection (CDP) people, tape backup is unreliable and mirroring won’t answer the one need you have: to turn back time.

When you realise that your system, your server, your workstation, your drive array, has missing or corrupted data then you need to turn back time and have it revert to a state it was in before the data loss or corruption happened.

Tape unreliability
Tape backup unreliability is said to be famous. Tape backup is to data recovery what bus timetables are to real life; endearingly unreliable. It’s supposed to be because backups scheduled for Friday afternoon, POETS day, are not run because the scheduled person leaves early and fails to do it. Or they are not run because the job is started, people leave the office, and find the job uncompleted due to an error that happened while they were away.

Even when they are run data can’t be restored because the tape is dirty or broken, the drive fails or the tape can’t even be found.

D2D and CDP suppliers will push Gartner-type reports your way showing that what you thought of as a reliable tape-based backup system is actually quite the reverse.

Back it up to disk instead
The D2D vendors reckon they have the answer to unreliable tape backups. Backup to disk instead. Overland Storage and Quantum are just two of the suppliers at Storage Expo with D2D products. They’ll tell you that disk-based backup can be automated, that it is rather quicker than tape-based backup and that restores are magically quicker.

This is true. There is the slight problem that you’ll probably find 5 terabytes of tape media is cheaper than 5 terabytes of disk. For our purposes here though, the other problem is that if you backup your data to disk and that data is corrupted or incomplete then you don’t have the ability to restore your system back to a known good point without losing data.

What do I mean? Suppose you do a daily backup to disk at 6pm every day. Next day, at 5pm, your transaction database fails. You need to restore it from your backup. You can do that, but every transaction between 6pm last night and 5pm today is lost.

What the CDP people say is that you need ‘continuous’ data protection, not backups every week or every day.

Mirroring is not CDP
Fine. All right. Let’s send every data byte we write to a second set of disks. Let’s mirror the primary disk set to a secondary set. EMC, H-P, Hitachi Data Systems, IBM and Sun will be delighted to tell you about it and what their software and drive arrays can do for you.

Mirroring is thought of as a near-perfect way to avoid data loss but it has a couple of disadvantages. It’s expensive; you need a second disk array so that writes to the first array are simultaneously made to an identical second array. More importantly from a CDP point of view, you can’t wind the clock back and extract all data up to a previous point in time from a mirror disk or array.

For that you need to add something to every piece of data you write. It has to be time-stamped. And to avoid the expense of a duplicated primary array you need to copy your base data and then copy only the changes to it. It’s like a synthetic backup, which Computer Associates, Symantec’s Veritas or EMC Legato can tell you about. You take a full back of the data you want to protect and then take incremental backups thereafter, saving only the changed bytes, the deltas in the trade’s jargon..

When you need to restore a file it is reconstructed, synthesised you might think, from the original full backup and all the subsequent incremental backups containing data relating to that file. This saves space in the backup media because you aren’t taking full backups of everything each time a backup is run. It saves you backup media space and it saves you backup window time.

CDP is like synthetic backup to disk of all changes
So why won’t synthetic backup to disk do? The answer is that it will, but only if every change is written to the backup disk. Traditional synthetic backups are run, like all tape backup jobs, at intervals, typically of a day or a week.

The CDP suppliers run synthetic backup, aka their software processes, all the time. Talk to EMC, HP, IBM and Network Appliance about their plans for offering this technology.

It was first established by niche technology startups like Revivio and TimeSpring. Now the storage majors are offering it or planning to offer it.

Because the CDP backup data is not mirror data it doesn’t have to go to an identical drive array. It can go to a serial ATA (SATA) array. You could backup data on a set of SCSI drives or a Fibre Channel array, to a cheaper SATA drive array. Having RAID facilities protects against a SATA drive failure. You can backup the SATA array to tape for archive-level protection and off-site vaulting for disaster recovery.

This might be overkill, be excessive protection for many people who just want something more reliable than tape backup and with the advantages of D2D but with better protection that daily synthetic backups to disk.

Near-continuous
Microsoft has just announced its Data Protection manager (DPM) which enables you to take snapshots every hour of your primary array and store them on a separate storage server. Microsoft, Quantum, Dell, and H-P will gladly tell you about this at the show. Symantec will tell you about its Veritas product that can match this, possibly exceed it.

With near-CDP you can turn the clock back to within an hour of the time you want. Much better than weekly or daily tape backup. With true CDP you can turn the clock back to the minute, the second even, that you need.

Talk to the vendors mentioned, talk to storage system houses, and understand the costs involved, and what it means for your daily routine in your IT shop. Identify data sets that might need to be continuously protected. And then, if you do need to turn back time, you might be able to do just that with your data.


What are your views on this subject? Use the form below to post a comment on this article up to 500 characters.


Characters remaining: 500

Related Storage news

Startup offers EMC Centera data migration services

Interlock to move Centera data to any other NAS array

Intel launches $125 'affordable' SSD

Intel X25-V 40GB solid state drive offers lower performance with same per-gigabyte cost

3par offers new type of data tiering

Adaptive optimisation keeps costs lower

British hopeless at backing up PCs

Germans and French plan for disaster

Related Storage reviews



Email this article to a friend or colleague:


PLEASE NOTE: Your name is used only to let the recipient know who sent the story, and in case of transmission error. Both your name and the recipient's name and address will not be used for any other purpose.

Techworld White Papers

Email archiving: Top 10 myths and challenges

This survey looks at a number of challenges and myths around email archiving that may also slow adoption of full archiving.

Download Whitepaper

Strategic mobile deployments

Deploying mobile applications? Supporting multiple devices? See why mobile platforms should be part of your IT strategy.

Download Whitepaper

Creating an AUP: Common myths & mistakes

Avoid the common myths & mistakes when implementing your AUP

Download Whitepaper

Legal risks of uncontrolled email and web use

Exploring the challenges facing IT Mangers today and vital steps to ensure safe internet an email use by employees.

Download Whitepaper

Techworld UK - Technology - Business

COLT White Paper

Virtualisation 2.0
Driving to higher ground beyond the basics

Virtualisation can deliver unparalleled efficiency and cost reductions to your business, allowing direct access to servers and guaranteeing a dependable, rapid response in times of crisis. Read this e-book to learn more about consolidation, discover the latest technologies and find out how to reduce the TCO of virtualisation.

Download E-Book
COLT White Paper

IT Misuse Survey

Complete this survey and you could win a Nexus One

Techworld are running a short survey to discover how UK businesses are managing Internet and email misuse in the Enterprise.

Complete Survey

Webcast: IT Financial Management: Cost Optimisation for Efficiency and Agility.
On Demand Webcast
Join this webcast to learn about the techniques and technologies that can help you prove the value of IT to the business by understanding the true cost of today's IT services and those that will be necessary to deliver future success.

Register Today

Site Map

IDG Network

* *