Data Disaster Recovery
Data Disaster Recovery
WHITE PAPER
Abstract
Data Disaster Recovery for SMBs This paper discusses the growing awareness within the small and medium-sized business (SMB) community that business data that most valuable of company assets is at risk and must be better protected. Concepts such as Recovery Time and Recovery Point objectives, common in the large enterprise, are gaining greater awareness as ways to measure and develop data asset protection strategies. Addressing the need for fool-proof data protection and immediate recovery from disasters can be daunting, but fortunately, online backup is solving the problem for a growing number of businesses.
Table of Contents
Executive Summary................................................ page 3 Business Objectives................................................ page 4 Prioritization of Business Processes.............. page 4 Determine Business Objectives..................... page 5 Data Loss Scenarios....................................... page 6 Putting It Together........................................ page 6 Determining a Solution.......................................... page 7 Online Backup Addresses the SMB.................. page 8 Conclusion.............................................................. page 10
3
LIVEVAULT WHITE PAPER
Business Objectives
A recently published article by InfoWorld had the editor describing the value of her laptop as $2 million. Is this outrageous? Absolutely not. The value of data as a business asset does not correlate to the medium that stores that data. If it did, the value of a Monet would equal the price of the canvas.
Inevitably, your computer systems will fail. Determining the business value that data represents to your company is essential in order to plan recovery of that business data when failure occurs. Fortunately, the value of data tends to align with function. Table 1 presents a possible categorization strategy that can be used in determining the value of systems in your organization. This table is a useful guide, but the content will vary by organization. How a company segments and ultimately prioritizes its business applications is highly dependent on individual business requirements. One organization may determine that email is a mission critical application while another may deem it far less critical. Consider the overall business impact of one of your systems being unavailable for an extended outage. What is the effect? Or consider the impact of re-entering data that may be lost in an outage. How much data is irreplaceable? These realities will drive your requirements for recovery objectives at the business application level. Determining priority should be a process that is shared by executive management. The details must be addressed with the stakeholders of the business, where awareness must be built, defined and agreed upon. Only then can the process meet expectations.
Determining the business value that data represents to your company is essential in order to plan recovery of that business data when failure occurs.
Business informationthe information required to run a businessincreasingly exists on server hard drives within computers. To adequately plan to protect that data, it is imperative for businesses to look not only at the value of the data on those systems, but also at the time required to get that data back after a failure and the tolerance for data loss after an event. Step 1 Prioritization of Business Processes The first step in planning a data protection strategy involves a critical look at the business and how it functions. Over the past three decades, computing infrastructure and the data that it manages has been completely integrated into the daily operation of most organizations. How long can an organization continue to operate without its infrastructure or its data?
Impact on Business
Mission Critical
Sample Applications
EDI, commerce, customer support
Business Critical
Cross-organization operations
Operationally Important
Departmental
Table 1
4
LIVEVAULT WHITE PAPER
Step 2 - Determine Business Objectives Once you have established the relative priority of business applications, it is possible to determine objectives for recovery. There are three primary concepts that need to be considered when planning a recovery strategy: Recovery Time Objective (RTO), Recovery Point Objective (RPO) and the scope of the Data Loss Event (DLE).
> Ever increasing management costs associated with storage management >>Management problems associated with storage systems often account for six to nine times the original purchase price Setting RTO and RPO goals requires the organization to look inward and make some clear, rational determinations as to how critical each business application is to the running of the company. Many businesses find that all data is not created equal. The nature of the industry, the organizational culture and the systems in place will all significantly affect these decisions and the resulting RTO, RPO and DLE standards. Some real-world examples will help cement these concepts. Example 1: A law firm with 50 attorneys determines that, in the event of a system failure, it is acceptable for client files to be inaccessible for 48 hours (Recovery Time). However, since the attorneys input data directly into the systems rather than on paper first, near zero data loss is acceptable (Recovery Point). Example 2: A $50 million insurance agency, whose business is dependent on being available to its customers when a disaster strikes, experiences a flood that causes widespread damage throughout the community. The agency must be back online processing customer claims in 4 hours (RTO). However, the
These concepts have been well integrated into the sound business practices of large enterprises. Now they are gaining significant attention in the small to medium enterprise because a series of market dynamics have made comprehensive data asset protection much more economical to smaller companies. These changing dynamics include: > A radical reduction in the cost of disk drive technology >>ATA disk drives have had a significant impact on disk storage costs. IDC predicts that by 2006, ATA disks will be the number one drive technology within the enterprise > Increased broadband penetration into SMB >>Many small and medium businesses now have broadband of one kind or another and are looking for better ways to leverage that network connectivity
Planning Concerns
Recovery Time Objective
Acronym
RTO
Description
The time objective to bring a system back online following a failure The acceptable amount of data loss from the last good backup prior to the point of failure Type and scope of failure scenario that results in data loss
RPO
DLE
5
LIVEVAULT WHITE PAPER
agencys client interactions have a front end paper trail, so re-entry of a small amount of data prior to the failure is acceptable, a 2 hours RPO. Step 3 Data Loss Scenarios Data loss events come in various shapes, sizes and scopes. IT plays an important part in every disaster recovery plan, but by no means the only part. This is especially true when the disaster rises to the level of a site-wide or regional disaster where the entire business facility is inaccessible. In these situations, the data processing aspects of the business need to be addressed in the larger context of business recovery. For example, a business will respond differently to a database corruption and a building fire. Although a fire is a rare event, the business recovery entails a vastly different scope (employee safety, new facility, communications, etc.) than a purely data-driven event such as a corrupted database. While, taken alone, it may be critical to recover from a database corruption in 4 hours (RTO), if the business is recovering from a fire, the first four hours are usually dedicated to people and to securing a new place of business. So in this scenario, a four-hour RTO for a database is irrelevant. There is nowhere to recover that data to. The scope of a data loss event affects not only the way a company responds, but also how much a company invests in protecting against the event.
For a perspective on how frequently the most common types of data loss occur, refer to Table 2. Disaster recovery planning should address recovery requirements for all relevant types of data loss. Step 4 Putting It Together At this point, the business has looked inward, determined the needs of business functions, prioritized business applications, identified data loss events and begun to define RTO and RPO goals. Now is the time to put all these concepts together and develop a simple chart identifying recovery objectives (RTO and RPO) for each class of application relative to the scope of the data loss event. Table 3 on page 7 provides an example. The data is illustrative only. IT professionals must keep several key points in mind as you develop your companys chart: Correlation of objective to risk Remember, not all Data Loss Events (DLE) are equally likely to occur. Consider the cost trade offs when developing objectives. Corporate buy-in Executive-level business management support is imperative. These requirements and objectives must satisfy the business stakeholder so, without management buy-in, ITs ability to finalize an actionable plan is at risk.
Description/Examples
Human error, deletion, overwrite, data entry error, File corruption, contained virus, application error, Failure or loss of primary storage, e.g. corrupt RAID controller, etc. Site Disaster CPU failure, theft, catastrophic virus Table 2
Frequency
83% 10% 5% <2% <1%
6
LIVEVAULT WHITE PAPER
Representing business value Ensure that the recovery objectives represent the true business value of the data, including opportunity costs. Do your objectives account for the lost revenue when critical systems are down? Are they in line with the reality of recovering from a site loss? Budget The business case will eventually have to be made that these objectives and their subsequent costs are aligned. In many cases, but not all (a notable exception, online backup, is described below) the cost associated with recovery objectives increases as the acceptable time frame decreases. Reality check with other recovery plans It is generally good practice to do one last reality check to confirm that the IT recovery plan fits within the overall business recovery plan. An RTO of 1 hour in event of a site disaster does little good unless there is also a plan to have a server in a computer room with communications gear and operators to run it in a shorter period of time.
Determining a Solution
Large enterprise organizations have been addressing these concepts and issues effectively for decades. But SMBs, equally dependent on data to run their businesses but not equally resourced, are only just now grappling with these issues and their ramifications. Instead of having an entire committee or even a single person to address the task, SMBs have an already stretched IT department that often looks at recovery planning as just one more thing that has to be done with no time or resources to do it. While the planning and recovery tasks are similar for all enterprises, the needs of the SMB are different in many ways from that of the larger enterprise. By nature, SMBs are ultimately concerned with: Ability to address RTO and RPO Its always a priority to get systems back online and minimize
Class
Class 1 Mission Critical File Storage/server
Protected
RTO
RPO
Y Y
4 Hrs 24 Hrs
Site
Class 2 Business Critical File Storage/server
48 Hrs
Y Y
8 Hrs 48 Hrs
Site
Class 3 Operationally Important File Storage/server
3 Days
Y Y
8 Hrs 48 Hrs
Site
7
LIVEVAULT WHITE PAPER
data loss. But every organizations tolerance for delay and loss is different. Recovery Time/Recovery Point standards must be appropriate to your organizations particular needs. Addressing All Data Loss Events (DLE) The effect of unrecoverable data is too devastating to ignore. Therefore, all potential Data Loss Events must be identified and planned for, even if the likelihood of the event occurring is low. While the cost of rapidly recovering from some DLEs may be particularly onerous, planning enables the organization to rationally adjust Recovery Time and Recovery Point standards to offset costs. Limited IT Resources The reality is that IT resources are thin and adding additional tasks to under-resourced organizations often causes something to break. Tight Budgets Ideally, every company would want to have a fully redundant datacenter that can handle a fail-over of the entire business instantly. But this very expensive solution is just not practical for most SMBs. Simplicity Complexity is the enemy of thinly stretched resources. The ideal solution to the problem would solve the entire problem, achieve all objectives and not require burdensome ongoing management. Online Backup Addresses the SMB Online backup and recovery is a solution to this problem that is gaining tremendous market acceptance within small and mid-sized businesses. Online backup and recovery is the process of automatically moving data over the network from its primary servers to offsite storage located within a hardened electronic vault. This data is then available to be restored either over the network or through delivery of a network attached storage (NAS) device containing the recovered data. Online backup is gaining momentum because of its ability to very simply cover the vast majority of business requirements at exceptionally affordable price points. In addition, recent trends in declining storage costs and availability of broadband network
access have enabled large-scale market adoption of this technology. The advantages of online backup are many, but can be simplified into the following categories: Fast RTOs Network recovery of files and entire servers can be efficiently delivered from a simple web browser interface. In many scenarios, Recovery Time is effectively zero meaning immediate return to business operations and greatly improved when a complete recovery from offsite storage is necessary. Instant offsite protection Data is moved offsite continuously, providing near zero data loss and very short RPOs. Guaranteed data recovery It is no longer a secret. The vast majority of data on backup tapes is unrecoverable. Independent analysts confirm that over 50% of all recoveries will fail because of errors in the backup process. By comparison, online backup offers a Service Level Agreement guaranteeing 100% recoverability of data. Remove burden of data protection The initial purchase price of storage infrastructure is often dwarfed by the ongoing cost associated with the management and maintenance of that storage. A primary contributor to that cost is manual backup and recovery. Online backup and recovery is a completely automated, network delivered service that requires no ongoing monitoring or management by internal IT staff. Professional management Online backup includes 24 hour monitoring by online backup and recovery experts who proactively contact you if theres a disruption of your backup process caused by power loss, system failure or other unexpected event. Recovery from anywhere With online backup, the recovery can be initiated using a simple web browser from anywhere in the world. Traditional backup is manual and requires that the recovery be initiated at the server itself, eliminating the opportunity for remote recovery.
Online backup is gaining momentum because of its ability to very simply cover the vast majority of business requirements at exceptionally affordable price points.
8
LIVEVAULT WHITE PAPER
Cost effectiveness Online backup is a managed service, saving SMBs the cost of hardware, software and annual maintenance. Personnel costs, measured in time saved, are also reduced, enabling your scarce IT resources to focus on more strategic activities.
Not all online backup providers offer the same level or type of service. To solve the problems of business data asset protection, be sure to compare attributes of online backup providers featured in table 4 below:
Feature Capabilities
Frequency of backup
Business Benefits
If returning to business with current data is an objective, the frequency of backup is critical. Continuous protection provides RPOs of minutes while with nightly batch backups, even if youre in the 50% of companies whose tapes restore properly, youre still likely to lose 24 hours worth of data. Some providers offer a 100% guarantee on recovery while others make no guarantee that the data is recoverable The storage location of your corporate data is critical. Only the most dependable names in data protection should be trusted. A fully managed service should require no reading of logs or monitoring of any kind by SMB personnel. These tasks add to the cost of ownership and offer many opportunities for errors The service should provide 24x7 monitoring of service operations by backup and recovery experts. Look for secure web access to your service so you can check, manage, test, and restore your data from anywhere you have a browser. Avoid services that require you to install VPNs and other complex remote security setups in order to access your data remotely. Table 4
9
LIVEVAULT WHITE PAPER
Conclusion
Business management no longer questions the value of IT systems and the data contained within. Business managers do, however, expect their IT department to ensure that those systems are properly protected so the business is properly protected. Without foolproof data protection in place, every business is at great risk from the mundane damage caused by human error or a virus as well as the devastating damage of a flood, fire or total system failure. As SMBs get more serious and systematic about disaster recovery, the responsibilities of IT professionals expand to ensure that the company can meet the prime business requirement after a data loss: timely recovery of systems (Recovery Time Objective or RTO), with current, usable data (Recovery Point Objective or RPO). And that must be accomplished while keeping in mind the dynamics of different Data Loss Events (DLE). Solutions abound, but the relevance and costs to a specific company must be closely examined. Small and mid-sized businesses are increasingly aware of these problems as customer expectations and market conditions change. And they are equally aware that complexity, cost and additional management responsibilities are things that the majority of IT shops dont want and just cannot take on. Thankfully, online backup is a solution that addresses these problems, specifically as they relate to the SMB. Online backup is rapidly gaining acceptance and delivering levels of service that were, until recently, only available to the large enterprise. Online backup has the ability to greatly enhance an SMB organizations ability to meet RTO and RPO objectives at cost points that are usually lower than the traditional backup solutions already in place.
LiveVault Corporation 201 Boston Post Road West Marlborough, MA 01752-4667 USA Tel Fax 508-460-6670 508-460-6617
info@livevault.com www.livevault.com
LiveVault Corporation
LiveVault provides the world's only fully managed, automatic data protection service that ensures 100% recoverability in the event of a disaster, virus attack, human error or other data-loss event. Our network of reseller partners delivers enterprise-class off-site data protection to organizations of all sizes.
2001, 2002, 2003 LiveVault Corporation. All rights reserved. LiveVault is a registered trademark of LiveVault Corporation. All other trademarks are the property of their respective owners. For more information please visit: www.livevault.com
10
LIVEVAULT WHITE PAPER