|
BUSINESS RESUMPTION AND
CONTINGENCY PLANNING PROCEDURES
1. PURPOSE: To
establish mandatory operational requirements for business resumption and
contingency planning within the Office of the State Attorney. It is
designed to provide Office-wide guidance to Chiefs, Supervisors and Staff
in responding to catastrophic events involving SAO facilities and
information resource service interruptions.
2. BACKGROUND: Traditionally,
contingency planning has focused on restoring information resource
services for an automation center, wide area network or similar service.
While remote sites are still accessed for processing and storing
information, most facilities have their own local area networks which link
the various personal computers and share various resources. If a
catastrophic event occurs that makes it impossible for SAO employees to
use that site, the re-establishment of information systems and network
functions is only one part of the resumption of services for a facility.
The critical functions of that facility must be restored and an interim
process must be put into action. "Hot sites" (a reserved space
already equipped with processing capability and other services),
reciprocal agreements, and other arrangements to provide restored services
to their end users should be considered. Critical files that were
processed previously on Local Area Network servers need to be restored so
they can be used in processing during the contingency period. In summary,
both the information technology and the general office environment have to
be restored.
3. RESPONSIBILITIES:
a. The SAO Chief Information Officer
(CIO) is charged with ensuring that a business resumption plan is
developed at all SAO locations. This includes the necessary contingency
plans for critical automated information systems. The CIO is also
responsible for monitoring, reviewing, and evaluating compliance with this
automated information system security.
b. Division Chiefs are responsible
for ensuring that offices and facilities under their control can operate
despite disruptions. These offices and facilities must include business
resumption and contingency planning as vital considerations in their
computer security programs in protecting sensitive information in SAO
automated information systems.
c. The Chief at each SAO remote
office is responsible for the development, periodic testing and updating
of a business resumption plan for that field station and contingency plans
for all general support systems located at that field station.
a. Identify Mission Critical
Functions. The first step of business resumption planning is to
identify mission critical functions and determine their priorities. In the
event of a disaster, certain functions will not be performed. If
appropriate priorities have been set and approved by senior management, it
will be easier for the organization to recover from the disaster and
resume normal operations. Contingency plans shall be consistent with other
site and building emergency plans. All plans designed to continue
essential SAO missions and functions must be coordinated with each other
and recognize the dependent nature of this process.
b. Identify the Resources that
Support Critical Functions. After mission critical functions are
identified, the resources to support the critical functions must be
identified, determine the time frames in which each resource is used (some
are needed daily and others are used only once a month), and to determine
the effect on the mission if the resource is not available. One method
used to identify mission-critical functions and their impact is called
Business Impact Analysis. It includes a review of the site's functions to
understand the impact if they are not performed. A review is done of each
function regarding its impact on operations, end users, interrelationships
with other critical functions, as well as time lines and considering
workload peaks and valleys. Also considered are additional expenses caused
by overtime, the need for temporary employees and other costs associated
with recovery. Finally, the effect of not performing a mission critical
function needs to be examined and considered with regard to its impact
within the organization, externally, and in the media.
c. Anticipating Potential
Contingencies or Disasters. All resources associated with critical
functions should be examined with likely problem scenarios. Various types
and sizes of contingencies should be considered. To better understand
resource needs and their support of critical functions, a contingency
planning team should be formed. Team members should include
representatives from three main areas: functional/business groups,
facilities management, and technology management. This assignment should
not preclude members of these groups from serving in other planning roles.
Members from these areas may include financial management, personnel,
computer security, and physical security. The team should identify likely
problems by using analytical tools, such as existing risk assessment
methodologies and risk assessment software packages.
d.
Selecting Business Resumption and Contingency Planning Strategies. The
primary purpose of this step is to plan how to recover needed resources.
Alternative strategies should be evaluated to consider what controls are
in place to prevent and minimize contingencies.
(1) A contingency planning strategy
normally consists of three parts: emergency response, recovery, and
resumption. Emergency response encompasses the initial actions taken to
protect lives and limit damage. Recovery refers to the steps that are
taken to continue support for critical functions. Resumption is the return
to normal operations. The relationship between recovery and resumption is
important. The longer it takes to resume normal operations, the longer the
organization will have to operate in the recovery mode.
(2) The selection of a strategy
needs to be based on practical considerations as feasibility and cost.
Risk assessment can be used to help estimate the cost of options to decide
on an optimal strategy. Questions to be asked are: Is it more expensive to
purchase and maintain a generator or to move processing to an alternate
site, considering the likelihood of loosing electrical power for serrious
lengths of time? Are the consequences of loss of computer-related
resources sufficiently high to warrant the cost of disaster recovery
strategies? The risk assessment should focus on areas where it is not
clear which strategy is the best. In developing contingency planning
strategies, there are many factors to consider in addressing each of the
resources that support critical functions. The different categories of
resources should each be considered. Some of these factors include: human
resources, processing capability, automated applications and data,
computer-based services, physical infrastructure, documents and papers.
(a) Implementation
(1) Much preparation is needed to
implement the strategies for protecting critical functions and their
supporting resources. For example, one common preparation is to establish
procedures for backing up files and applications. Another is to establish
contracts and agreements, if the contingency strategy calls for them.
Existing service contracts may need to be re-negotiated to add contingency
services. Another preparation may be to purchase equipment, to support a
redundant capability.
(2) Backing up data files and
applications is a critical part of virtually every contingency plan.
Backups are used to restore files after a personal computer virus corrupts
the data or after a hurricane destroys an automation center. System
backups must be tested on a regular basis to ensure that data can be read
from the disks in the event they are needed in an emergency.
(3) It is important to keep
preparations, including documentation, up-to-date. Computer systems change
rapidly and backup services and redundant equipment should also be kept
current. Contracts and agreements also need to reflect any changes. If
additional equipment is needed, it must be maintained and periodically
replaced when it is no longer dependable or obsolete to an organization's
architecture.
(4) Preparation should also include
formally designating people who are responsible for various tasks in the
event of a contingency. These people are often referred to as the
contingency response team. This team is often composed of people who were
also members of the contingency planning team.
(5) There are many important
implementation issues for an organization to consider. Two of the most
important are 1) how many plans should be developed and 2) who will
prepare each plan. The answer will depend on the organization's overall
strategy for contingency planning, and should be documented in the
organization's policy and procedures document.
(6) For small or less complex
systems, the contingency plan may be a part of the computer security plan.
For larger complex systems, the computer security plan could contain a
brief synopsis of the contingency plan, which should be a separate
document. The purpose of the computer security plan is to provide a basic
overview of the security and privacy requirements for a computer system
and the responsible SAO component's plan for meeting those requirements.
It also serves as documentation of the process of planning adequate,
cost-effective security protection for a system. The purpose of the
contingency plan is to document the specific methodology, structure,
discipline, and procedures to be used for emergency response, backup
operations, and post-disaster recovery maintained by the responsible SAO
office as part of its IR security program. This planning will help ensure
the availability of critical resources and facilitate the continuity of
operations in an emergency situation.
(7) Some organizations have one plan
for the entire organization; others have a plan for each distinct computer
system, application, or other resource. Other approaches recommend a plan
for each business or mission function, with separate plans, as needed, for
critical resources.
(8) The number of actual plans
needed depends upon the unique circumstances for each organization.
Coordination and cooperation between resource managers and functional
managers responsible for the mission or business is critical to the
success of any plan.
(b) Documentation. The
contingency plan needs to be: documented, kept up-to-date as the personnel
responsible for implementation of the contingency plan and other factors
change. A written plan is essential to have during a contingency
situation. It should clearly state in simple language sequence of tasks to
be performed in the event of a contingency so that someone with minimal
knowledge can immediately begin to execute the plan. It is important to
store, in a secure environment, up-to-date copies, including one in
electronic format, of the contingency plan in several locations, including
any off-site locations, such as alternate processing sites or backup data
storage facilities. Each member of the contingency plan response team
should have copies of the plan.
(c) Training. All personnel
should be trained in their contingency-related duties. New personnel
should be trained as they join the organization. Refresher training may be
needed and personnel need to practice their skills. Training is
particularly important for effective employee response during emergencies.
Depending on the nature of the emergency, there may be inadequate time to
check a manual to determine correct procedures to protect equipment and
other assets. Practice is necessary in order to react correctly,
especially when human safety is involved.
e. Testing and Revising
(1) A contingency plan should be
tested periodically to identify and correct any problems in
implementation. The plan will become dated as time passes and as the
resources used to support critical functions change. Responsibility for
keeping the contingency plan current should be specifically assigned. The
extent and frequency of testing will SAOry between organizations and among
systems. There are several types of testing, including reviews, analyses,
and simulations of disasters.
(a) A review can be a simple test to
check the accuracy of contingency plan documentation. For instance, a
reviewer could check if individuals listed are still in the organization
and still have the responsibilities that caused them to be included in the
plan. This test can check home and work telephone numbers, organizational
codes, and building and room numbers. The review can determine if files
can be restored from backup tapes or if employees know emergency
procedures.
(b) An analysis may be performed on
the entire plan or portions of it, such as emergency response procedures.
It is more beneficial if the analysis is performed by a member of the
facility staff who did not participate in the development of the
contingency plan, but has a sound knowledge of the critical functions and
supporting resources. This person may also interview functional managers,
resource managers, and their staff to uncover missing or unworkable
sections of the plan.
(c) Organizations may also arrange
disaster simulations. These tests provide information about
flaws in the contingency plan and provide practice for a real emergency.
While they can be expensive, these tests can also provide critical
information that can be used to ensure the continuity of important
functions. In general, the more critical the functions and the resources
addressed in the contingency plan, the more cost-beneficial it is to
perform a disaster simulation.
(2) The results of a "test"
often imply a grade assigned for a specific level of performance, or
simply pass or fail. However, in the case of contingency planning, a test
should be used to improve the plan. If organizations do not use this
approach, flaws in the plan may remain undetected and not corrected.
f. Interdependencies.
Controls can prevent or reduce the effects of a disaster at the facility.
Ideally, controls mutually support and compliment each other. In
combination, they eliminate or lessen the damage occurring as a result of
the destruction, disclosure, or denial of service to critical resources.
(1) Risk assessment provides a tool
(process) for analyzing the security costs and benefits of contingency planning options. In addition, a risk assessment effort can be
used to help identify critical resources needed to support the
organization and the likely threat to those resources. It is not
necessary, however, to perform a risk assessment prior to contingency
planning, since the identification of critical resources can be performed
during the contingency planning process itself.
(2) Physical and environmental
controls help prevent the destruction of automated information systems,
although many of the other controls, such as logical access controls, also
prevent damage. The main threats that a contingency plan address are
physical such as: fires; loss of power; plumbing breaks; and natural
disasters.
(3) Incident handling can be
viewed as a subset of contingency planning. It is the emergency response
capability for SAOrious technical threats. Incident handling can also help
an organization prevent future incidents by recording the incident and
educating personnel about the incident, the circumstances, and the
corrective action taken.
(4) Support and operations in
most organizations include the periodic backing up of critical files. It
also includes the prevention and recovery from more common contingencies,
such as a disk failure or corrupted data files.
(5) Policy is needed to create and
document the organization's approach to contingency planning. The policy
should explicitly assign responsibilities.
g. Cost Considerations. The
cost of developing and implementing contingency planning strategies should
be taken into account and, when included in the strategy, additional
expenses for contracting backup services or duplicate equipment. One
contingency cost that is often overlooked is the cost of testing a plan.
Testing provides many benefits and should be performed, although some of
the less expensive methods (such as a review) may be sufficient for less
critical resources.
5. REFERENCES
a. National Institute of Standards
and Technology, Guidelines for ADP Contingency Planning, FIPS Pub 87;
1981.
|