Configuring the Disaster Recovery Process Between an Active and a Standby Site
You configure disaster recovery between an active site and a standby site to ensure geographical redundancy of network management services.
Before you initiate the disaster recovery process between both sites, perform the following tasks:
Ensure that the connectivity requirements as described in the Disaster Recovery Overview topic are met.
Check whether identical cluster configurations exist on both sites. We recommend that both clusters have the same number of nodes so that, even in the case of a disaster, the standby site can operate with the same capacity as the active site.
Ensure that the same versions of Junos Space Network Management Platform, high-level Junos Space applications, and device adapters are installed at both sites.
Shut down the disaster recovery process configured on Junos Space Network Management Platform Release 14.1R3 and earlier before upgrading to Junos Space Network Management Platform Release 15.2R1 and configuring the new disaster recovery process. For more information, see Stopping the Disaster Recovery Process on Junos Space Network Management Platform Release 14.1R3 and Earlier.
You cannot configure the new disaster recovery process unless you first stop the disaster recovery process that you set up on Release 14.1R3 or an earlier release. You do not need to perform this step on a clean installation of Junos Space Network Management Platform Release 15.2R1.
Ensure that the same SMTP server configuration exists on both sites to receive e-mail alerts related to the disaster recovery process. You can add SMTP servers from the SMTP Servers task group in the Administration workspace. For more information about adding SMTP servers, see Adding an SMTP Server in the Junos Space Network Management Platform Workspaces Feature Guide.
Copy a file that lists the arbitrator devices (one IP address per row) in CSV format, or the custom failure-detection scripts, to the VIP node at the active site. You can refer to the sample files at /var/cache/jmp-geo/doc/samples/.
Decide on the values for the following parameters depending on your network connectivity and disaster recovery requirements:
VIP address and password of both the active and standby sites
Backup, restoration, and Secure Copy Protocol (SCP) synchronization settings
Heartbeat time intervals
E-mail address of the administrator and the dampening interval, in seconds, during which the same error is not reported again, to avoid an e-mail flood
Failure-detection settings such as the failover threshold and the time during which the standby site remains standby if the arbitrator devices are unreachable
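The arbitrator device file mentioned in the prerequisites is described as one IP address per row in CSV format. The following is a minimal sketch, not part of the product, that checks a file of that shape before you copy it to the VIP node. The function name find_invalid_rows and the sample data are hypothetical, and the sketch assumes there is no header row; the sample files at /var/cache/jmp-geo/doc/samples/ remain the authority on the expected format.

```python
import csv
import io
import ipaddress


def find_invalid_rows(csv_text):
    """Return (row_number, value) pairs for rows that are not exactly one valid IP address."""
    problems = []
    for row_number, row in enumerate(csv.reader(io.StringIO(csv_text)), start=1):
        cells = [cell.strip() for cell in row if cell.strip()]
        if not cells:
            continue  # skip blank lines
        if len(cells) != 1:
            # More than one value on a row breaks the one-address-per-row rule.
            problems.append((row_number, ",".join(cells)))
            continue
        try:
            ipaddress.ip_address(cells[0])  # accepts IPv4 and IPv6 literals
        except ValueError:
            problems.append((row_number, cells[0]))
    return problems


# Hypothetical file content: two good rows, one bad value, one row with two addresses.
sample = "10.0.0.1\n10.0.0.2\nnot-an-ip\n10.0.0.3,10.0.0.4\n"
print(find_invalid_rows(sample))
```

An empty result means every non-blank row holds a single parsable address. The check says nothing about whether the devices are reachable from either site.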
The following sections explain how to configure disaster recovery at the active and standby sites and how to start the disaster recovery process between the two sites.
Configuring Disaster Recovery at the Active Site
You use the jmp-dr init -a command to configure disaster recovery at the active site. You need to enter values for the parameters that are displayed. The values you enter here are saved in a configuration file.
To configure disaster recovery at the active site:
When you have entered values for all parameters, disaster recovery is initialized at the active site.
Configuring Disaster Recovery at the Standby Site
You use the jmp-dr init -s command to configure disaster recovery at the standby site. You need to enter values for the parameters that are displayed. The values you enter here are saved in a configuration file. By default, if the standby site becomes an active site, it uses the failure-detection mechanism you configured at the active site and the values you entered for file backup and restoration, heartbeat, and notifications.
To configure disaster recovery at the standby site:
When you have entered values for all parameters, disaster recovery is initialized at the standby site.
Starting the Disaster Recovery Process
You use the jmp-dr start command to start the disaster recovery process at both sites. You can also use the jmp-dr start -a command to start the disaster recovery process on the active site and the jmp-dr start -s command to start the disaster recovery process on the standby site.
To start the disaster recovery process:
The disaster recovery process is initialized on the active site and the standby site.
Verifying the Status of the Disaster Recovery Process
We recommend that you execute the jmp-dr health command to verify the status (overall health) of the disaster recovery process at both the active and standby sites when you start the disaster recovery process on both sites. For more information about executing the jmp-dr health command, see Checking the Status of the Disaster Recovery Configuration.
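The order of the commands described in this topic can be summarized as a short checklist. The sketch below only prints that checklist and does not run anything. The names PROCEDURE and render_checklist are hypothetical, the site label on each step is the scope described in this topic rather than the host on which you type the command, and only the commands and options named above are used.

```python
# Ordered (scope, command) pairs taken from this topic; no other
# jmp-dr subcommands or options are assumed.
PROCEDURE = [
    ("active site", ["jmp-dr", "init", "-a"]),
    ("standby site", ["jmp-dr", "init", "-s"]),
    ("both sites", ["jmp-dr", "start"]),
    ("both sites", ["jmp-dr", "health"]),
]


def render_checklist(procedure):
    """Return one numbered line per step, suitable for printing or pasting into a runbook."""
    return [
        f"{number}. [{scope}] {' '.join(command)}"
        for number, (scope, command) in enumerate(procedure, start=1)
    ]


for line in render_checklist(PROCEDURE):
    print(line)
```

Keeping each command as a list of arguments makes it straightforward to extend the table later, for example with the per-site start variants, without changing the rendering code.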