AWS Disaster Recovery for CloudOps

IntermediateTopic50 min4 min read2 May 2026AWS

Plan and operate AWS disaster recovery with RTO/RPO, backups, pilot light, warm standby, multi-site active-active, AWS Backup, and Route 53 failover.

What you'll learn

  • Explain RTO and RPO in practical operational terms
  • Compare backup and restore, pilot light, warm standby, and active-active
  • Use AWS Backup for centralized backup policies
  • Design DR runbooks and test failover safely

Relevant for certifications

SOA-C03SAA-C03

RTO and RPO

TermQuestion it answers
RTOHow long can the service be down?
RPOHow much data can we afford to lose?

Shorter RTO/RPO usually means more running infrastructure, more replication, and higher cost.

DR Strategies

StrategyCostRecovery speedPattern
Backup and restoreLowSlowRestore infrastructure and data after failure
Pilot lightLow-mediumMediumCore data layer is ready; app layer scales up during DR
Warm standbyMediumFastSmaller full environment already running
Multi-site active-activeHighFastestTraffic served from multiple Regions continuously

Service Building Blocks

  • S3 Cross-Region Replication for object copies.
  • EBS snapshot copy for volume recovery.
  • RDS automated backups and cross-region snapshots.
  • Aurora Global Database for low-lag global database recovery.
  • AWS Backup for centralized policies and vaults.
  • Route 53 failover routing for DNS-level traffic shift.
  • CloudFormation StackSets for repeatable baseline infrastructure.

AWS Backup

AWS Backup centralizes backup policies across supported services.

Core pieces:

  • Backup plan.
  • Backup rule and schedule.
  • Backup vault.
  • Resource assignment by tags or ARNs.
  • Lifecycle to cold storage where supported.
  • Cross-account or cross-region copy.

Hands-on: Create a Basic AWS Backup Plan

Goal: Back up tagged resources daily and retain backups for 35 days.

  1. Tag resources to protect with Backup = daily.
  2. Open AWS Backup > Backup plans.
  3. Choose Create backup plan.
  4. Start from a new plan and name it daily-cloudops-backup.
  5. Add a rule with daily frequency, an off-peak backup window, and deletion after 35 days.
  6. Assign resources by tag with key Backup and value daily.
  7. Create the plan.
  8. Start an on-demand backup to validate permissions.
  9. Confirm the recovery point appears in the vault.

Hands-on: S3 Cross-Region DR Bucket

  1. Create a source bucket in Region A and a destination bucket in Region B.
  2. Enable versioning on both buckets.
  3. Configure replication from source to destination.
  4. Choose or create the replication IAM role.
  5. Upload a new object to the source bucket.
  6. Confirm it appears in the destination bucket.
  7. Test recovery by reading the object from the destination Region.

Hands-on: DR Runbook for a Web App

Use this as a numbered runbook for a simple application:

  1. Confirm the incident scope and declare DR if the primary Region is unavailable.
  2. Freeze deployments to the primary Region.
  3. Validate replicated S3 data in the DR Region.
  4. Restore or promote the database in the DR Region.
  5. Deploy or scale up the application stack in the DR Region.
  6. Run smoke tests against the DR load balancer.
  7. Update Route 53 failover or weighted records to send traffic to DR.
  8. Monitor error rate, latency, and database health.
  9. Record the exact failover time and data recovery point.
  10. After primary recovers, plan failback as a separate controlled change.

Practice matters

A DR plan that has never been tested is only a theory. Schedule game days and record gaps as operational work items.

Common SOA-C03 Exam Questions

Q: Which DR strategy is cheapest but slowest? Backup and restore.

Q: Which service centralizes backup policies across AWS services? AWS Backup.

Q: How do you automate DNS failover between Regions? Route 53 failover routing with health checks.

What to Learn Next

  1. Route 53 and DNS for CloudOps - failover records and health checks
  2. Amazon S3 for CloudOps - replication, object lock, and lifecycle
  3. AWS Account Management - cross-account backup and logging governance

More in Amazon Web Services