AWS Databases for CloudOps
Operate managed databases in AWS: RDS, Aurora, backups, Multi-AZ, read replicas, monitoring, Performance Insights, security, and ElastiCache.
What you'll learn
- Differentiate RDS Multi-AZ and read replicas
- Configure backups, snapshots, and maintenance windows
- Monitor RDS with CloudWatch and Performance Insights
- Apply database security and connectivity best practices
- Understand ElastiCache operational patterns
Prerequisites
Relevant for certifications
RDS Operations Model
Amazon RDS manages provisioning, patching, backups, and minor version maintenance. You still own schema design, query performance, access control, and capacity decisions.
Multi-AZ vs Read Replicas
| Feature | Multi-AZ | Read replica |
|---|---|---|
| Purpose | High availability | Read scaling |
| Replication | Synchronous | Asynchronous |
| Failover | Automatic | Manual promotion or application routing |
| Endpoint | Same primary endpoint after failover | Separate replica endpoint |
| Exam cue | DR/HA requirement | Heavy read workload |
Warning
Read replicas are not a substitute for Multi-AZ high availability. They can lag behind the primary.
Backups and Snapshots
| Backup type | Behavior |
|---|---|
| Automated backups | Point-in-time restore within retention window |
| Manual snapshots | Kept until deleted |
| Cross-region snapshot copy | Useful for DR |
| Final snapshot | Optional snapshot before DB deletion |
Operational settings include backup retention, preferred backup window, maintenance window, deletion protection, and storage autoscaling.
Monitoring and Performance
Use these together:
- CloudWatch metrics: CPU, connections, free storage, read/write latency.
- Enhanced Monitoring: OS-level metrics from the DB host.
- Performance Insights: database wait events and top SQL.
- RDS events: failover, maintenance, backup, configuration changes.
- CloudWatch Logs exports: engine logs, slow query logs, audit logs where supported.
Hands-on: Create a Multi-AZ RDS Database
Goal: Deploy a private RDS instance with backups and monitoring.
- Create a DB subnet group with private subnets in at least two Availability Zones.
- Create a security group for RDS that allows the database port from the application EC2 security group only.
- Open RDS > Databases > Create database.
- Choose an engine such as PostgreSQL.
- Select a dev/test template for labs or production template for real workloads.
- Set public access to No, Multi-AZ to enabled, storage encryption to enabled, and backup retention to 7 days or more.
- Enable Performance Insights.
- Create the database.
- Connect from an EC2 instance in the same VPC.
- Create CloudWatch alarms for low
FreeStorageSpace, highCPUUtilization, and highDatabaseConnections.
Hands-on: Create and Promote a Read Replica
- Select an existing RDS database.
- Choose Actions > Create read replica.
- Place it in the same Region or another Region depending on the lab goal.
- Wait until replica status is available.
- Send read-only test queries to the replica endpoint.
- Simulate a migration or DR exercise by choosing Promote.
- Understand the result: the promoted replica becomes a standalone DB instance and no longer receives changes from the old primary.
Hands-on: Restore from a Snapshot
- Create a manual snapshot of an RDS instance.
- Choose Restore snapshot.
- Give the restored DB a new identifier.
- Place it in the correct subnet group and security group.
- Validate the restored data.
- Clean up the restored DB when testing is done.
ElastiCache for CloudOps
ElastiCache provides managed Redis OSS, Valkey, or Memcached.
Use it for low-latency cache, session storage, rate limiting, pub/sub, and lightweight queues for Redis-compatible engines.
Operational checks:
- Keep cache nodes private.
- Monitor evictions, CPU, memory, and connection count.
- Use Multi-AZ with automatic failover for Redis-compatible replication groups.
- Do not treat cache as the only durable copy of critical data.
Common SOA-C03 Exam Questions
Q: Which RDS option provides automatic failover? Multi-AZ deployment.
Q: Which option offloads read traffic from the primary? Read replicas.
Q: Which tool identifies top SQL and database wait events? Performance Insights.
What to Learn Next
- AWS Disaster Recovery for CloudOps - database backup and failover patterns
- AWS Security & Compliance - encryption, secrets, and IAM database authentication
- AWS CloudWatch Monitoring - alarms and log exports
