Mức độ ảnh hưởng của Alarm & Event
Giới thiệu
Một số dự án sẽ yêu cầu liệt kê các cảnh báo và ảnh hưởng của nó đối với hệ thống. Đây là danh sách mức độ ảnh hưởng để anh em tham khảo khi phân tích và xử lý các cảnh báo từ AWS.
Phân loại mức độ ưu tiên
High - Cần hành động ngay lập tức
- Hệ thống không khả dụng hoặc bất ổn nghiêm trọng
- Ảnh hưởng trực tiếp đến người dùng cuối
Normal - Cần được giải quyết nhưng không gấp
- Hệ thống vẫn hoạt động nhưng có dấu hiệu giảm hiệu suất
- Có thể ảnh hưởng đến trải nghiệm người dùng
Low - Là thông báo, có thể bỏ qua
- Hệ thống hoạt động bình thường
- Không ảnh hưởng đến người dùng
- Theo dõi và xử lý khi có thể
Trạng thái hệ thống
| Trạng thái | Mô tả | Cần xử lý |
|---|---|---|
| Not available | Hệ thống không khả dụng | ✅ Ngay lập tức |
| Unstable | Hệ thống bất ổn, có thể bị gián đoạn | ✅ Ưu tiên cao |
| Available (possible slowdown) | Hoạt động nhưng có thể chậm | ⚠️ Theo dõi |
| Available (need attention) | Hoạt động bình thường nhưng cần chú ý | ⚠️ Theo dõi |
| Available (no impact) | Không ảnh hưởng | ℹ️ Thông tin |
| No effect | Không có tác động | ℹ️ Thông tin |
| No. | Notification Type | Subject | Body/Detail | Action Priority | System State |
|---|---|---|---|---|---|
| 1 | CloudWatch Alarm | ALARM: "EC2-{InstanceName}-HIGH-MemoryUtilization" | Normal | Available (possible slowdown) | |
| 2 | CloudWatch Alarm | ALARM: "EC2-{InstanceName}-HIGH-StatusCheckFailed" | High | Not available | |
| 3 | CloudWatch Alarm | ALARM: "EC2-{InstanceName}-HIGH-DiskSpaceUtilization" | Normal | Available (need attention) | |
| 4 | CloudWatch Alarm | ALARM: "EC2-{InstanceName}-HIGH-CPUUtilization" | Normal | Available (possible slowdown) | |
| 5 | CloudWatch Alarm | ALARM: "ECS-{ServiceName}-HIGH-MemoryUtilization" | Normal | Available (possible slowdown) | |
| 6 | CloudWatch Alarm | ALARM: "ECS-{ServiceName}-HIGH-CPUUtilization" | Normal | Available (possible slowdown) | |
| 7 | CloudWatch Alarm | ALARM: "ALB-{LoadBalancerName}-HIGH-HTTPCode_ELB_4XX_Count" | Low | Available (no impact) | |
| 8 | CloudWatch Alarm | ALARM: "ALB-{LoadBalancerName}-HIGH-HTTPCode_ELB_5XX_Count" | High | Unstable | |
| 9 | CloudWatch Alarm | ALARM: "ALB-{LoadBalancerName}-HIGH-HTTPCode_Target_5XX_Count" | High | Unstable | |
| 10 | CloudWatch Alarm | ALARM: "ALB-{LoadBalancerName}-HIGH-TargetResponseTime" | High | Available (possible slowdown) | |
| 11 | CloudWatch Alarm | ALARM: "ALB-{LoadBalancerName}-HIGH-RequestCount" | Normal | Available (need attention) | |
| 12 | CloudWatch Alarm | ALARM: "ALB-{LoadBalancerName}-HIGH-UnHealthyHostCount" | High | Unstable | |
| 13 | CloudWatch Alarm | ALARM: "Redis-{ClusterName}-HIGH-EngineCPUUtilization" | Low (Normal nếu node là Primary) |
Available (possible slowdown) | |
| 14 | CloudWatch Alarm | ALARM: "Redis-{ClusterName}-HIGH-CPUUtilization" | Low (Normal nếu node là Primary) |
Available (possible slowdown) | |
| 15 | CloudWatch Alarm | ALARM: "Redis-{ClusterName}-HIGH-DatabaseMemoryUsagePercentage" | High | Available (need attention) | |
| 16 | CloudWatch Alarm | ALARM: "Redis-{ClusterName}-HIGH-CurrConnections" | Low (Normal nếu node là Primary) |
Available (need attention) | |
| 17 | CloudWatch Alarm | ALARM: "Redis-{ClusterName}-HIGH-Evictions" | High | Available (need attention) | |
| 18 | CloudWatch Alarm | ALARM: "Redis-{ClusterName}-HIGH-ReplicationLag" | Normal | Available (need attention) | |
| 19 | CloudWatch Alarm | ALARM: "Redis-{ClusterName}-LOW-FreeableMemory" | Low (High nếu node là Primary) |
Available (need attention) | |
| 20 | CloudWatch Alarm | ALARM: "RDS-{DBInstanceName}-HIGH-CPUUtilization" | Normal | Available (possible slowdown) | |
| 21 | CloudWatch Alarm | ALARM: "RDS-{DBInstanceName}-LOW-FreeableMemory" | Normal | Available (need attention) | |
| 22 | CloudWatch Alarm | ALARM: "RDS-{DBInstanceName}-HIGH-DatabaseConnections" | Normal | Available (need attention) | |
| 23 | CloudWatch Alarm | ALARM: "RDS-{DBInstanceName}-HIGH-ReadIOPS" | Normal | Available (need attention) | |
| 24 | CloudWatch Alarm | ALARM: "RDS-{DBInstanceName}-HIGH-WriteIOPS" | Normal | Available (need attention) | |
| 25 | CloudWatch Alarm | ALARM: "RDS-{DBInstanceName}-HIGH-DiskQueueDepth" | Normal | Available (need attention) | |
| 26 | CloudWatch Alarm | ALARM: "RDS-{DBInstanceName}-LOW-FreeStorageSpace" | Normal | Available (need attention) | |
| 27 | CloudWatch Alarm | ALARM: "RDS-{DBInstanceName}-HIGH-ReplicaLag" | Normal | Available (need attention) | |
| 28 | CloudWatch Alarm | ALARM: "Aurora-{ClusterName}-HIGH-CPUUtilization" | Normal | Available (possible slowdown) | |
| 29 | CloudWatch Alarm | ALARM: "Aurora-{ClusterName}-LOW-FreeableMemory" | Normal | Available (need attention) | |
| 30 | CloudWatch Alarm | ALARM: "Aurora-{ClusterName}-HIGH-DatabaseConnections" | Normal | Available (need attention) | |
| 31 | CloudWatch Alarm | ALARM: "Aurora-{ClusterName}-HIGH-ReadIOPS" | Normal | Available (need attention) | |
| 32 | CloudWatch Alarm | ALARM: "Aurora-{ClusterName}-HIGH-WriteIOPS" | Normal | Available (need attention) | |
| 33 | CloudWatch Alarm | ALARM: "Aurora-{ClusterName}-HIGH-DiskQueueDepth" | Normal | Available (need attention) | |
| 34 | CloudWatch Alarm | ALARM: "Aurora-{ClusterName}-HIGH-AuroraReplicaLag" | Normal | Available (need attention) | |
| 35 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-CPUUtilization" | Normal | Available (possible slowdown) | |
| 36 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-LOW-FreeStorageSpace" | Normal | Available (need attention) | |
| 37 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-YELLOW-ClusterStatus.yellow" | Normal | Unstable | |
| 38 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-RED-ClusterStatus.red" | High | Not available | |
| 39 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-ClusterIndexWritesBlocked" | High | Unstable | |
| 40 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-LOW-Nodes" | High | Unstable | |
| 41 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-OldGenJVMMemoryPressure" | High | Unstable | |
| 42 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-MasterCPUUtilization" | Normal | Available (need attention) | |
| 43 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-MasterJVMMemoryPressure" | High | Unstable | |
| 44 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-5xx" | High | Unstable | |
| 45 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-ThreadpoolWriteQueue" | Normal | Available (possible slowdown) | |
| 46 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-ThreadpoolSearchQueue-Average" | Normal | Available (possible slowdown) | |
| 47 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-ThreadpoolSearchQueue-Maximum" | High | Unstable | |
| 48 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-ThreadpoolSearchRejected" | High | Unstable | |
| 49 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-ThreadpoolWriteRejected" | High | Unstable | |
| 50 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-AutomatedSnapshotFailure" | Automated snapshot failed; backups may be missing | High | Available (need attention) |
| 51 | CloudWatch Alarm | ALARM: "OpenSearch-{DomainName}-HIGH-JVMMemoryPressure" | JVM memory pressure is high; risk of instability | High | Unstable |
| 52 | CloudWatch Alarm | ALARM: "SES-HIGH-Bounce" | High | Available (need attention) | |
| 53 | CloudWatch Alarm | ALARM: "SES-HIGH-Complaint" | High | Available (need attention) | |
| 54 | CloudWatch Alarm | ALARM: "WAF-{WebACLName}-HIGH-BlockedRequests" | Low | Available (no impact) | |
| 55 | Budget | Budget - 80% threshold exceeded | When your actual cost is greater than 80% of your budgeted amount | Before 23rd: High After 23rd: Low |
Available (need attention) |
| 56 | Budget | Budget - 100% threshold exceeded | When your actual cost is greater than 100% of your budgeted amount | High | Available (need attention) |
| 57 | Budget | Budget - 105% threshold exceeded | When your actual cost is greater than 105% of your budgeted amount | High | Available (need attention) |
| 58 | CloudWatch Alarm | ALARM: "Amplify-{AppName}-HIGH-4xxErrors" | Low | Available (no impact) | |
| 59 | CloudWatch Alarm | ALARM: "Amplify-{AppName}-HIGH-5xxErrors" | High | Unstable | |
| 60 | CloudWatch Alarm | ALARM: "Amplify-{AppName}-HIGH-Latency" | Normal | Available (possible slowdown) | |
| 61 | CloudWatch Alarm | ALARM: "Lambda-{FunctionName}-HIGH-Errors" | High | Unstable | |
| 62 | CloudWatch Alarm | ALARM: "CloudFront-{DistributionName}-HIGH-5xxErrorRate" | High | Unstable | |
| 63 | CloudWatch Alarm | ALARM: "ApiGateway-{ApiName}-HIGH-5XXError" | High | Unstable | |
| 64 | CloudWatch Alarm | ALARM: "ApiGateway-{ApiName}-HIGH-Count" | Normal | Available (need attention) | |
| 65 | CloudWatch Alarm | ALARM: "ApiGateway-{ApiName}-HIGH-Latency" | Normal | Available (possible slowdown) | |
| 66 | ASG Event | Auto Scaling Group - Fail to Launch | High | Not available | |
| 67 | ASG Event | Auto Scaling Group - Fail to Terminate | Normal | Available (need attention) | |
| 68 | RDS/Aurora Event | RDS Notification Message | Message: failover | High | Not available |
| 69 | RDS/Aurora Event | RDS Notification Message | Message: failure | High | Not available |
| 70 | RDS/Aurora Event | RDS Notification Message | Message: maintenance | High | Not available |
| 71 | EventBridge Rules | Health Event | AWS Health notifications | High | Unstable |
| 72 | EventBridge Rules | OpenSearch Event | Cluster health status change or alert | High | Unstable |
| 73 | EventBridge Rules | ECS Task STOPPED | Scaling activity initiated by deployment | Low | No effect |
| 74 | EventBridge Rules | ECS Task STOPPED | Task failed ELB health checks | High | Unstable |
| 75 | EventBridge Rules | ECS Task STOPPED | ResourceInitializationError | High | Unstable |
| 76 | CodePipeline Event | CodePipeline - Succeeded | Pipeline execution succeeded | Low | No effect |
| 77 | CodePipeline Event | CodePipeline - Superseded | Pipeline execution superseded | Low | No effect |
| 78 | CodePipeline Event | CodePipeline - Started | Pipeline execution started | Low | No effect |
| 79 | CodePipeline Event | CodePipeline - Canceled | Pipeline execution canceled | Low | No effect |
| 80 | CodePipeline Event | CodePipeline - Failed | Pipeline execution failed | High | Available (need attention) |
| 81 | EventBridge Rules | ECS Service WARN, ERROR | Service is unstable or experiencing errors | High | Unstable |
| 82 | EventBridge Rules | ECS Deployment Fail | Deployment failed, new tasks not started | High | Available (need attention) |
| 83 | Amplify Deployment Event | Amplify Deployment - Started | Deployment execution started | Low | No effect |
| 84 | Amplify Deployment Event | Amplify Deployment - Succeeded | Deployment execution succeeded | Low | No effect |
| 85 | Amplify Deployment Event | Amplify Deployment - Superseded | Deployment execution superseded | Low | No effect |
| 86 | Amplify Deployment Event | Amplify Deployment - Canceled | Deployment execution canceled | Low | No effect |
| 87 | Amplify Deployment Event | Amplify Deployment - Failed | Deployment execution failed | High | Available (need attention) |