Chaos Engineering

Chaos Engineering on Serverless Fargate

Demo Fargate Steady State

Before introducing an I/O stress fault into our ECS Fargate task, it’s essential to capture baseline metrics. This steady-state data will help us quantify the impact of the stress test.

Prerequisites

  • You have an active ECS Fargate cluster (e.g., pay-for-adoption).
  • Container Insights is enabled for your cluster.
  • Permissions to view AWS CloudWatch metrics.

Note

Baseline metrics enable you to compare system behavior before and after fault injection. Make sure you record the values for each metric over a consistent time window.

1. Navigate to Container Insights

  1. Open the AWS Management Console.
  2. Go to CloudWatch > Container Insights > ECS.
  3. Select your ECS cluster (for example, pay-for-adoption).
  4. Set the time range to the last 30 minutes.

The image shows an AWS CloudWatch dashboard displaying container insights for ECS clusters, including graphs for CPU utilization, memory utilization, and network activity. There are no alerts present in the dashboard.

2. Review Key Metrics

Use the following table to track your baseline values:

MetricDescription
CPU UtilizationPercentage of vCPU resources consumed
Memory UtilizationPercentage of container memory in use
Network ThroughputIngress and egress data transfer rates (Bytes/s)

Warning

If any metric is already at or near its limit (e.g., > 80% CPU or memory), address capacity issues before proceeding with fault injection.

Next Steps

After recording these steady-state values, we’ll inject the I/O stress fault into the Fargate task and revisit the same metrics to observe deviations from this baseline.


Watch Video

Watch video content

Previous
ECS Fargate Experiment Idea and Hypothesis