ECS Fargate Experiment Idea and Hypothesis

Learn how to use AWS Fault Injection Service (FIS) to validate the resilience of an Amazon ECS microservice running on AWS Fargate under high I/O stress. This guide lays out the idea and hypothesis for a controlled I/O fault on Fargate tasks, so you can verify that your Pet Adoption payment API remains available.

Introduction

AWS Fargate is a serverless compute engine for containers that works with Amazon ECS, letting you run containerized workloads without provisioning or managing servers. In this experiment, the Pet Adoption payment API runs as two Fargate tasks, fronted by an Application Load Balancer and backed by a Pet Adoption database. We then launch an AWS FIS experiment to inject I/O stress and observe how the service behaves.

Before starting, ensure you have the following prerequisites:

An AWS account with permissions to create FIS experiments, ECS clusters, IAM roles, and CloudWatch alarms.
A running ECS Fargate service with at least two tasks.
A target database (e.g., Amazon RDS) for the Pet Adoption back end.

Architecture Overview

Figure: Serverless compute architecture on AWS Fargate, showing the Pet Payment API and the Pet Adoption Database inside a virtual private cloud (VPC). The goal is to keep the application available while the tasks are under high I/O load.

Application Load Balancer distributes incoming traffic to Fargate tasks.
ECS Fargate Tasks run the Pet Payment API.
Pet Adoption Database serves as the back-end data store.

FIS Experiment Phases

We frame each AWS FIS experiment in two parts:

Experiment Phase	Description
Given	The current running state of our ECS Fargate service and its infrastructure.
Hypothesis	The expected system behavior when an I/O fault is injected.

1. Given

Two Fargate tasks in an ECS service named pet-payment-service.
An Application Load Balancer routing traffic to pet-payment-service on port 80.
A connected Pet Adoption database (e.g., Amazon RDS or DynamoDB).

# Verify ECS service and tasks
aws ecs describe-services \
  --cluster pet-adoption-cluster \
  --services pet-payment-service

aws ecs list-tasks \
  --cluster pet-adoption-cluster \
  --service-name pet-payment-service
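
To make the "Given" state checkable rather than just descriptive, a small helper can compare the running task count against the expected minimum. The following is a minimal sketch, not part of the original walkthrough: the function names `has_min_tasks` and `running_task_count` are hypothetical, and the AWS call is wrapped in a function so nothing contacts AWS until you invoke it.

```shell
#!/usr/bin/env bash

# Hypothetical helper: succeeds when the running count meets the required minimum.
has_min_tasks() {
  local running="$1" required="$2"
  [ "$running" -ge "$required" ]
}

# Hypothetical wrapper: reads runningCount for the service from ECS.
# Cluster and service names are the ones used in this guide.
running_task_count() {
  aws ecs describe-services \
    --cluster pet-adoption-cluster \
    --services pet-payment-service \
    --query 'services[0].runningCount' \
    --output text
}

# Usage (requires AWS credentials and the AWS CLI):
#   if has_min_tasks "$(running_task_count)" 2; then
#     echo "steady state OK: at least two tasks running"
#   fi
```

Keeping the comparison separate from the AWS call makes the check easy to reuse before and after the experiment.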

2. Hypothesis

We expect that under high I/O stress on each Fargate task:

The Pet Payment API remains responsive, with an error rate below 5%.
The Pet Adoption web application continues to process payments without downtime.
CloudWatch alarms trigger if latency or error thresholds are breached.
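
The 5% error-rate threshold in the hypothesis can be expressed as a simple pass/fail check over request counts. The sketch below assumes you already have an error count and a total request count for the experiment window (for example, from load balancer metrics). The function names are hypothetical, and treating "no traffic" as a failure is a design assumption, since the hypothesis cannot be confirmed without requests.

```shell
#!/usr/bin/env bash

# Hypothetical helper: prints the error rate as a percentage with two decimals.
# Returns non-zero when there were no requests, because the rate is undefined.
error_rate_percent() {
  awk -v e="$1" -v t="$2" 'BEGIN {
    if (t <= 0) exit 1
    printf "%.2f\n", e * 100 / t
  }'
}

# Hypothetical helper: succeeds when the error rate is strictly below the
# threshold (third argument, in percent; defaults to 5).
within_error_budget() {
  awk -v e="$1" -v t="$2" -v th="${3:-5}" 'BEGIN {
    if (t <= 0) exit 1
    # Compare e/t < th/100 without division to avoid rounding surprises.
    exit !(e * 100 < th * t)
  }'
}

# Usage:
#   error_rate_percent 3 100        # prints 3.00
#   within_error_budget 3 100 && echo "hypothesis holds"
```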

Injecting faults can impact production workloads. Always run experiments in a staging environment or during scheduled maintenance windows. Monitor performance and rollback criteria closely.

Next Steps

Define IAM roles and permissions for AWS FIS.
Create an FIS experiment template that targets the Fargate tasks.
Configure CloudWatch metrics and alarms for latency, error rate, and CPU/I/O usage.
Execute the experiment and review the results.
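
As a preview of the template step, the sketch below writes a candidate FIS experiment template to a file. It is an illustration under stated assumptions, not the template used in the course: the function name, the target and action labels (`paymentTasks`, `ioStress`), the five-minute duration, and the choice to target all tasks are hypothetical. The action ID `aws:ecs:task-io-stress` and the `aws:ecs:task` resource type should be checked against the current AWS FIS actions reference. ECS task-level fault actions also have task-definition prerequisites described in the FIS documentation, which this sketch does not cover.

```shell
#!/usr/bin/env bash

# Hypothetical helper: writes an FIS experiment template to a file.
#   $1 = output path
#   $2 = IAM role ARN that FIS assumes (you must create this role)
#   $3 = CloudWatch alarm ARN used as the stop condition
write_fis_template() {
  local out="$1" role_arn="$2" alarm_arn="$3"
  cat > "$out" <<EOF
{
  "description": "I/O stress on pet-payment-service Fargate tasks",
  "targets": {
    "paymentTasks": {
      "resourceType": "aws:ecs:task",
      "parameters": {
        "cluster": "pet-adoption-cluster",
        "service": "pet-payment-service"
      },
      "selectionMode": "ALL"
    }
  },
  "actions": {
    "ioStress": {
      "actionId": "aws:ecs:task-io-stress",
      "parameters": { "duration": "PT5M" },
      "targets": { "Tasks": "paymentTasks" }
    }
  },
  "stopConditions": [
    { "source": "aws:cloudwatch:alarm", "value": "${alarm_arn}" }
  ],
  "roleArn": "${role_arn}"
}
EOF
}

# Usage (requires AWS credentials; ARNs are placeholders you supply):
#   write_fis_template template.json "$FIS_ROLE_ARN" "$STOP_ALARM_ARN"
#   aws fis create-experiment-template --cli-input-json file://template.json
```

Tying the stop condition to a CloudWatch alarm lets FIS halt the experiment automatically if the latency or error thresholds from the hypothesis are breached.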
