fargate-auto-scaled-backend

A load balanced and auto-scaled api running on AWS ECS.

vpc

A VPC with the below resources is required. Console VPC wizard can create them.

ci

Init workflow - manual trigger

ecs-check Query AWS for existing of service [obtain current task arn].
ecr Apply ECR and vpc endpoints.
build/image [if service doesn't exist] Push a new initial image to ecr.
build/task [if service doesn't exist] Create a new task definition is created.
setup/service Apply ecs service, deploy and auto-scaling.
setup/network Apply vpc link, load balancer and api gateway ingress.
test Basic check on the API /host endpoint.

Deploy workflow - push on main trigger

code/image Build image if changes to src/* detected.
code/task Apply task definition (no changes if the same image).
check Create a deploy boolean based on a new task definition (difference to current) detected.
deploy [if deploy=true] Codedeploy deployment is created and status is monitored.
A blue/green deployment takes place.

Destroy workflow - manual trigger

service Destroy ecs service, deploy and auto-scaling resources.
network Destroy vpc link, load balancer and api gateway ingress resources.
task Destroy task definition.
ecr Destroy ecr and images.

usage

obtain url from terraform - found in github action init / setup / network outputs
curl [url]/dev/host

{
    "message":"Request handled by backend at 2024-09-25T12:28:17.593Z",
    "imageUri":"700011111111.dkr.ecr.eu-west-2.amazonaws.com/fargate-auto-scaled-backend@sha256:78dfc01946306dd6afea2b47b56e196788501bfa93c1b2ee1e90a54e72b56938",
    "hostname":"ip-10-55-161-195.eu-west-2.compute.internal"
}

cpu autoscaling

ECS will auto-scale when CPU reaching upper and lower limits. CPU is for entire ECS service.

Initially, the scale-down-alarm cloudwatch alarm state will be In Alarm as CPU will be < scale down threshold. This is expected.

Simulate a load on the ECS service with curl [URL]/dev/stress-cpu/75/120. This example will run 75% CPU load for 120 seconds.

After that load has completed and the =< 1 minute cool off period. This will trigger a cloudwatch alarm which will in turn trigger the auto-scaling rule(s).

Once that load has finished - after the 120 seconds - the scale down alarm will be triggered and the tasks scaled back down.

setup

In tf/service the below variables are to be considered.

cpu_scale_up_threshold: percentage CPU load to trigger a scale up of tasks.
cpu_scale_down_threshold: percentage CPU load to trigger a scale down of tasks.
max_scaled_task_count: maximum amount of tasks to be allowed.

docker

docker build -t express-app .
docker run -i -e BASE_PATH=dev -p 3000:3000 express-app

terraform

Required deployment iam privileges.

[
    "dynamodb:*", 
    "s3:*", 
    "ecr:*", 
    "iam:*", 
    "ecs:*",
    "ec2:*", 
    "elasticloadbalancing:*",
    "application-autoscaling:*",
    "logs:*",
    "cloudwatch:*",
    "apigateway:*",
    "codedeploy:*"
]

ci config

Required github action variables.

AWS_ACCOUNT_ID
AWS_REGION
AWS_ROLE role with above deployment privileges

Name		Name	Last commit message	Last commit date
Latest commit History 78 Commits
.github		.github
docs		docs
src		src
tf		tf
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
appspec.yaml		appspec.yaml
justfile		justfile
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fargate-auto-scaled-backend

vpc

ci

usage

cpu autoscaling

setup

docker

terraform

ci config

About

Releases

Packages

Languages

chrispsheehan/fargate-auto-scaled-backend

Folders and files

Latest commit

History

Repository files navigation

fargate-auto-scaled-backend

vpc

ci

usage

cpu autoscaling

setup

docker

terraform

ci config

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages