Speed up protected software program releases with new built-in blue/inexperienced deployments in Amazon ECS


Voiced by Polly

Whereas containers have revolutionized how improvement groups bundle and deploy functions, these groups have needed to fastidiously monitor releases and construct customized tooling to mitigate deployment dangers, which slows down transport velocity. At scale, improvement groups spend useful cycles constructing and sustaining undifferentiated deployment instruments as a substitute of innovating for his or her enterprise.

Beginning right this moment, you need to use the built-in blue/inexperienced deployment functionality in Amazon Elastic Container Service (Amazon ECS) to make your software deployments safer and extra constant. This new functionality eliminates the necessity to construct customized deployment tooling whereas providing you with the arrogance to ship software program updates extra steadily with rollback functionality.

Right here’s how one can allow the built-in blue/inexperienced deployment functionality within the Amazon ECS console.

You create a brand new “inexperienced” software atmosphere whereas your current “blue” atmosphere continues to serve stay visitors. After monitoring and testing the inexperienced atmosphere totally, you route the stay visitors from blue to inexperienced. With this functionality, Amazon ECS now offers built-in performance that makes containerized software deployments safer and extra dependable.

Under is a diagram illustrating how blue/inexperienced deployment works by shifting software visitors from the blue atmosphere to the inexperienced atmosphere. You’ll be able to be taught extra on the Amazon ECS blue/inexperienced service deployments workflow web page.

Amazon ECS orchestrates this complete workflow whereas offering occasion hooks to validate new variations utilizing artificial visitors earlier than routing manufacturing visitors. You’ll be able to validate new software program variations in manufacturing environments earlier than exposing them to finish customers and roll again near-instantaneously if points come up. As a result of this performance is constructed straight into Amazon ECS, you’ll be able to add these safeguards by merely updating your configuration with out constructing any customized tooling.

Getting began
Let me stroll you thru an indication that showcases easy methods to configure and use blue/inexperienced deployments for an ECS service. Earlier than that, there are a couple of setup steps that I want to finish, together with configuring AWS Identification and Entry Administration (IAM) roles, which you will discover on the Required assets for Amazon ECS blue/inexperienced deployments Documentation web page.

For this demonstration, I wish to deploy a brand new model of my software utilizing the blue/inexperienced technique to reduce danger. First, I must configure my ECS service to make use of blue/inexperienced deployments. I can do that by means of the ECS console, AWS Command Line Interface (AWS CLI), or utilizing infrastructure as code.

Utilizing the Amazon ECS console, I create a brand new service and configure it as regular:

Within the Deployment Choices part, I select ECS because the Deployment controller sort, then Blue/inexperienced because the Deployment technique. Bake time is the time after the manufacturing visitors has shifted to inexperienced, when prompt rollback to blue is obtainable. When the bake time expires, blue duties are eliminated.

We’re additionally introducing deployment lifecycle hooks. These are event-driven mechanisms you need to use to enhance the deployment workflow. I can choose which AWS Lambda operate I’d like to make use of as a deployment lifecycle hook. The Lambda operate can carry out the required enterprise logic, but it surely should return a hook standing.

Amazon ECS helps the next lifecycle hooks throughout blue/inexperienced deployments. You’ll be able to be taught extra about every stage on the Deployment lifecycle phases web page.

  • Pre scale up
  • Submit scale up
  • Manufacturing visitors shift
  • Check visitors shift
  • Submit manufacturing visitors shift
  • Submit check visitors shift

For my software, I wish to check when the check visitors shift is full and the inexperienced service handles all the check visitors. Since there’s no end-user visitors, a rollback at this stage may have no impression on customers. This makes Submit check visitors shift appropriate for my use case as I can check it first with my Lambda operate.

Switching context for a second, let’s deal with the Lambda operate that I take advantage of to validate the deployment earlier than permitting it to proceed. In my Lambda operate as a deployment lifecycle hook, I can carry out any enterprise logic, equivalent to artificial testing, calling one other API, or querying metrics.

Throughout the Lambda operate, I have to return a hookStatus. A hookStatus may be SUCCEEDED, which can transfer the method to the following step. If the standing is FAILED, it rolls again to the blue deployment. If it’s IN_PROGRESS, then Amazon ECS retries the Lambda operate in 30 seconds.

Within the following instance, I arrange my validation with a Lambda operate that performs file add as a part of a check suite for my software.

import json
import urllib3
import logging
import base64
import os

# Configure logging
logger = logging.getLogger()
logger.setLevel(logging.DEBUG)

# Initialize HTTP consumer
http = urllib3.PoolManager()

def lambda_handler(occasion, context):
    """
    Validation hook that assessments the inexperienced atmosphere with file add
    """
    logger.data(f"Occasion: {json.dumps(occasion)}")
    logger.data(f"Context: {context}")
    
    attempt:
        # In an actual state of affairs, you'll assemble the check endpoint URL
        test_endpoint = os.getenv("APP_URL")
        
        # Create a check file for add
        test_file_content = "It is a check file for deployment validation"
        test_file_data = test_file_content.encode('utf-8')
        
        # Put together multipart type information for file add
        fields = {
            'file': ('check.txt', test_file_data, 'textual content/plain'),
            'description': 'Deployment validation check file'
        }
        
        # Ship POST request with file add to /course of endpoint
        response = http.request(
            'POST', 
            test_endpoint,
            fields=fields,
            timeout=30
        )
        
        logger.data(f"POST /course of response standing: {response.standing}")
        
        # Test if response has OK standing code (200-299 vary)
        if 200 <= response.standing < 300:
            logger.data("File add check handed - obtained OK standing code")
            return {
                "hookStatus": "SUCCEEDED"
            }
        else:
            logger.error(f"File add check failed - standing code: {response.standing}")
            return {
                "hookStatus": "FAILED"
            }
            
    besides Exception as error:
        logger.error(f"File add check failed: {str(error)}")
        return {
            "hookStatus": "FAILED"
        }

When the deployment reaches the lifecycle stage that’s related to the hook, Amazon ECS robotically invokes my Lambda operate with deployment context. My validation operate can run complete assessments towards the inexperienced revision—checking software well being, working integration assessments, or validating efficiency metrics. The operate then alerts again to ECS whether or not to proceed or abort the deployment.

As I selected the blue/inexperienced deployment technique, I additionally must configure the load balancers and/or Amazon ECS Service Join. Within the Load balancing part, I choose my Software Load Balancer.

Within the Listener part, I take advantage of an current listener on port 80 and choose two Goal teams.

Pleased with this configuration, I create the service and look forward to ECS to provision my new service.

Testing blue/inexperienced deployments
Now, it’s time to check my blue/inexperienced deployments. For this check, Amazon ECS will set off my Lambda operate after the check visitors shift is accomplished. My Lambda operate will return FAILED on this case because it performs file add to my software, however my software doesn’t have this functionality.

I replace my service and test Pressure new deployment, figuring out the blue/inexperienced deployment functionality will roll again if it detects a failure. I choose this feature as a result of I haven’t modified the duty definition however nonetheless must set off a brand new deployment.

At this stage, I’ve each blue and inexperienced environments working, with the inexperienced revision dealing with all of the check visitors. In the meantime, based mostly on Amazon CloudWatch Logs of my Lambda operate, I additionally see that the deployment lifecycle hooks work as anticipated and emit the next payload:

[INFO]	2025-07-10T13:15:39.018Z	67d9b03e-12da-4fab-920d-9887d264308e	Occasion: 
{
    "executionDetails": {
        "testTrafficWeights": {},
        "productionTrafficWeights": {},
        "serviceArn": "arn:aws:ecs:us-west-2:123:service/EcsBlueGreenCluster/nginxBGservice",
        "targetServiceRevisionArn": "arn:aws:ecs:us-west-2:123:service-revision/EcsBlueGreenCluster/nginxBGservice/9386398427419951854"
    },
    "executionId": "a635edb5-a66b-4f44-bf3f-fcee4b3641a5",
    "lifecycleStage": "POST_TEST_TRAFFIC_SHIFT",
    "resourceArn": "arn:aws:ecs:us-west-2:123:service-deployment/EcsBlueGreenCluster/nginxBGservice/TFX5sH9q9XDboDTOv0rIt"
}

As anticipated, my AWS Lambda operate returns FAILED as hookStatus as a result of it did not carry out the check.

[ERROR]	2025-07-10T13:18:43.392Z	67d9b03e-12da-4fab-920d-9887d264308e	File add check failed: HTTPConnectionPool(host="xyz.us-west-2.elb.amazonaws.com", port=80): Max retries exceeded with url: / (Attributable to ConnectTimeoutError(, 'Connection to xyz.us-west-2.elb.amazonaws.com timed out. (join timeout=30)'))

As a result of the validation wasn’t accomplished efficiently, Amazon ECS tries to roll again to the blue model, which is the earlier working deployment model. I can monitor this course of by means of ECS occasions within the Occasions part, which offers detailed visibility into the deployment progress.

Amazon ECS efficiently rolls again the deployment to the earlier working model. The rollback occurs near-instantaneously as a result of the blue revision stays working and able to obtain manufacturing visitors. There is no such thing as a end-user impression throughout this course of, as manufacturing visitors by no means shifted to the brand new software model—ECS merely rolled again check visitors to the unique steady model. This eliminates the everyday deployment downtime related to conventional rolling deployments.

I can even see the rollback standing within the Final deployment part.

All through my testing, I noticed that the blue/inexperienced deployment technique offers constant and predictable conduct. Moreover, the deployment lifecycle hooks present extra flexibility to regulate the conduct of the deployment. Every service revision maintains immutable configuration together with activity definition, load balancer settings, and Service Join configuration. Which means that rollbacks restore precisely the identical atmosphere that was beforehand working.

Extra issues to know
Listed here are a few issues to notice:

  • Pricing – The blue/inexperienced deployment functionality is included with Amazon ECS at no further cost. You pay just for the compute assets used through the deployment course of.
  • Availability – This functionality is obtainable in all industrial AWS Areas.

Get began with blue/inexperienced deployments by updating your Amazon ECS service configuration within the Amazon ECS console.

Completely satisfied deploying!
Donnie