PERF02-BP04 Configure and right-size compute resources
Configure and right-size compute resources to match your workload’s performance requirements and avoid under- or over-utilized resources.
Common anti-patterns:
-
You ignore your workload performance requirements resulting in over-provisioned or under-provisioned compute resources.
-
You only choose the largest or smallest instance available for all workloads.
-
You only use one instance family for ease of management.
-
You ignore recommendations from AWS Cost Explorer or Compute Optimizer for right-sizing.
-
You do not re-evaluate the workload for suitability of new instance types.
-
You certify only a small number of instance configurations for your organization.
Benefits of establishing this best practice: Right-sizing compute resources ensures optimal operation in the cloud by avoiding over-provisioning and under-provisioning resources. Properly sizing compute resources typically results in better performance and enhanced customer experience, while also lowering cost.
Level of risk exposed if this best practice is not established: Medium
Implementation guidance
Right-sizing allows organizations to operate their cloud
infrastructure in an efficient and cost-effective manner while
addressing their business needs. Over-provisioning cloud resources
can lead to extra costs, while under-provisioning can result in
poor performance and a negative customer experience. AWS provides
tools such as
AWS Compute Optimizer
Implementation steps
-
Choose an instance type to best fit your needs:
-
Analyze the various performance characteristics of your workload and how these characteristics relate to memory, network, and CPU usage. Use this data to choose resources that best match your workload's profile and performance goals.
-
Monitor your resource usage using AWS monitoring tools such as Amazon CloudWatch.
-
Select the right configuration for compute resources.
-
For ephemeral workloads, evaluate instance Amazon CloudWatch metrics such as
CPUUtilization
to identify if the instance is under-utilized or over-utilized. -
For stable workloads, check AWS rightsizing tools such as AWS Compute Optimizer and AWS Trusted Advisor at regular intervals to identify opportunities to optimize and right-size the compute resource.
-
-
Test configuration changes in a non-production environment before implementing in a live environment.
-
Continually re-evaluate new compute offerings and compare against your workload’s needs.
Resources
Related documents:
Related videos:
-
AWS re:Invent 2023 – AWS Graviton: The best price performance for your AWS workloads
-
AWS re:Invent 2023 – New Amazon EC2 generative AI capabilities in AWS Management Console
-
AWS re:Invent 2023 – Smart savings: Amazon EC2 cost-optimization strategies
-
AWS re:Invent 2021 – Powering next-gen Amazon EC2: Deep dive on the Nitro System
Related examples: