Inference Container Images - Amazon SageMaker

SageMaker Neo now provides inference image URI information for ml_* targets. For more information, see DescribeCompilationJob.
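As a rough illustration of that lookup, the sketch below reads the InferenceImage field from a DescribeCompilationJob response using boto3. The helper name neo_inference_image and the job name in the usage note are hypothetical, and the code assumes AWS credentials and a default region are already configured.

```python
def neo_inference_image(job_name, client=None):
    """Return the inference image URI reported for a Neo compilation job.

    `client` is any object exposing describe_compilation_job; when omitted,
    a boto3 SageMaker client is created (assumes credentials are configured).
    """
    if client is None:
        import boto3  # imported lazily so the helper also works with a stub client

        client = boto3.client("sagemaker")
    response = client.describe_compilation_job(CompilationJobName=job_name)
    # The field may be missing from the response, so return None in that case.
    return response.get("InferenceImage")
```

Calling neo_inference_image("my-neo-job") with a real job name should return the URI string, or None when the response carries no InferenceImage field.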

Based on your use case, replace the placeholder portions (aws_account_id, aws_region, fx_version, and instance_type) in the inference image URI templates below with the appropriate values.

XGBoost

aws_account_id.dkr.ecr.aws_region.amazonaws.com/xgboost-neo:latest

Replace aws_account_id with the value from the table at the end of this page that matches the aws_region you use.

Keras

aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-keras:fx_version-instance_type-py3

Replace aws_account_id with the value from the table at the end of this page that matches the aws_region you use.

Replace fx_version with 2.2.4.

Replace instance_type with either cpu or gpu.
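The three replacements for the Keras template can be sketched with Python's string.Template. The function name keras_image_uri is hypothetical; the account ID in the usage line is the table entry for us-west-2.

```python
from string import Template

# The Keras template from this page, with each placeholder marked for substitution.
KERAS_TEMPLATE = Template(
    "${aws_account_id}.dkr.ecr.${aws_region}.amazonaws.com/"
    "sagemaker-neo-keras:${fx_version}-${instance_type}-py3"
)


def keras_image_uri(aws_account_id, aws_region, instance_type):
    """Fill in the Keras template; fx_version is fixed at 2.2.4."""
    if instance_type not in ("cpu", "gpu"):
        raise ValueError("instance_type must be 'cpu' or 'gpu'")
    return KERAS_TEMPLATE.substitute(
        aws_account_id=aws_account_id,
        aws_region=aws_region,
        fx_version="2.2.4",
        instance_type=instance_type,
    )


print(keras_image_uri("301217895009", "us-west-2", "cpu"))
# 301217895009.dkr.ecr.us-west-2.amazonaws.com/sagemaker-neo-keras:2.2.4-cpu-py3
```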

MXNet

CPU or GPU instance types
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-inference-mxnet:fx_version-instance_type-py3

Replace aws_account_id with the value from the table at the end of this page that matches the aws_region you use.

Replace fx_version with 1.8.0.

Replace instance_type with either cpu or gpu.

Inferentia1
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-mxnet:fx_version-instance_type-py3

Replace aws_region with either us-east-1 or us-west-2.

Replace aws_account_id with the value from the table at the end of this page that matches the aws_region you use.

Replace fx_version with 1.5.1.

Replace instance_type with inf.
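Because the Inferentia1 template is limited to two regions, a small guard can reject other regions before the URI is built. The following is a minimal sketch; the name mxnet_inf1_image_uri is hypothetical, and the two account IDs are the table entries for us-east-1 and us-west-2.

```python
# Account IDs for the two regions allowed for Inferentia1, taken from the
# table at the end of this page.
INF1_ACCOUNTS = {
    "us-east-1": "785573368785",
    "us-west-2": "301217895009",
}


def mxnet_inf1_image_uri(aws_region):
    """Build the MXNet Inferentia1 URI (fx_version 1.5.1, instance_type inf)."""
    try:
        account_id = INF1_ACCOUNTS[aws_region]
    except KeyError:
        raise ValueError(
            f"{aws_region!r} is not supported; use us-east-1 or us-west-2"
        ) from None
    return (
        f"{account_id}.dkr.ecr.{aws_region}.amazonaws.com/"
        "sagemaker-neo-mxnet:1.5.1-inf-py3"
    )
```

For example, mxnet_inf1_image_uri("us-east-1") yields 785573368785.dkr.ecr.us-east-1.amazonaws.com/sagemaker-neo-mxnet:1.5.1-inf-py3, while any other region raises ValueError.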

ONNX

aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-onnx:fx_version-instance_type-py3

Replace aws_account_id with the value from the table at the end of this page that matches the aws_region you use.

Replace fx_version with 1.5.0.

Replace instance_type with either cpu or gpu.

PyTorch

CPU or GPU instance types
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-inference-pytorch:fx_version-instance_type-py3

Replace aws_account_id with the value from the table at the end of this page that matches the aws_region you use.

Replace fx_version with 1.4, 1.5, 1.6, 1.7, 1.8, 1.12, 1.13, or 2.0.

Replace instance_type with either cpu or gpu.
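Since the PyTorch template accepts several framework versions, it can help to validate fx_version before building the URI. The sketch below assumes a hypothetical helper named pytorch_image_uri; versions are kept as strings so that values such as 2.0 are not altered by numeric formatting.

```python
# Versions listed on this page for the CPU/GPU PyTorch image.
PYTORCH_VERSIONS = ("1.4", "1.5", "1.6", "1.7", "1.8", "1.12", "1.13", "2.0")


def pytorch_image_uri(aws_account_id, aws_region, fx_version, instance_type):
    """Build the CPU/GPU PyTorch inference image URI after validating inputs."""
    if fx_version not in PYTORCH_VERSIONS:
        raise ValueError(f"fx_version must be one of {PYTORCH_VERSIONS}")
    if instance_type not in ("cpu", "gpu"):
        raise ValueError("instance_type must be 'cpu' or 'gpu'")
    return (
        f"{aws_account_id}.dkr.ecr.{aws_region}.amazonaws.com/"
        f"sagemaker-inference-pytorch:{fx_version}-{instance_type}-py3"
    )
```

With the us-east-1 account ID from the table, pytorch_image_uri("785573368785", "us-east-1", "1.13", "gpu") returns 785573368785.dkr.ecr.us-east-1.amazonaws.com/sagemaker-inference-pytorch:1.13-gpu-py3.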

Inferentia1
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-pytorch:fx_version-instance_type-py3

Replace aws_region with either us-east-1 or us-west-2.

Replace aws_account_id with the value from the table at the end of this page that matches the aws_region you use.

Replace fx_version with 1.5.1.

Replace instance_type with inf.

Inferentia2 and Trainium1
763104351884.dkr.ecr.aws_region.amazonaws.com/pytorch-inference-neuronx:1.13.1-neuronx-py38-sdk2.10.0-ubuntu20.04

Replace aws_region with us-east-2 for Inferentia2, and us-east-1 for Trainium1.

TensorFlow

CPU or GPU instance types
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-inference-tensorflow:fx_version-instance_type-py3

Replace aws_account_id with the value from the table at the end of this page that matches the aws_region you use.

Replace fx_version with 1.15.3 or 2.9.

Replace instance_type with either cpu or gpu.

Inferentia1
aws_account_id.dkr.ecr.aws_region.amazonaws.com/sagemaker-neo-tensorflow:fx_version-instance_type-py3

Replace aws_account_id with the value from the table at the end of this page that matches the aws_region you use. Note that the inf instance type is supported only in us-east-1 and us-west-2.

Replace fx_version with 1.15.0.

Replace instance_type with inf.

Inferentia2 and Trainium1
763104351884.dkr.ecr.aws_region.amazonaws.com/tensorflow-inference-neuronx:2.10.1-neuronx-py38-sdk2.10.0-ubuntu20.04

Replace aws_region with us-east-2 for Inferentia2, and us-east-1 for Trainium1.
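The Inferentia2 and Trainium1 images on this page use a fixed account ID and tag, so only aws_region varies with the accelerator. A minimal sketch of that selection follows; the function name neuronx_image_uri and the lowercase keys "pytorch", "tensorflow", "inferentia2", and "trainium1" are naming choices made for this example only.

```python
NEURONX_ACCOUNT_ID = "763104351884"

# Region to use for each accelerator, as stated on this page.
NEURONX_REGIONS = {"inferentia2": "us-east-2", "trainium1": "us-east-1"}

# Repository and tag for each framework, copied from the URIs on this page.
NEURONX_IMAGES = {
    "pytorch": "pytorch-inference-neuronx:1.13.1-neuronx-py38-sdk2.10.0-ubuntu20.04",
    "tensorflow": "tensorflow-inference-neuronx:2.10.1-neuronx-py38-sdk2.10.0-ubuntu20.04",
}


def neuronx_image_uri(framework, accelerator):
    """Return the neuronx image URI for a framework and accelerator pair."""
    if framework not in NEURONX_IMAGES:
        raise ValueError(f"framework must be one of {sorted(NEURONX_IMAGES)}")
    if accelerator not in NEURONX_REGIONS:
        raise ValueError(f"accelerator must be one of {sorted(NEURONX_REGIONS)}")
    aws_region = NEURONX_REGIONS[accelerator]
    return (
        f"{NEURONX_ACCOUNT_ID}.dkr.ecr.{aws_region}.amazonaws.com/"
        f"{NEURONX_IMAGES[framework]}"
    )
```

For instance, neuronx_image_uri("tensorflow", "inferentia2") resolves aws_region to us-east-2 and returns the TensorFlow neuronx URI for that region.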

The following table maps each aws_region to its aws_account_id. Use it to find the account ID for the inference image URI your application needs.

aws_account_id aws_region
785573368785 us-east-1
007439368137 us-east-2
710691900526 us-west-1
301217895009 us-west-2
802834080501 eu-west-1
205493899709 eu-west-2
254080097072 eu-west-3
601324751636 eu-north-1
966458181534 eu-south-1
746233611703 eu-central-1
110948597952 ap-east-1
763008648453 ap-south-1
941853720454 ap-northeast-1
151534178276 ap-northeast-2
925152966179 ap-northeast-3
324986816169 ap-southeast-1
355873309152 ap-southeast-2
474822919863 cn-northwest-1
472730292857 cn-north-1
756306329178 sa-east-1
464438896020 ca-central-1
836785723513 me-south-1
774647643957 af-south-1
275950707576 il-central-1
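For programmatic use, the table above can be held in a dictionary keyed by region. This is a sketch only: the name account_id_for_region is hypothetical, and the IDs are stored as strings so that leading zeros (as in the us-east-2 entry) are preserved.

```python
# aws_region -> aws_account_id, transcribed from the table above.
NEO_ACCOUNT_IDS = {
    "us-east-1": "785573368785",
    "us-east-2": "007439368137",
    "us-west-1": "710691900526",
    "us-west-2": "301217895009",
    "eu-west-1": "802834080501",
    "eu-west-2": "205493899709",
    "eu-west-3": "254080097072",
    "eu-north-1": "601324751636",
    "eu-south-1": "966458181534",
    "eu-central-1": "746233611703",
    "ap-east-1": "110948597952",
    "ap-south-1": "763008648453",
    "ap-northeast-1": "941853720454",
    "ap-northeast-2": "151534178276",
    "ap-northeast-3": "925152966179",
    "ap-southeast-1": "324986816169",
    "ap-southeast-2": "355873309152",
    "cn-northwest-1": "474822919863",
    "cn-north-1": "472730292857",
    "sa-east-1": "756306329178",
    "ca-central-1": "464438896020",
    "me-south-1": "836785723513",
    "af-south-1": "774647643957",
    "il-central-1": "275950707576",
}


def account_id_for_region(aws_region):
    """Look up the account ID for a region, failing loudly if it is not listed."""
    try:
        return NEO_ACCOUNT_IDS[aws_region]
    except KeyError:
        raise ValueError(f"no account ID listed for region {aws_region!r}") from None
```

For example, account_id_for_region("us-east-2") returns "007439368137", and a region that is not in the table raises ValueError rather than producing a malformed URI.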