Inference
Container Images
SageMaker Neo now provides inference image URI information for ml_*
targets. For more information see DescribeCompilationJob.
Based on your use case, replace the highlighted portion in the inference image URI
template provided below with appropriate values.
aws_account_id
.dkr.ecr.aws_region
.amazonaws.com/xgboost-neo:latest
Replace aws_account_id
from the table at the end of this page based on the aws_region
you used.
aws_account_id
.dkr.ecr.aws_region
.amazonaws.com/sagemaker-neo-keras:fx_version-instance_type-py3
Replace aws_account_id
from the table at the end of
this page based on the aws_region
you used.
Replace fx_version
with
2.2.4
.
Replace instance_type
with
either cpu
or gpu
.
- CPU or GPU instance types
-
aws_account_id
.dkr.ecr.aws_region
.amazonaws.com/sagemaker-inference-mxnet:fx_version
-instance_type
-py3
Replace aws_account_id
from the table at the end of
this page based on the aws_region
you used.
Replace fx_version
with
1.8.0
.
Replace instance_type
with
either cpu
or gpu
.
- Inferentia1
-
aws_account_id
.dkr.ecr.aws_region
.amazonaws.com/sagemaker-neo-mxnet:fx_version
-instance_type
-py3
Replace aws_region
with
either us-east-1
or us-west-2
.
Replace aws_account_id
from the table at the end
of this page based on the aws_region
you used.
Replace fx_version
with
1.5.1
.
Replace instance_type
with inf
.
aws_account_id
.dkr.ecr.aws_region
.amazonaws.com/sagemaker-neo-onnx:fx_version-instance_type-py3
Replace aws_account_id
from the table at the end of
this page based on the aws_region
you used.
Replace fx_version
with
1.5.0
.
Replace instance_type
with
either cpu
or gpu
.
- CPU or GPU instance types
-
aws_account_id
.dkr.ecr.aws_region
.amazonaws.com/sagemaker-inference-pytorch:fx_version
-instance_type
-py3
Replace aws_account_id
from the table at the end of
this page based on the aws_region
you used.
Replace fx_version
with 1.4
, 1.5
,
1.6
, 1.7
, 1.8
, 1.12
, 1.13
, or 2.0
.
Replace instance_type
with
either cpu
or gpu
.
- Inferentia1
-
aws_account_id
.dkr.ecr.aws_region
.amazonaws.com/sagemaker-neo-pytorch:fx_version
-instance_type
-py3
Replace aws_region
with
either us-east-1
or us-west-2
.
Replace aws_account_id
from the table at the end
of this page based on the aws_region
you used.
Replace fx_version
with
1.5.1
.
Replace instance_type
with inf
.
- Inferentia2 and Trainium1
-
763104351884.dkr.ecr.aws_region
.amazonaws.com/pytorch-inference-neuronx:1.13.1-neuronx-py38-sdk2.10.0-ubuntu20.04
Replace aws_region
with
us-east-2
for Inferentia2, and
us-east-1
for Trainium1.
- CPU or GPU instance types
-
aws_account_id
.dkr.ecr.aws_region
.amazonaws.com/sagemaker-inference-tensorflow:fx_version
-instance_type
-py3
Replace aws_account_id
from the table at the end of
this page based on the aws_region
you used.
Replace fx_version
with
1.15.3
or 2.9
.
Replace instance_type
with
either cpu
or gpu
.
- Inferentia1
-
aws_account_id
.dkr.ecr.aws_region
.amazonaws.com/sagemaker-neo-tensorflow:fx_version
-instance_type
-py3
Replace aws_account_id
from the table at the end of this page based on the aws_region
you used.
Note that for instance type inf
only us-east-1
and us-west-2
are supported.
Replace fx_version
with 1.15.0
Replace instance_type
with inf
.
- Inferentia2 and Trainium1
-
763104351884.dkr.ecr.aws_region
.amazonaws.com/tensorflow-inference-neuronx:2.10.1-neuronx-py38-sdk2.10.0-ubuntu20.04
Replace aws_region
with
us-east-2
for Inferentia2, and
us-east-1
for Trainium1.
The following table maps aws_account_id
with aws_region
.
Use this table to find the correct inference image URI
you need for your application.
aws_account_id |
aws_region |
785573368785 |
us-east-1 |
007439368137 |
us-east-2 |
710691900526 |
us-west-1 |
301217895009 |
us-west-2 |
802834080501 |
eu-west-1 |
205493899709 |
eu-west-2 |
254080097072 |
eu-west-3 |
601324751636 |
eu-north-1 |
966458181534 |
eu-south-1 |
746233611703 |
eu-central-1 |
110948597952 |
ap-east-1 |
763008648453 |
ap-south-1 |
941853720454 |
ap-northeast-1 |
151534178276 |
ap-northeast-2 |
925152966179 |
ap-northeast-3 |
324986816169 |
ap-southeast-1 |
355873309152 |
ap-southeast-2 |
474822919863 |
cn-northwest-1 |
472730292857 |
cn-north-1 |
756306329178 |
sa-east-1 |
464438896020 |
ca-central-1 |
836785723513 |
me-south-1 |
774647643957 |
af-south-1 |
275950707576 |
il-central-1 |