describe-model¶

Description¶

Describes a model that you created using the CreateModel API.

Synopsis¶

  describe-model
--model-name <value>
[--cli-input-json | --cli-input-yaml]
[--generate-cli-skeleton <value>]
[--debug]
[--endpoint-url <value>]
[--no-verify-ssl]
[--no-paginate]
[--output <value>]
[--query <value>]
[--profile <value>]
[--region <value>]
[--version <value>]
[--color <value>]
[--no-sign-request]
[--ca-bundle <value>]
[--cli-read-timeout <value>]
[--cli-connect-timeout <value>]
[--cli-binary-format <value>]
[--no-cli-pager]
[--cli-auto-prompt]
[--no-cli-auto-prompt]
[--cli-error-format <value>]

Options¶

--model-name (string) [required]

The name of the model.

Constraints:

min: 0

max: 63

pattern: [a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?

--cli-input-json | --cli-input-yaml (string) Reads arguments from the JSON string provided. The JSON string follows the format provided by --generate-cli-skeleton. If other arguments are provided on the command line, those values will override the JSON-provided values. It is not possible to pass arbitrary binary values using a JSON-provided value as the string will be taken literally. This may not be specified along with --cli-input-yaml.

--generate-cli-skeleton (string) Prints a JSON skeleton to standard output without sending an API request. If provided with no value or the value input, prints a sample input JSON that can be used as an argument for --cli-input-json. Similarly, if provided yaml-input it will print a sample input YAML that can be used with --cli-input-yaml. If provided with the value output, it validates the command inputs and returns a sample output JSON for that command. The generated JSON skeleton is not stable between versions of the AWS CLI and there are no backwards compatibility guarantees in the JSON skeleton generated.

Global Options¶

--debug (boolean)

Turn on debug logging.

--endpoint-url (string)

Override command’s default URL with the given URL.

--no-verify-ssl (boolean)

By default, the AWS CLI uses SSL when communicating with AWS services. For each SSL connection, the AWS CLI will verify SSL certificates. This option overrides the default behavior of verifying SSL certificates.

--no-paginate (boolean)

Disable automatic pagination. If automatic pagination is disabled, the AWS CLI will only make one call, for the first page of results.

--output (string)

The formatting style for command output.

json
text
table
yaml
yaml-stream
off

--query (string)

A JMESPath query to use in filtering the response data.

--profile (string)

Use a specific profile from your credential file.

--region (string)

The region to use. Overrides config/env settings.

--version (string)

Display the version of this tool.

--color (string)

Turn on/off color output.

on
off
auto

--no-sign-request (boolean)

Do not sign requests. Credentials will not be loaded if this argument is provided.

--ca-bundle (string)

The CA certificate bundle to use when verifying SSL certificates. Overrides config/env settings.

--cli-read-timeout (int)

The maximum socket read time in seconds. If the value is set to 0, the socket read will be blocking and not timeout. The default value is 60 seconds.

--cli-connect-timeout (int)

The maximum socket connect time in seconds. If the value is set to 0, the socket connect will be blocking and not timeout. The default value is 60 seconds.

--cli-binary-format (string)

The formatting style to be used for binary blobs. The default format is base64. The base64 format expects binary blobs to be provided as a base64 encoded string. The raw-in-base64-out format preserves compatibility with AWS CLI V1 behavior and binary values must be passed literally. When providing contents from a file that map to a binary blob fileb:// will always be treated as binary and use the file contents directly regardless of the cli-binary-format setting. When using file:// the file contents will need to properly formatted for the configured cli-binary-format.

base64
raw-in-base64-out

--no-cli-pager (boolean)

Disable cli pager for output.

--cli-auto-prompt (boolean)

Automatically prompt for CLI input parameters.

--no-cli-auto-prompt (boolean)

Disable automatically prompt for CLI input parameters.

--cli-error-format (string)

The formatting style for error output. By default, errors are displayed in enhanced format.

legacy
json
yaml
text
table
enhanced

Output¶

ModelName -> (string)

Name of the SageMaker model.

Constraints:

min: 0

max: 63

pattern: [a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?

PrimaryContainer -> (structure)

The location of the primary inference code, associated artifacts, and custom environment map that the inference code uses when it is deployed in production.

ContainerHostname -> (string)

This parameter is ignored for models that contain only a PrimaryContainer .

When a ContainerDefinition is part of an inference pipeline, the value of the parameter uniquely identifies the container for the purposes of logging and metrics. For information, see Use Logs and Metrics to Monitor an Inference Pipeline . If you don’t specify a value for this parameter for a ContainerDefinition that is part of an inference pipeline, a unique name is automatically assigned based on the position of the ContainerDefinition in the pipeline. If you specify a value for the ContainerHostName for any ContainerDefinition that is part of an inference pipeline, you must specify a value for the ContainerHostName parameter of every ContainerDefinition in that pipeline.

Constraints:

min: 0

max: 63

pattern: [a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

Image -> (string)

The path where inference code is stored. This can be either in Amazon EC2 Container Registry or in a Docker registry that is accessible from the same VPC that you configure for your endpoint. If you are using your own custom algorithm instead of an algorithm provided by SageMaker, the inference code must meet SageMaker requirements. SageMaker supports both registry/repository[:tag] and registry/repository[@digest] image path formats. For more information, see Using Your Own Algorithms with Amazon SageMaker .

Note
The model artifacts in an Amazon S3 bucket and the Docker image for inference container in Amazon EC2 Container Registry must be in the same region as the model or endpoint you are creating.

Constraints:

min: 0

max: 255

pattern: [\S]+

ImageConfig -> (structure)

Specifies whether the model container is in Amazon ECR or a private Docker registry accessible from your Amazon Virtual Private Cloud (VPC). For information about storing containers in a private Docker registry, see Use a Private Docker Registry for Real-Time Inference Containers .

Note
The model artifacts in an Amazon S3 bucket and the Docker image for inference container in Amazon EC2 Container Registry must be in the same region as the model or endpoint you are creating.

RepositoryAccessMode -> (string) [required]

Set this to one of the following values:

Platform - The model image is hosted in Amazon ECR.

Vpc - The model image is hosted in a private Docker registry in your VPC.

Possible values:

Platform

Vpc

RepositoryAuthConfig -> (structure)

(Optional) Specifies an authentication configuration for the private docker registry where your model image is hosted. Specify a value for this property only if you specified Vpc as the value for the RepositoryAccessMode field, and the private Docker registry where the model image is hosted requires authentication.

RepositoryCredentialsProviderArn -> (string) [required]

The Amazon Resource Name (ARN) of an Amazon Web Services Lambda function that provides credentials to authenticate to the private Docker registry where your model image is hosted. For information about how to create an Amazon Web Services Lambda function, see Create a Lambda function with the console in the Amazon Web Services Lambda Developer Guide .

Constraints:

min: 1

max: 2048

pattern: .*

Mode -> (string)

Whether the container hosts a single model or multiple models.

Possible values:

SingleModel

MultiModel

ModelDataUrl -> (string)

The S3 path where the model artifacts, which result from model training, are stored. This path must point to a single gzip compressed tar archive (.tar.gz suffix). The S3 path is required for SageMaker built-in algorithms, but not if you use your own algorithms. For more information on built-in algorithms, see Common Parameters .

Note
The model artifacts must be in an S3 bucket that is in the same region as the model or endpoint you are creating.

If you provide a value for this parameter, SageMaker uses Amazon Web Services Security Token Service to download model artifacts from the S3 path you provide. Amazon Web Services STS is activated in your Amazon Web Services account by default. If you previously deactivated Amazon Web Services STS for a region, you need to reactivate Amazon Web Services STS for that region. For more information, see Activating and Deactivating Amazon Web Services STS in an Amazon Web Services Region in the Amazon Web Services Identity and Access Management User Guide .

Warning
If you use a built-in algorithm to create a model, SageMaker requires that you provide a S3 path to the model artifacts in ModelDataUrl .

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

ModelDataSource -> (structure)

Specifies the location of ML model data to deploy.

Note
Currently you cannot use ModelDataSource in conjunction with SageMaker batch transform, SageMaker serverless endpoints, SageMaker multi-model endpoints, and SageMaker Marketplace.

S3DataSource -> (structure)

Specifies the S3 location of ML model data to deploy.

S3Uri -> (string) [required]

Specifies the S3 path of ML model data to deploy.

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

S3DataType -> (string) [required]

Specifies the type of ML model data to deploy.

If you choose S3Prefix , S3Uri identifies a key name prefix. SageMaker uses all objects that match the specified key name prefix as part of the ML model data to deploy. A valid key name prefix identified by S3Uri always ends with a forward slash (/).

If you choose S3Object , S3Uri identifies an object that is the ML model data to deploy.

Possible values:

S3Prefix

S3Object

CompressionType -> (string) [required]

Specifies how the ML model data is prepared.

If you choose Gzip and choose S3Object as the value of S3DataType , S3Uri identifies an object that is a gzip-compressed TAR archive. SageMaker will attempt to decompress and untar the object during model deployment.

If you choose None and chooose S3Object as the value of S3DataType , S3Uri identifies an object that represents an uncompressed ML model to deploy.

If you choose None and choose S3Prefix as the value of S3DataType , S3Uri identifies a key name prefix, under which all objects represents the uncompressed ML model to deploy.

If you choose None, then SageMaker will follow rules below when creating model data files under /opt/ml/model directory for use by your inference code:

If you choose S3Object as the value of S3DataType , then SageMaker will split the key of the S3 object referenced by S3Uri by slash (/), and use the last part as the filename of the file holding the content of the S3 object.

If you choose S3Prefix as the value of S3DataType , then for each S3 object under the key name pefix referenced by S3Uri , SageMaker will trim its key by the prefix, and use the remainder as the path (relative to /opt/ml/model ) of the file holding the content of the S3 object. SageMaker will split the remainder by slash (/), using intermediate parts as directory names and the last part as filename of the file holding the content of the S3 object.

Do not use any of the following as file names or directory names:

An empty or blank string

A string which contains null bytes

A string longer than 255 bytes

A single dot (. )

A double dot (.. )

Ambiguous file names will result in model deployment failure. For example, if your uncompressed ML model consists of two S3 objects s3://mybucket/model/weights and s3://mybucket/model/weights/part1 and you specify s3://mybucket/model/ as the value of S3Uri and S3Prefix as the value of S3DataType , then it will result in name clash between /opt/ml/model/weights (a regular file) and /opt/ml/model/weights/ (a directory).

Do not organize the model artifacts in S3 console using folders . When you create a folder in S3 console, S3 creates a 0-byte object with a key set to the folder name you provide. They key of the 0-byte object ends with a slash (/) which violates SageMaker restrictions on model artifact file names, leading to model deployment failure.

Possible values:

None

Gzip

ModelAccessConfig -> (structure)

Specifies the access configuration file for the ML model. You can explicitly accept the model end-user license agreement (EULA) within the ModelAccessConfig . You are responsible for reviewing and complying with any applicable license terms and making sure they are acceptable for your use case before downloading or using a model.

AcceptEula -> (boolean) [required]

Specifies agreement to the model end-user license agreement (EULA). The AcceptEula value must be explicitly defined as True in order to accept the EULA that this model requires. You are responsible for reviewing and complying with any applicable license terms and making sure they are acceptable for your use case before downloading or using a model.

HubAccessConfig -> (structure)

Configuration information for hub access.

HubContentArn -> (string) [required]

The ARN of the hub content for which deployment access is allowed.

Constraints:

min: 0

max: 255

pattern: .*

ManifestS3Uri -> (string)

The Amazon S3 URI of the manifest file. The manifest file is a CSV file that stores the artifact locations.

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

ETag -> (string)

The ETag associated with S3 URI.

ManifestEtag -> (string)

The ETag associated with Manifest S3 URI.

AdditionalModelDataSources -> (list)

Data sources that are available to your model in addition to the one that you specify for ModelDataSource when you use the CreateModel action.

Constraints:

min: 0

max: 5

(structure)

Data sources that are available to your model in addition to the one that you specify for ModelDataSource when you use the CreateModel action.

ChannelName -> (string) [required]

A custom name for this AdditionalModelDataSource object.

Constraints:

min: 1

max: 64

pattern: [A-Za-z0-9\.\-_]+

S3DataSource -> (structure) [required]

Specifies the S3 location of ML model data to deploy.

S3Uri -> (string) [required]

Specifies the S3 path of ML model data to deploy.

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

S3DataType -> (string) [required]

Specifies the type of ML model data to deploy.

If you choose S3Prefix , S3Uri identifies a key name prefix. SageMaker uses all objects that match the specified key name prefix as part of the ML model data to deploy. A valid key name prefix identified by S3Uri always ends with a forward slash (/).

If you choose S3Object , S3Uri identifies an object that is the ML model data to deploy.

Possible values:

S3Prefix

S3Object

CompressionType -> (string) [required]

Specifies how the ML model data is prepared.

If you choose Gzip and choose S3Object as the value of S3DataType , S3Uri identifies an object that is a gzip-compressed TAR archive. SageMaker will attempt to decompress and untar the object during model deployment.

If you choose None and chooose S3Object as the value of S3DataType , S3Uri identifies an object that represents an uncompressed ML model to deploy.

If you choose None and choose S3Prefix as the value of S3DataType , S3Uri identifies a key name prefix, under which all objects represents the uncompressed ML model to deploy.

If you choose None, then SageMaker will follow rules below when creating model data files under /opt/ml/model directory for use by your inference code:

If you choose S3Object as the value of S3DataType , then SageMaker will split the key of the S3 object referenced by S3Uri by slash (/), and use the last part as the filename of the file holding the content of the S3 object.

If you choose S3Prefix as the value of S3DataType , then for each S3 object under the key name pefix referenced by S3Uri , SageMaker will trim its key by the prefix, and use the remainder as the path (relative to /opt/ml/model ) of the file holding the content of the S3 object. SageMaker will split the remainder by slash (/), using intermediate parts as directory names and the last part as filename of the file holding the content of the S3 object.

Do not use any of the following as file names or directory names:

An empty or blank string

A string which contains null bytes

A string longer than 255 bytes

A single dot (. )

A double dot (.. )

Ambiguous file names will result in model deployment failure. For example, if your uncompressed ML model consists of two S3 objects s3://mybucket/model/weights and s3://mybucket/model/weights/part1 and you specify s3://mybucket/model/ as the value of S3Uri and S3Prefix as the value of S3DataType , then it will result in name clash between /opt/ml/model/weights (a regular file) and /opt/ml/model/weights/ (a directory).

Do not organize the model artifacts in S3 console using folders . When you create a folder in S3 console, S3 creates a 0-byte object with a key set to the folder name you provide. They key of the 0-byte object ends with a slash (/) which violates SageMaker restrictions on model artifact file names, leading to model deployment failure.

Possible values:

None

Gzip

ModelAccessConfig -> (structure)

Specifies the access configuration file for the ML model. You can explicitly accept the model end-user license agreement (EULA) within the ModelAccessConfig . You are responsible for reviewing and complying with any applicable license terms and making sure they are acceptable for your use case before downloading or using a model.

AcceptEula -> (boolean) [required]

Specifies agreement to the model end-user license agreement (EULA). The AcceptEula value must be explicitly defined as True in order to accept the EULA that this model requires. You are responsible for reviewing and complying with any applicable license terms and making sure they are acceptable for your use case before downloading or using a model.

HubAccessConfig -> (structure)

Configuration information for hub access.

HubContentArn -> (string) [required]

The ARN of the hub content for which deployment access is allowed.

Constraints:

min: 0

max: 255

pattern: .*

ManifestS3Uri -> (string)

The Amazon S3 URI of the manifest file. The manifest file is a CSV file that stores the artifact locations.

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

ETag -> (string)

The ETag associated with S3 URI.

ManifestEtag -> (string)

The ETag associated with Manifest S3 URI.

Environment -> (map)

The environment variables to set in the Docker container. Don’t include any sensitive data in your environment variables.

The maximum length of each key and value in the Environment map is 1024 bytes. The maximum length of all keys and values in the map, combined, is 32 KB. If you pass multiple containers to a CreateModel request, then the maximum length of all of their maps, combined, is also 32 KB.

Constraints:

min: 0

max: 100

key -> (string)

Constraints:

min: 0

max: 1024

pattern: [a-zA-Z_][a-zA-Z0-9_]*

value -> (string)

Constraints:

min: 0

max: 1024

pattern: [\S\s]*

ModelPackageName -> (string)

The name or Amazon Resource Name (ARN) of the model package to use to create the model.

Constraints:

min: 1

max: 176

pattern: (arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:[a-z\-]*\/)?([a-zA-Z0-9]([a-zA-Z0-9-]){0,62})(?<!-)(\/[0-9]{1,9})?

InferenceSpecificationName -> (string)

The inference specification name in the model package version.

Constraints:

min: 1

max: 63

pattern: [a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

MultiModelConfig -> (structure)

Specifies additional configuration for multi-model endpoints.

ModelCacheSetting -> (string)

Whether to cache models for a multi-model endpoint. By default, multi-model endpoints cache models so that a model does not have to be loaded into memory each time it is invoked. Some use cases do not benefit from model caching. For example, if an endpoint hosts a large number of models that are each invoked infrequently, the endpoint might perform better if you disable model caching. To disable model caching, set the value of this parameter to Disabled .

Possible values:

Enabled

Disabled

ContainerMetricsConfig -> (structure)

The configuration for container metrics scraping. Specifies the metrics endpoint path and publishing frequency. If not specified when EnableDetailedObservability is True , the default path /metrics on port 8080 is used. For first-party and Deep Learning Containers (DLC), the endpoint path is determined automatically and this configuration is optional.

MetricsEndpoints -> (list)

A list of metrics endpoints to scrape from the container. Each endpoint specifies the path where the container exposes Prometheus-formatted metrics and the frequency at which to publish them. You can specify a maximum of 1 endpoint.

Constraints:

min: 0

max: 1

(structure)

Specifies a metrics endpoint for a container, including the path where the container exposes Prometheus-formatted metrics and the frequency at which to publish them to Amazon CloudWatch.

MetricsEndpointPath -> (string) [required]

The path to the metrics endpoint exposed by the container. For example, /metrics or /server/metrics . The path must start with / and can contain alphanumeric characters, forward slashes, underscores, hyphens, and periods. Maximum length is 256 characters. If not specified, defaults to /metrics .

Constraints:

min: 0

max: 256

pattern: /(?!.*\.\.)[a-zA-Z0-9/_.\-]+

MetricPublishFrequencyInSeconds -> (integer)

The interval, in seconds, at which container metrics scraped from the endpoint are published to Amazon CloudWatch. Valid values: 10 , 30 , 60 , 120 , 180 , 240 , 300 . Defaults to 60 .

Containers -> (list)

The containers in the inference pipeline.

Constraints:

min: 0

max: 15

(structure)

Describes the container, as part of model definition.

ContainerHostname -> (string)

This parameter is ignored for models that contain only a PrimaryContainer .

When a ContainerDefinition is part of an inference pipeline, the value of the parameter uniquely identifies the container for the purposes of logging and metrics. For information, see Use Logs and Metrics to Monitor an Inference Pipeline . If you don’t specify a value for this parameter for a ContainerDefinition that is part of an inference pipeline, a unique name is automatically assigned based on the position of the ContainerDefinition in the pipeline. If you specify a value for the ContainerHostName for any ContainerDefinition that is part of an inference pipeline, you must specify a value for the ContainerHostName parameter of every ContainerDefinition in that pipeline.

Constraints:

min: 0

max: 63

pattern: [a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

Image -> (string)

The path where inference code is stored. This can be either in Amazon EC2 Container Registry or in a Docker registry that is accessible from the same VPC that you configure for your endpoint. If you are using your own custom algorithm instead of an algorithm provided by SageMaker, the inference code must meet SageMaker requirements. SageMaker supports both registry/repository[:tag] and registry/repository[@digest] image path formats. For more information, see Using Your Own Algorithms with Amazon SageMaker .

Note
The model artifacts in an Amazon S3 bucket and the Docker image for inference container in Amazon EC2 Container Registry must be in the same region as the model or endpoint you are creating.

Constraints:

min: 0

max: 255

pattern: [\S]+

ImageConfig -> (structure)

Specifies whether the model container is in Amazon ECR or a private Docker registry accessible from your Amazon Virtual Private Cloud (VPC). For information about storing containers in a private Docker registry, see Use a Private Docker Registry for Real-Time Inference Containers .

Note
The model artifacts in an Amazon S3 bucket and the Docker image for inference container in Amazon EC2 Container Registry must be in the same region as the model or endpoint you are creating.

RepositoryAccessMode -> (string) [required]

Set this to one of the following values:

Platform - The model image is hosted in Amazon ECR.

Vpc - The model image is hosted in a private Docker registry in your VPC.

Possible values:

Platform

Vpc

RepositoryAuthConfig -> (structure)

(Optional) Specifies an authentication configuration for the private docker registry where your model image is hosted. Specify a value for this property only if you specified Vpc as the value for the RepositoryAccessMode field, and the private Docker registry where the model image is hosted requires authentication.

RepositoryCredentialsProviderArn -> (string) [required]

The Amazon Resource Name (ARN) of an Amazon Web Services Lambda function that provides credentials to authenticate to the private Docker registry where your model image is hosted. For information about how to create an Amazon Web Services Lambda function, see Create a Lambda function with the console in the Amazon Web Services Lambda Developer Guide .

Constraints:

min: 1

max: 2048

pattern: .*

Mode -> (string)

Whether the container hosts a single model or multiple models.

Possible values:

SingleModel

MultiModel

ModelDataUrl -> (string)

The S3 path where the model artifacts, which result from model training, are stored. This path must point to a single gzip compressed tar archive (.tar.gz suffix). The S3 path is required for SageMaker built-in algorithms, but not if you use your own algorithms. For more information on built-in algorithms, see Common Parameters .

Note
The model artifacts must be in an S3 bucket that is in the same region as the model or endpoint you are creating.

If you provide a value for this parameter, SageMaker uses Amazon Web Services Security Token Service to download model artifacts from the S3 path you provide. Amazon Web Services STS is activated in your Amazon Web Services account by default. If you previously deactivated Amazon Web Services STS for a region, you need to reactivate Amazon Web Services STS for that region. For more information, see Activating and Deactivating Amazon Web Services STS in an Amazon Web Services Region in the Amazon Web Services Identity and Access Management User Guide .

Warning
If you use a built-in algorithm to create a model, SageMaker requires that you provide a S3 path to the model artifacts in ModelDataUrl .

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

ModelDataSource -> (structure)

Specifies the location of ML model data to deploy.

Note
Currently you cannot use ModelDataSource in conjunction with SageMaker batch transform, SageMaker serverless endpoints, SageMaker multi-model endpoints, and SageMaker Marketplace.

S3DataSource -> (structure)

Specifies the S3 location of ML model data to deploy.

S3Uri -> (string) [required]

Specifies the S3 path of ML model data to deploy.

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

S3DataType -> (string) [required]

Specifies the type of ML model data to deploy.

If you choose S3Prefix , S3Uri identifies a key name prefix. SageMaker uses all objects that match the specified key name prefix as part of the ML model data to deploy. A valid key name prefix identified by S3Uri always ends with a forward slash (/).

If you choose S3Object , S3Uri identifies an object that is the ML model data to deploy.

Possible values:

S3Prefix

S3Object

CompressionType -> (string) [required]

Specifies how the ML model data is prepared.

If you choose Gzip and choose S3Object as the value of S3DataType , S3Uri identifies an object that is a gzip-compressed TAR archive. SageMaker will attempt to decompress and untar the object during model deployment.

If you choose None and chooose S3Object as the value of S3DataType , S3Uri identifies an object that represents an uncompressed ML model to deploy.

If you choose None and choose S3Prefix as the value of S3DataType , S3Uri identifies a key name prefix, under which all objects represents the uncompressed ML model to deploy.

If you choose None, then SageMaker will follow rules below when creating model data files under /opt/ml/model directory for use by your inference code:

If you choose S3Object as the value of S3DataType , then SageMaker will split the key of the S3 object referenced by S3Uri by slash (/), and use the last part as the filename of the file holding the content of the S3 object.

If you choose S3Prefix as the value of S3DataType , then for each S3 object under the key name pefix referenced by S3Uri , SageMaker will trim its key by the prefix, and use the remainder as the path (relative to /opt/ml/model ) of the file holding the content of the S3 object. SageMaker will split the remainder by slash (/), using intermediate parts as directory names and the last part as filename of the file holding the content of the S3 object.

Do not use any of the following as file names or directory names:

An empty or blank string

A string which contains null bytes

A string longer than 255 bytes

A single dot (. )

A double dot (.. )

Ambiguous file names will result in model deployment failure. For example, if your uncompressed ML model consists of two S3 objects s3://mybucket/model/weights and s3://mybucket/model/weights/part1 and you specify s3://mybucket/model/ as the value of S3Uri and S3Prefix as the value of S3DataType , then it will result in name clash between /opt/ml/model/weights (a regular file) and /opt/ml/model/weights/ (a directory).

Do not organize the model artifacts in S3 console using folders . When you create a folder in S3 console, S3 creates a 0-byte object with a key set to the folder name you provide. They key of the 0-byte object ends with a slash (/) which violates SageMaker restrictions on model artifact file names, leading to model deployment failure.

Possible values:

None

Gzip

ModelAccessConfig -> (structure)

Specifies the access configuration file for the ML model. You can explicitly accept the model end-user license agreement (EULA) within the ModelAccessConfig . You are responsible for reviewing and complying with any applicable license terms and making sure they are acceptable for your use case before downloading or using a model.

AcceptEula -> (boolean) [required]

Specifies agreement to the model end-user license agreement (EULA). The AcceptEula value must be explicitly defined as True in order to accept the EULA that this model requires. You are responsible for reviewing and complying with any applicable license terms and making sure they are acceptable for your use case before downloading or using a model.

HubAccessConfig -> (structure)

Configuration information for hub access.

HubContentArn -> (string) [required]

The ARN of the hub content for which deployment access is allowed.

Constraints:

min: 0

max: 255

pattern: .*

ManifestS3Uri -> (string)

The Amazon S3 URI of the manifest file. The manifest file is a CSV file that stores the artifact locations.

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

ETag -> (string)

The ETag associated with S3 URI.

ManifestEtag -> (string)

The ETag associated with Manifest S3 URI.

AdditionalModelDataSources -> (list)

Data sources that are available to your model in addition to the one that you specify for ModelDataSource when you use the CreateModel action.

Constraints:

min: 0

max: 5

(structure)

Data sources that are available to your model in addition to the one that you specify for ModelDataSource when you use the CreateModel action.

ChannelName -> (string) [required]

A custom name for this AdditionalModelDataSource object.

Constraints:

min: 1

max: 64

pattern: [A-Za-z0-9\.\-_]+

S3DataSource -> (structure) [required]

Specifies the S3 location of ML model data to deploy.

S3Uri -> (string) [required]

Specifies the S3 path of ML model data to deploy.

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

S3DataType -> (string) [required]

Specifies the type of ML model data to deploy.

If you choose S3Prefix , S3Uri identifies a key name prefix. SageMaker uses all objects that match the specified key name prefix as part of the ML model data to deploy. A valid key name prefix identified by S3Uri always ends with a forward slash (/).

If you choose S3Object , S3Uri identifies an object that is the ML model data to deploy.

Possible values:

S3Prefix

S3Object

CompressionType -> (string) [required]

Specifies how the ML model data is prepared.

If you choose Gzip and choose S3Object as the value of S3DataType , S3Uri identifies an object that is a gzip-compressed TAR archive. SageMaker will attempt to decompress and untar the object during model deployment.

If you choose None and chooose S3Object as the value of S3DataType , S3Uri identifies an object that represents an uncompressed ML model to deploy.

If you choose None and choose S3Prefix as the value of S3DataType , S3Uri identifies a key name prefix, under which all objects represents the uncompressed ML model to deploy.

If you choose None, then SageMaker will follow rules below when creating model data files under /opt/ml/model directory for use by your inference code:

If you choose S3Object as the value of S3DataType , then SageMaker will split the key of the S3 object referenced by S3Uri by slash (/), and use the last part as the filename of the file holding the content of the S3 object.

If you choose S3Prefix as the value of S3DataType , then for each S3 object under the key name pefix referenced by S3Uri , SageMaker will trim its key by the prefix, and use the remainder as the path (relative to /opt/ml/model ) of the file holding the content of the S3 object. SageMaker will split the remainder by slash (/), using intermediate parts as directory names and the last part as filename of the file holding the content of the S3 object.

Do not use any of the following as file names or directory names:

An empty or blank string

A string which contains null bytes

A string longer than 255 bytes

A single dot (. )

A double dot (.. )

Ambiguous file names will result in model deployment failure. For example, if your uncompressed ML model consists of two S3 objects s3://mybucket/model/weights and s3://mybucket/model/weights/part1 and you specify s3://mybucket/model/ as the value of S3Uri and S3Prefix as the value of S3DataType , then it will result in name clash between /opt/ml/model/weights (a regular file) and /opt/ml/model/weights/ (a directory).

Do not organize the model artifacts in S3 console using folders . When you create a folder in S3 console, S3 creates a 0-byte object with a key set to the folder name you provide. They key of the 0-byte object ends with a slash (/) which violates SageMaker restrictions on model artifact file names, leading to model deployment failure.

Possible values:

None

Gzip

ModelAccessConfig -> (structure)

Specifies the access configuration file for the ML model. You can explicitly accept the model end-user license agreement (EULA) within the ModelAccessConfig . You are responsible for reviewing and complying with any applicable license terms and making sure they are acceptable for your use case before downloading or using a model.

AcceptEula -> (boolean) [required]

Specifies agreement to the model end-user license agreement (EULA). The AcceptEula value must be explicitly defined as True in order to accept the EULA that this model requires. You are responsible for reviewing and complying with any applicable license terms and making sure they are acceptable for your use case before downloading or using a model.

HubAccessConfig -> (structure)

Configuration information for hub access.

HubContentArn -> (string) [required]

The ARN of the hub content for which deployment access is allowed.

Constraints:

min: 0

max: 255

pattern: .*

ManifestS3Uri -> (string)

The Amazon S3 URI of the manifest file. The manifest file is a CSV file that stores the artifact locations.

Constraints:

min: 0

max: 1024

pattern: (https|s3)://([^/]+)/?(.*)

ETag -> (string)

The ETag associated with S3 URI.

ManifestEtag -> (string)

The ETag associated with Manifest S3 URI.

Environment -> (map)

The environment variables to set in the Docker container. Don’t include any sensitive data in your environment variables.

The maximum length of each key and value in the Environment map is 1024 bytes. The maximum length of all keys and values in the map, combined, is 32 KB. If you pass multiple containers to a CreateModel request, then the maximum length of all of their maps, combined, is also 32 KB.

Constraints:

min: 0

max: 100

key -> (string)

Constraints:

min: 0

max: 1024

pattern: [a-zA-Z_][a-zA-Z0-9_]*

value -> (string)

Constraints:

min: 0

max: 1024

pattern: [\S\s]*

ModelPackageName -> (string)

The name or Amazon Resource Name (ARN) of the model package to use to create the model.

Constraints:

min: 1

max: 176

pattern: (arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:[a-z\-]*\/)?([a-zA-Z0-9]([a-zA-Z0-9-]){0,62})(?<!-)(\/[0-9]{1,9})?

InferenceSpecificationName -> (string)

The inference specification name in the model package version.

Constraints:

min: 1

max: 63

pattern: [a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

MultiModelConfig -> (structure)

Specifies additional configuration for multi-model endpoints.

ModelCacheSetting -> (string)

Whether to cache models for a multi-model endpoint. By default, multi-model endpoints cache models so that a model does not have to be loaded into memory each time it is invoked. Some use cases do not benefit from model caching. For example, if an endpoint hosts a large number of models that are each invoked infrequently, the endpoint might perform better if you disable model caching. To disable model caching, set the value of this parameter to Disabled .

Possible values:

Enabled

Disabled

ContainerMetricsConfig -> (structure)

The configuration for container metrics scraping. Specifies the metrics endpoint path and publishing frequency. If not specified when EnableDetailedObservability is True , the default path /metrics on port 8080 is used. For first-party and Deep Learning Containers (DLC), the endpoint path is determined automatically and this configuration is optional.

MetricsEndpoints -> (list)

A list of metrics endpoints to scrape from the container. Each endpoint specifies the path where the container exposes Prometheus-formatted metrics and the frequency at which to publish them. You can specify a maximum of 1 endpoint.

Constraints:

min: 0

max: 1

(structure)

Specifies a metrics endpoint for a container, including the path where the container exposes Prometheus-formatted metrics and the frequency at which to publish them to Amazon CloudWatch.

MetricsEndpointPath -> (string) [required]

The path to the metrics endpoint exposed by the container. For example, /metrics or /server/metrics . The path must start with / and can contain alphanumeric characters, forward slashes, underscores, hyphens, and periods. Maximum length is 256 characters. If not specified, defaults to /metrics .

Constraints:

min: 0

max: 256

pattern: /(?!.*\.\.)[a-zA-Z0-9/_.\-]+

MetricPublishFrequencyInSeconds -> (integer)

The interval, in seconds, at which container metrics scraped from the endpoint are published to Amazon CloudWatch. Valid values: 10 , 30 , 60 , 120 , 180 , 240 , 300 . Defaults to 60 .

InferenceExecutionConfig -> (structure)

Specifies details of how containers in a multi-container endpoint are called.

Mode -> (string) [required]

How containers in a multi-container are run. The following values are valid.

SERIAL - Containers run as a serial pipeline.

DIRECT - Only the individual container that you specify is run.

Possible values:

Serial

Direct

ExecutionRoleArn -> (string)

The Amazon Resource Name (ARN) of the IAM role that you specified for the model.

Constraints:

min: 20

max: 2048

pattern: arn:aws[a-z\-]*:iam::\d{12}:role/?[a-zA-Z_0-9+=,.@\-_/]+

VpcConfig -> (structure)

A VpcConfig object that specifies the VPC that this model has access to. For more information, see Protect Endpoints by Using an Amazon Virtual Private Cloud

SecurityGroupIds -> (list) [required]

The VPC security group IDs, in the form sg-xxxxxxxx . Specify the security groups for the VPC that is specified in the Subnets field.

Constraints:

min: 1

max: 5

(string)

Constraints:

min: 0

max: 32

pattern: [-0-9a-zA-Z]+

Subnets -> (list) [required]

The ID of the subnets in the VPC to which you want to connect your training job or model. For information about the availability of specific instance types, see Supported Instance Types and Availability Zones .

Constraints:

min: 1

max: 16

(string)

Constraints:

min: 0

max: 32

pattern: [-0-9a-zA-Z]+

CreationTime -> (timestamp)

A timestamp that shows when the model was created.

ModelArn -> (string)

The Amazon Resource Name (ARN) of the model.

Constraints:

min: 20

max: 2048

pattern: arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:model/.*

EnableNetworkIsolation -> (boolean)

If True , no inbound or outbound network calls can be made to or from the model container.

DeploymentRecommendation -> (structure)

A set of recommended deployment configurations for the model.

RecommendationStatus -> (string) [required]

Status of the deployment recommendation. The status NOT_APPLICABLE means that SageMaker is unable to provide a default recommendation for the model using the information provided. If the deployment status is IN_PROGRESS , retry your API call after a few seconds to get a COMPLETED deployment recommendation.

Possible values:

IN_PROGRESS

COMPLETED

FAILED

NOT_APPLICABLE

RealTimeInferenceRecommendations -> (list)

A list of RealTimeInferenceRecommendation items.

Constraints:

min: 0

max: 3

(structure)

The recommended configuration to use for Real-Time Inference.

RecommendationId -> (string) [required]

The recommendation ID which uniquely identifies each recommendation.

InstanceType -> (string) [required]

The recommended instance type for Real-Time Inference.

Possible values:

ml.t2.medium

ml.t2.large

ml.t2.xlarge

ml.t2.2xlarge

ml.m4.xlarge

ml.m4.2xlarge

ml.m4.4xlarge

ml.m4.10xlarge

ml.m4.16xlarge

ml.m5.large

ml.m5.xlarge

ml.m5.2xlarge

ml.m5.4xlarge

ml.m5.12xlarge

ml.m5.24xlarge

ml.m5d.large

ml.m5d.xlarge

ml.m5d.2xlarge

ml.m5d.4xlarge

ml.m5d.12xlarge

ml.m5d.24xlarge

ml.c4.large

ml.c4.xlarge

ml.c4.2xlarge

ml.c4.4xlarge

ml.c4.8xlarge

ml.p2.xlarge

ml.p2.8xlarge

ml.p2.16xlarge

ml.p3.2xlarge

ml.p3.8xlarge

ml.p3.16xlarge

ml.c5.large

ml.c5.xlarge

ml.c5.2xlarge

ml.c5.4xlarge

ml.c5.9xlarge

ml.c5.18xlarge

ml.c5d.large

ml.c5d.xlarge

ml.c5d.2xlarge

ml.c5d.4xlarge

ml.c5d.9xlarge

ml.c5d.18xlarge

ml.g4dn.xlarge

ml.g4dn.2xlarge

ml.g4dn.4xlarge

ml.g4dn.8xlarge

ml.g4dn.12xlarge

ml.g4dn.16xlarge

ml.r5.large

ml.r5.xlarge

ml.r5.2xlarge

ml.r5.4xlarge

ml.r5.12xlarge

ml.r5.24xlarge

ml.r5d.large

ml.r5d.xlarge

ml.r5d.2xlarge

ml.r5d.4xlarge

ml.r5d.12xlarge

ml.r5d.24xlarge

ml.inf1.xlarge

ml.inf1.2xlarge

ml.inf1.6xlarge

ml.inf1.24xlarge

ml.dl1.24xlarge

ml.c6i.large

ml.c6i.xlarge

ml.c6i.2xlarge

ml.c6i.4xlarge

ml.c6i.8xlarge

ml.c6i.12xlarge

ml.c6i.16xlarge

ml.c6i.24xlarge

ml.c6i.32xlarge

ml.m6i.large

ml.m6i.xlarge

ml.m6i.2xlarge

ml.m6i.4xlarge

ml.m6i.8xlarge

ml.m6i.12xlarge

ml.m6i.16xlarge

ml.m6i.24xlarge

ml.m6i.32xlarge

ml.r6i.large

ml.r6i.xlarge

ml.r6i.2xlarge

ml.r6i.4xlarge

ml.r6i.8xlarge

ml.r6i.12xlarge

ml.r6i.16xlarge

ml.r6i.24xlarge

ml.r6i.32xlarge

ml.g5.xlarge

ml.g5.2xlarge

ml.g5.4xlarge

ml.g5.8xlarge

ml.g5.12xlarge

ml.g5.16xlarge

ml.g5.24xlarge

ml.g5.48xlarge

ml.g6.xlarge

ml.g6.2xlarge

ml.g6.4xlarge

ml.g6.8xlarge

ml.g6.12xlarge

ml.g6.16xlarge

ml.g6.24xlarge

ml.g6.48xlarge

ml.r8g.medium

ml.r8g.large

ml.r8g.xlarge

ml.r8g.2xlarge

ml.r8g.4xlarge

ml.r8g.8xlarge

ml.r8g.12xlarge

ml.r8g.16xlarge

ml.r8g.24xlarge

ml.r8g.48xlarge

ml.g6e.xlarge

ml.g6e.2xlarge

ml.g6e.4xlarge

ml.g6e.8xlarge

ml.g6e.12xlarge

ml.g6e.16xlarge

ml.g6e.24xlarge

ml.g6e.48xlarge

ml.g7e.2xlarge

ml.g7e.4xlarge

ml.g7e.8xlarge

ml.g7e.12xlarge

ml.g7e.24xlarge

ml.g7e.48xlarge

ml.g7.2xlarge

ml.g7.4xlarge

ml.g7.8xlarge

ml.g7.12xlarge

ml.g7.24xlarge

ml.g7.48xlarge

ml.p4d.24xlarge

ml.c7g.large

ml.c7g.xlarge

ml.c7g.2xlarge

ml.c7g.4xlarge

ml.c7g.8xlarge

ml.c7g.12xlarge

ml.c7g.16xlarge

ml.m6g.large

ml.m6g.xlarge

ml.m6g.2xlarge

ml.m6g.4xlarge

ml.m6g.8xlarge

ml.m6g.12xlarge

ml.m6g.16xlarge

ml.m6gd.large

ml.m6gd.xlarge

ml.m6gd.2xlarge

ml.m6gd.4xlarge

ml.m6gd.8xlarge

ml.m6gd.12xlarge

ml.m6gd.16xlarge

ml.c6g.large

ml.c6g.xlarge

ml.c6g.2xlarge

ml.c6g.4xlarge

ml.c6g.8xlarge

ml.c6g.12xlarge

ml.c6g.16xlarge

ml.c6gd.large

ml.c6gd.xlarge

ml.c6gd.2xlarge

ml.c6gd.4xlarge

ml.c6gd.8xlarge

ml.c6gd.12xlarge

ml.c6gd.16xlarge

ml.c6gn.large

ml.c6gn.xlarge

ml.c6gn.2xlarge

ml.c6gn.4xlarge

ml.c6gn.8xlarge

ml.c6gn.12xlarge

ml.c6gn.16xlarge

ml.r6g.large

ml.r6g.xlarge

ml.r6g.2xlarge

ml.r6g.4xlarge

ml.r6g.8xlarge

ml.r6g.12xlarge

ml.r6g.16xlarge

ml.r6gd.large

ml.r6gd.xlarge

ml.r6gd.2xlarge

ml.r6gd.4xlarge

ml.r6gd.8xlarge

ml.r6gd.12xlarge

ml.r6gd.16xlarge

ml.p4de.24xlarge

ml.trn1.2xlarge

ml.trn1.32xlarge

ml.trn1n.32xlarge

ml.trn2.48xlarge

ml.inf2.xlarge

ml.inf2.8xlarge

ml.inf2.24xlarge

ml.inf2.48xlarge

ml.p5.48xlarge

ml.p5e.48xlarge

ml.p5en.48xlarge

ml.m7i.large

ml.m7i.xlarge

ml.m7i.2xlarge

ml.m7i.4xlarge

ml.m7i.8xlarge

ml.m7i.12xlarge

ml.m7i.16xlarge

ml.m7i.24xlarge

ml.m7i.48xlarge

ml.c7i.large

ml.c7i.xlarge

ml.c7i.2xlarge

ml.c7i.4xlarge

ml.c7i.8xlarge

ml.c7i.12xlarge

ml.c7i.16xlarge

ml.c7i.24xlarge

ml.c7i.48xlarge

ml.r7i.large

ml.r7i.xlarge

ml.r7i.2xlarge

ml.r7i.4xlarge

ml.r7i.8xlarge

ml.r7i.12xlarge

ml.r7i.16xlarge

ml.r7i.24xlarge

ml.r7i.48xlarge

ml.c8g.medium

ml.c8g.large

ml.c8g.xlarge

ml.c8g.2xlarge

ml.c8g.4xlarge

ml.c8g.8xlarge

ml.c8g.12xlarge

ml.c8g.16xlarge

ml.c8g.24xlarge

ml.c8g.48xlarge

ml.r7gd.medium

ml.r7gd.large

ml.r7gd.xlarge

ml.r7gd.2xlarge

ml.r7gd.4xlarge

ml.r7gd.8xlarge

ml.r7gd.12xlarge

ml.r7gd.16xlarge

ml.m8g.medium

ml.m8g.large

ml.m8g.xlarge

ml.m8g.2xlarge

ml.m8g.4xlarge

ml.m8g.8xlarge

ml.m8g.12xlarge

ml.m8g.16xlarge

ml.m8g.24xlarge

ml.m8g.48xlarge

ml.c6in.large

ml.c6in.xlarge

ml.c6in.2xlarge

ml.c6in.4xlarge

ml.c6in.8xlarge

ml.c6in.12xlarge

ml.c6in.16xlarge

ml.c6in.24xlarge

ml.c6in.32xlarge

ml.p6-b200.48xlarge

ml.p6-b300.48xlarge

ml.p6e-gb200.36xlarge

ml.p5.4xlarge

Environment -> (map)

The recommended environment variables to set in the model container for Real-Time Inference.

Constraints:

min: 0

max: 100

key -> (string)

Constraints:

min: 0

max: 1024

pattern: [a-zA-Z_][a-zA-Z0-9_]*

value -> (string)

Constraints:

min: 0

max: 1024

pattern: [\S\s]*

Table of Contents

Feedback

User Guide

describe-model¶

Description¶

Synopsis¶

Options¶

Global Options¶

Output¶

Note

Note

Note

Warning

Note

Note

Note

Note

Warning

Note