StartTrainedModelInferenceJob
Defines the information necessary to begin a trained model inference job.
Request Syntax
POST /memberships/membershipIdentifier/trained-model-inference-jobs HTTP/1.1
Content-type: application/json
{
   "configuredModelAlgorithmAssociationArn": "string",
   "containerExecutionParameters": {
      "maxPayloadInMB": number
   },
   "dataSource": {
      "mlInputChannelArn": "string"
   },
   "description": "string",
   "environment": {
      "string" : "string"
   },
   "kmsKeyArn": "string",
   "name": "string",
   "outputConfiguration": {
      "accept": "string",
      "members": [
         {
            "accountId": "string"
         }
      ]
   },
   "resourceConfig": {
      "instanceCount": number,
      "instanceType": "string"
   },
   "tags": {
      "string" : "string"
   },
   "trainedModelArn": "string"
}
URI Request Parameters
The request uses the following URI parameters.
- membershipIdentifier
-
The membership ID of the membership that contains the trained model inference job.
Length Constraints: Fixed length of 36.
Pattern:
[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}
Required: Yes
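As a minimal sketch of how the URI parameter constraint above might be checked on the client before a request is sent, the following Python validates a membership ID against the documented pattern and builds the request path. The helper name build_inference_job_path and the sample ID in the usage note are hypothetical and used only for illustration.

```python
import re

# Pattern as documented for membershipIdentifier; a full match
# also implies the documented fixed length of 36 characters.
MEMBERSHIP_ID_RE = re.compile(
    r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}"
)


def build_inference_job_path(membership_id: str) -> str:
    """Return the request path for StartTrainedModelInferenceJob.

    Hypothetical helper: raises ValueError when the ID does not match
    the documented pattern.
    """
    if not MEMBERSHIP_ID_RE.fullmatch(membership_id):
        raise ValueError(f"invalid membershipIdentifier: {membership_id!r}")
    return f"/memberships/{membership_id}/trained-model-inference-jobs"
```

For example, build_inference_job_path("01234567-89ab-cdef-0123-456789abcdef") returns the path used on the POST line of the request syntax, while an uppercase or truncated ID raises ValueError.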
Request Body
The request accepts the following data in JSON format.
- configuredModelAlgorithmAssociationArn
-
The Amazon Resource Name (ARN) of the configured model algorithm association that is used for this trained model inference job.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
Pattern:
arn:aws[-a-z]*:cleanrooms-ml:[-a-z0-9]+:[0-9]{12}:membership/[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}/configured-model-algorithm-association/[-a-zA-Z0-9_/.]+
Required: No
- containerExecutionParameters
-
The execution parameters for the container.
Type: InferenceContainerExecutionParameters object
Required: No
- dataSource
-
Defines the data source that is used for the trained model inference job.
Type: ModelInferenceDataSource object
Required: Yes
- description
-
The description of the trained model inference job.
Type: String
Length Constraints: Minimum length of 0. Maximum length of 255.
Pattern:
[\u0020-\uD7FF\uE000-\uFFFD\uD800\uDBFF-\uDC00\uDFFF\t\r\n]*
Required: No
- environment
-
The environment variables to set in the Docker container.
Type: String to string map
Map Entries: Minimum number of 0 items. Maximum number of 16 items.
Key Length Constraints: Minimum length of 1. Maximum length of 1024.
Key Pattern:
[a-zA-Z_][a-zA-Z0-9_]*
Value Length Constraints: Minimum length of 1. Maximum length of 10240.
Value Pattern:
[\S\s]*
Required: No
- kmsKeyArn
-
The Amazon Resource Name (ARN) of the AWS KMS key. This key is used to encrypt and decrypt customer-owned data in the ML inference job and associated data.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
Pattern:
arn:aws[-a-z]*:kms:[-a-z0-9]+:[0-9]{12}:key/.+
Required: No
- name
-
The name of the trained model inference job.
Type: String
Length Constraints: Minimum length of 1. Maximum length of 63.
Pattern:
(?!\s*$)[\u0020-\uD7FF\uE000-\uFFFD\uD800\uDBFF-\uDC00\uDFFF\t]*
Required: Yes
- outputConfiguration
-
Defines the output configuration information for the trained model inference job.
Type: InferenceOutputConfiguration object
Required: Yes
- resourceConfig
-
Defines the resource configuration for the trained model inference job.
Type: InferenceResourceConfig object
Required: Yes
- tags
-
The optional metadata that you apply to the resource to help you categorize and organize it. Each tag consists of a key and an optional value, both of which you define.
The following basic restrictions apply to tags:
-
Maximum number of tags per resource - 50.
-
For each resource, each tag key must be unique, and each tag key can have only one value.
-
Maximum key length - 128 Unicode characters in UTF-8.
-
Maximum value length - 256 Unicode characters in UTF-8.
-
If your tagging schema is used across multiple services and resources, remember that other services may have restrictions on allowed characters. Generally allowed characters are: letters, numbers, and spaces representable in UTF-8, and the following characters: + - = . _ : / @.
-
Tag keys and values are case sensitive.
-
Do not use aws:, AWS:, or any upper or lowercase combination of these as a prefix for keys, because the prefix is reserved for AWS use. You cannot edit or delete tag keys with this prefix. Values can have this prefix. If a tag value has aws as its prefix but the key does not, Clean Rooms ML considers it a user tag, and it counts against the limit of 50 tags. Tags with only the key prefix of aws do not count against your tags per resource limit.
Type: String to string map
Map Entries: Minimum number of 0 items. Maximum number of 200 items.
Key Length Constraints: Minimum length of 1. Maximum length of 128.
Value Length Constraints: Minimum length of 0. Maximum length of 256.
Required: No
- trainedModelArn
-
The Amazon Resource Name (ARN) of the trained model that is used for this trained model inference job.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
Pattern:
arn:aws[-a-z]*:cleanrooms-ml:[-a-z0-9]+:[0-9]{12}:membership/[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}/trained-model/[-a-zA-Z0-9_/.]+
Required: Yes
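The request body fields above can be assembled as in the following Python sketch, which applies a few of the documented constraints (name length, trainedModelArn pattern, environment map limits) before serializing. This is an illustration under stated assumptions, not the service's own validation: the helper names are hypothetical, only the top-level fields marked Required: Yes are included by default, and which nested fields the referenced objects require is not stated in this section, so the sketch fills in the ones shown in the request syntax.

```python
import json
import re

# Pattern as documented for trainedModelArn.
TRAINED_MODEL_ARN_RE = re.compile(
    r"arn:aws[-a-z]*:cleanrooms-ml:[-a-z0-9]+:[0-9]{12}:membership/"
    r"[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}"
    r"/trained-model/[-a-zA-Z0-9_/.]+"
)
# Key pattern as documented for the environment map.
ENV_KEY_RE = re.compile(r"[a-zA-Z_][a-zA-Z0-9_]*")


def validate_environment(environment: dict) -> None:
    """Check the documented environment limits (hypothetical helper)."""
    if len(environment) > 16:
        raise ValueError("environment accepts at most 16 entries")
    for key, value in environment.items():
        if not (1 <= len(key) <= 1024 and ENV_KEY_RE.fullmatch(key)):
            raise ValueError(f"invalid environment key: {key!r}")
        if not 1 <= len(value) <= 10240:
            raise ValueError(f"invalid environment value for {key!r}")


def build_request_body(name, trained_model_arn, ml_input_channel_arn,
                       instance_type, member_account_ids, environment=None):
    """Build a request body dict (hypothetical helper)."""
    # Approximates the documented name pattern: 1 to 63 characters and
    # not whitespace only. The character-class check is omitted here.
    if not 1 <= len(name) <= 63 or not name.strip():
        raise ValueError("name must be 1-63 characters and not blank")
    if not (20 <= len(trained_model_arn) <= 2048
            and TRAINED_MODEL_ARN_RE.fullmatch(trained_model_arn)):
        raise ValueError("trainedModelArn does not match the documented pattern")
    body = {
        "name": name,
        "trainedModelArn": trained_model_arn,
        "dataSource": {"mlInputChannelArn": ml_input_channel_arn},
        "outputConfiguration": {
            "members": [{"accountId": a} for a in member_account_ids]
        },
        "resourceConfig": {"instanceType": instance_type},
    }
    if environment:
        validate_environment(environment)
        body["environment"] = dict(environment)
    return body


def to_json(body: dict) -> str:
    """Serialize the body for the Content-type: application/json request."""
    return json.dumps(body)
```

Validating before sending keeps obviously malformed requests from reaching the service, where they would otherwise be rejected with a ValidationException.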
Response Syntax
HTTP/1.1 200
Content-type: application/json
{
   "trainedModelInferenceJobArn": "string"
}
Response Elements
If the action is successful, the service sends back an HTTP 200 response.
The following data is returned in JSON format by the service.
- trainedModelInferenceJobArn
-
The Amazon Resource Name (ARN) of the trained model inference job.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
Pattern:
arn:aws[-a-z]*:cleanrooms-ml:[-a-z0-9]+:[0-9]{12}:membership/[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}/trained-model-inference-job/[-a-zA-Z0-9_/.]+
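A minimal sketch of handling the response, assuming the raw JSON body is already available as a string: the following Python reads trainedModelInferenceJobArn and checks it against the documented pattern. The named groups membership_id and job_id are additions made here for convenience and are not part of the documented pattern; the helper name is hypothetical.

```python
import json
import re

# Documented pattern for trainedModelInferenceJobArn, with named groups
# added for illustration so the ARN's parts can be read back.
JOB_ARN_RE = re.compile(
    r"arn:aws[-a-z]*:cleanrooms-ml:[-a-z0-9]+:[0-9]{12}:membership/"
    r"(?P<membership_id>[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}"
    r"-[0-9a-f]{4}-[0-9a-f]{12})"
    r"/trained-model-inference-job/(?P<job_id>[-a-zA-Z0-9_/.]+)"
)


def parse_start_job_response(raw_body: str) -> dict:
    """Extract and validate the job ARN from a 200 response body."""
    arn = json.loads(raw_body)["trainedModelInferenceJobArn"]
    match = JOB_ARN_RE.fullmatch(arn)
    if match is None or not 20 <= len(arn) <= 2048:
        raise ValueError(f"unexpected trainedModelInferenceJobArn: {arn!r}")
    return {
        "arn": arn,
        "membership_id": match["membership_id"],
        "job_id": match["job_id"],
    }
```

The returned membership ID can be compared with the membershipIdentifier used in the request URI as a simple consistency check.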
Errors
For information about the errors that are common to all actions, see Common Errors.
- AccessDeniedException
-
You do not have sufficient access to perform this action.
HTTP Status Code: 403
- ConflictException
-
You can't complete this action because another resource depends on this resource.
HTTP Status Code: 409
- ResourceNotFoundException
-
The resource you are requesting does not exist.
HTTP Status Code: 404
- ServiceQuotaExceededException
-
You have exceeded your service quota.
HTTP Status Code: 402
- ValidationException
-
The request parameters for this request are incorrect.
HTTP Status Code: 400
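The error names and status codes listed above can be kept in a small lookup table, as in this Python sketch. The table mirrors this section only; the helper name describe_error is hypothetical, and errors common to all actions are not included.

```python
# Error name -> (HTTP status code, description), as listed in this section.
ERRORS = {
    "AccessDeniedException": (
        403, "You do not have sufficient access to perform this action."),
    "ConflictException": (
        409, "You can't complete this action because another resource "
             "depends on this resource."),
    "ResourceNotFoundException": (
        404, "The resource you are requesting does not exist."),
    "ServiceQuotaExceededException": (
        402, "You have exceeded your service quota."),
    "ValidationException": (
        400, "The request parameters for this request are incorrect."),
}


def describe_error(error_name: str) -> str:
    """Return a one-line summary for an error name (hypothetical helper)."""
    entry = ERRORS.get(error_name)
    if entry is None:
        # Not listed for this action; it may be one of the common errors.
        return f"{error_name}: not listed for this action (see Common Errors)"
    status, message = entry
    return f"{error_name} (HTTP {status}): {message}"
```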
See Also
For more information about using this API, see the documentation for the language-specific AWS SDKs.