UpdateInferenceComponentRuntimeConfig
Runtime settings for a model that is deployed with an inference component.
Request Syntax
{
"DesiredRuntimeConfig": {
"CopyCount": number
},
"InferenceComponentName": "string"
}
Request Parameters
For information about the parameters that are common to all actions, see Common Parameters.
The request accepts the following data in JSON format.
- DesiredRuntimeConfig
-
Runtime settings for a model that is deployed with an inference component.
Type: InferenceComponentRuntimeConfig object
Required: Yes
- InferenceComponentName
-
The name of the inference component to update.
Type: String
Length Constraints: Maximum length of 63.
Pattern: ^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?$
Required: Yes
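The constraints above can be checked on the client side before a request is sent. The following is a minimal sketch, not part of any SDK: `build_request` is a hypothetical helper, and the non-negative check on `CopyCount` is an assumption, since this page only states that the value is a number.

```python
import json
import re

# Pattern and maximum length copied from the InferenceComponentName constraints.
NAME_PATTERN = re.compile(r"^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?$")
MAX_NAME_LENGTH = 63


def build_request(name: str, copy_count: int) -> dict:
    """Return the request body as a dict (hypothetical helper)."""
    if len(name) > MAX_NAME_LENGTH:
        raise ValueError("InferenceComponentName exceeds 63 characters")
    # fullmatch rejects a trailing newline that a plain match with '$' would allow.
    if not NAME_PATTERN.fullmatch(name):
        raise ValueError("InferenceComponentName does not match the pattern")
    # Assumption: CopyCount is a non-negative integer (bool is excluded explicitly).
    if isinstance(copy_count, bool) or not isinstance(copy_count, int) or copy_count < 0:
        raise ValueError("CopyCount must be a non-negative integer")
    return {
        "DesiredRuntimeConfig": {"CopyCount": copy_count},
        "InferenceComponentName": name,
    }


if __name__ == "__main__":
    # Prints the JSON body in the shape shown under Request Syntax.
    print(json.dumps(build_request("my-component", 2), indent=2))
```

The name `my-component` is a placeholder. Validating locally only saves a round trip; the service remains the authority on what it accepts.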
Response Syntax
{
"InferenceComponentArn": "string"
}
Response Elements
If the action is successful, the service sends back an HTTP 200 response.
The following data is returned in JSON format by the service.
- InferenceComponentArn
-
The Amazon Resource Name (ARN) of the inference component.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
Errors
For information about the errors that are common to all actions, see Common Errors.
- ResourceLimitExceeded
-
You have exceeded a SageMaker resource limit. For example, you might have too many training jobs created.
HTTP Status Code: 400
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: