DescribeInferenceComponent
Returns information about an inference component.
Request Syntax
{
"InferenceComponentName": "string
"
}
Request Parameters
For information about the parameters that are common to all actions, see Common Parameters.
The request accepts the following data in JSON format.
- InferenceComponentName
-
The name of the inference component.
Type: String
Length Constraints: Maximum length of 63.
Pattern:
^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?$
Required: Yes
Response Syntax
{
"CreationTime": number,
"EndpointArn": "string",
"EndpointName": "string",
"FailureReason": "string",
"InferenceComponentArn": "string",
"InferenceComponentName": "string",
"InferenceComponentStatus": "string",
"LastModifiedTime": number,
"RuntimeConfig": {
"CurrentCopyCount": number,
"DesiredCopyCount": number
},
"Specification": {
"ComputeResourceRequirements": {
"MaxMemoryRequiredInMb": number,
"MinMemoryRequiredInMb": number,
"NumberOfAcceleratorDevicesRequired": number,
"NumberOfCpuCoresRequired": number
},
"Container": {
"ArtifactUrl": "string",
"DeployedImage": {
"ResolutionTime": number,
"ResolvedImage": "string",
"SpecifiedImage": "string"
},
"Environment": {
"string" : "string"
}
},
"ModelName": "string",
"StartupParameters": {
"ContainerStartupHealthCheckTimeoutInSeconds": number,
"ModelDataDownloadTimeoutInSeconds": number
}
},
"VariantName": "string"
}
Response Elements
If the action is successful, the service sends back an HTTP 200 response.
The following data is returned in JSON format by the service.
- CreationTime
-
The time when the inference component was created.
Type: Timestamp
- EndpointArn
-
The Amazon Resource Name (ARN) of the endpoint that hosts the inference component.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
Pattern:
arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:endpoint/.*
- EndpointName
-
The name of the endpoint that hosts the inference component.
Type: String
Length Constraints: Maximum length of 63.
Pattern:
^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}
- FailureReason
-
If the inference component status is
Failed
, the reason for the failure.Type: String
Length Constraints: Maximum length of 1024.
- InferenceComponentArn
-
The Amazon Resource Name (ARN) of the inference component.
Type: String
Length Constraints: Minimum length of 20. Maximum length of 2048.
- InferenceComponentName
-
The name of the inference component.
Type: String
Length Constraints: Maximum length of 63.
Pattern:
^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?$
- InferenceComponentStatus
-
The status of the inference component.
Type: String
Valid Values:
InService | Creating | Updating | Failed | Deleting
- LastModifiedTime
-
The time when the inference component was last updated.
Type: Timestamp
- RuntimeConfig
-
Details about the runtime settings for the model that is deployed with the inference component.
Type: InferenceComponentRuntimeConfigSummary object
- Specification
-
Details about the resources that are deployed with this inference component.
Type: InferenceComponentSpecificationSummary object
- VariantName
-
The name of the production variant that hosts the inference component.
Type: String
Length Constraints: Maximum length of 63.
Pattern:
^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}
Errors
For information about the errors that are common to all actions, see Common Errors.
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: