interface InferenceComponentCapacitySizeProperty
Language | Type name |
---|---|
![]() | Amazon.CDK.AWS.Sagemaker.CfnInferenceComponent.InferenceComponentCapacitySizeProperty |
![]() | github.com/aws/aws-cdk-go/awscdk/v2/awssagemaker#CfnInferenceComponent_InferenceComponentCapacitySizeProperty |
![]() | software.amazon.awscdk.services.sagemaker.CfnInferenceComponent.InferenceComponentCapacitySizeProperty |
![]() | aws_cdk.aws_sagemaker.CfnInferenceComponent.InferenceComponentCapacitySizeProperty |
![]() | aws-cdk-lib » aws_sagemaker » CfnInferenceComponent » InferenceComponentCapacitySizeProperty |
Specifies the type and size of the endpoint capacity to activate for a rolling deployment or a rollback strategy.
You can specify your batches as either of the following:
- A count of inference component copies
- The overall percentage or your fleet
For a rollback strategy, if you don't specify the fields in this object, or if you set the Value
parameter to 100%, then SageMaker AI uses a blue/green rollback strategy and rolls all traffic back to the blue fleet.
Example
// The code below shows an example of how to instantiate this type.
// The values are placeholders you should change.
import { aws_sagemaker as sagemaker } from 'aws-cdk-lib';
const inferenceComponentCapacitySizeProperty: sagemaker.CfnInferenceComponent.InferenceComponentCapacitySizeProperty = {
type: 'type',
value: 123,
};
Properties
Name | Type | Description |
---|---|---|
type | string | Specifies the endpoint capacity type. |
value | number | Defines the capacity size, either as a number of inference component copies or a capacity percentage. |
type
Type:
string
Specifies the endpoint capacity type.
- COPY_COUNT - The endpoint activates based on the number of inference component copies.
- CAPACITY_PERCENT - The endpoint activates based on the specified percentage of capacity.
value
Type:
number
Defines the capacity size, either as a number of inference component copies or a capacity percentage.