AWS::SageMaker::Endpoint RollingUpdatePolicy
Specifies a rolling deployment strategy for updating a SageMaker endpoint.
Syntax
To declare this entity in your AWS CloudFormation template, use the following syntax:
JSON
{ "MaximumBatchSize" :
CapacitySize
, "MaximumExecutionTimeoutInSeconds" :Integer
, "RollbackMaximumBatchSize" :CapacitySize
, "WaitIntervalInSeconds" :Integer
}
YAML
MaximumBatchSize:
CapacitySize
MaximumExecutionTimeoutInSeconds:Integer
RollbackMaximumBatchSize:CapacitySize
WaitIntervalInSeconds:Integer
Properties
MaximumBatchSize
-
Batch size for each rolling step to provision capacity and turn on traffic on the new endpoint fleet, and terminate capacity on the old endpoint fleet. Value must be between 5% to 50% of the variant's total instance count.
Required: Yes
Type: CapacitySize
Update requires: No interruption
MaximumExecutionTimeoutInSeconds
-
The time limit for the total deployment. Exceeding this limit causes a timeout.
Required: No
Type: Integer
Minimum:
600
Maximum:
28800
Update requires: No interruption
RollbackMaximumBatchSize
-
Batch size for rollback to the old endpoint fleet. Each rolling step to provision capacity and turn on traffic on the old endpoint fleet, and terminate capacity on the new endpoint fleet. If this field is absent, the default value will be set to 100% of total capacity which means to bring up the whole capacity of the old fleet at once during rollback.
Required: No
Type: CapacitySize
Update requires: No interruption
WaitIntervalInSeconds
-
The length of the baking period, during which SageMaker monitors alarms for each batch on the new fleet.
Required: Yes
Type: Integer
Minimum:
0
Maximum:
3600
Update requires: No interruption