AWS::SageMaker::InferenceComponent InferenceComponentRollingUpdatePolicy

RSS

Mode de mise au point

AWS::SageMaker::InferenceComponent InferenceComponentRollingUpdatePolicy - AWS CloudFormation

Syntax Properties

Cette page n'a pas été traduite dans votre langue. Demande de traduction

Specifies a rolling deployment strategy for updating a SageMaker AI inference component.

Syntax

To declare this entity in your AWS CloudFormation template, use the following syntax:

JSON


{
  "MaximumBatchSize" : InferenceComponentCapacitySize,
  "MaximumExecutionTimeoutInSeconds" : Integer,
  "RollbackMaximumBatchSize" : InferenceComponentCapacitySize,
  "WaitIntervalInSeconds" : Integer
}

YAML


  MaximumBatchSize: 
    InferenceComponentCapacitySize
  MaximumExecutionTimeoutInSeconds: Integer
  RollbackMaximumBatchSize: 
    InferenceComponentCapacitySize
  WaitIntervalInSeconds: Integer

Properties

MaximumBatchSize

The batch size for each rolling step in the deployment process. For each step, SageMaker AI provisions capacity on the new endpoint fleet, routes traffic to that fleet, and terminates capacity on the old endpoint fleet. The value must be between 5% to 50% of the copy count of the inference component.

Required: No

Type: InferenceComponentCapacitySize

Update requires: No interruption

MaximumExecutionTimeoutInSeconds

The time limit for the total deployment. Exceeding this limit causes a timeout.

Required: No

Type: Integer

Minimum: 600

Maximum: 28800

Update requires: No interruption

RollbackMaximumBatchSize

The batch size for a rollback to the old endpoint fleet. If this field is absent, the value is set to the default, which is 100% of the total capacity. When the default is used, SageMaker AI provisions the entire capacity of the old fleet at once during rollback.

Required: No

Type: InferenceComponentCapacitySize

Update requires: No interruption

WaitIntervalInSeconds

The length of the baking period, during which SageMaker AI monitors alarms for each batch on the new fleet.

Required: No

Type: Integer

Minimum: 0

Maximum: 3600

Update requires: No interruption

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

InferenceComponentDeploymentConfig

InferenceComponentRuntimeConfig

Sur cette page

Sélectionner vos préférences de cookies

Personnaliser les préférences de cookies

Essentiels

Performances

Fonctionnels

Publicitaires

Impossible d'enregistrer les préférences concernant les cookies

AWS::SageMaker::InferenceComponent InferenceComponentRollingUpdatePolicy

Syntax

JSON

YAML

Properties

Sur cette page

Cette page vous a-t-elle été utile ?

Rubrique suivante :

Rubrique précédente :

Avez-vous besoin d’aide ?