AWS::SageMaker::InferenceComponent InferenceComponentRollingUpdatePolicy

RSS

Fokusmodus

AWS::SageMaker::InferenceComponent InferenceComponentRollingUpdatePolicy - AWS CloudFormation

Syntax Properties

Diese Seite wurde nicht in Ihre Sprache übersetzt. Übersetzung anfragen

Specifies a rolling deployment strategy for updating a SageMaker AI inference component.

Syntax

To declare this entity in your AWS CloudFormation template, use the following syntax:

JSON


{
  "MaximumBatchSize" : InferenceComponentCapacitySize,
  "MaximumExecutionTimeoutInSeconds" : Integer,
  "RollbackMaximumBatchSize" : InferenceComponentCapacitySize,
  "WaitIntervalInSeconds" : Integer
}

YAML


  MaximumBatchSize: 
    InferenceComponentCapacitySize
  MaximumExecutionTimeoutInSeconds: Integer
  RollbackMaximumBatchSize: 
    InferenceComponentCapacitySize
  WaitIntervalInSeconds: Integer

Properties

MaximumBatchSize

The batch size for each rolling step in the deployment process. For each step, SageMaker AI provisions capacity on the new endpoint fleet, routes traffic to that fleet, and terminates capacity on the old endpoint fleet. The value must be between 5% to 50% of the copy count of the inference component.

Required: No

Type: InferenceComponentCapacitySize

Update requires: No interruption

MaximumExecutionTimeoutInSeconds

The time limit for the total deployment. Exceeding this limit causes a timeout.

Required: No

Type: Integer

Minimum: 600

Maximum: 28800

Update requires: No interruption

RollbackMaximumBatchSize

The batch size for a rollback to the old endpoint fleet. If this field is absent, the value is set to the default, which is 100% of the total capacity. When the default is used, SageMaker AI provisions the entire capacity of the old fleet at once during rollback.

Required: No

Type: InferenceComponentCapacitySize

Update requires: No interruption

WaitIntervalInSeconds

The length of the baking period, during which SageMaker AI monitors alarms for each batch on the new fleet.

Required: No

Type: Integer

Minimum: 0

Maximum: 3600

Update requires: No interruption

Warning Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

InferenceComponentDeploymentConfig

InferenceComponentRuntimeConfig

Auf dieser Seite

Wählen Sie Ihre Cookie-Einstellungen aus

Cookie-Einstellungen anpassen

Essenziell

Leistung

Funktional

Werbung

Cookie-Einstellungen konnten nicht gespeichert werden

AWS::SageMaker::InferenceComponent InferenceComponentRollingUpdatePolicy

Syntax

JSON

YAML

Properties

Auf dieser Seite

Hat Ihnen diese Seite geholfen?

Nächstes Thema:

Vorheriges Thema:

Brauchen Sie Hilfe?