AWS::SageMaker::InferenceComponent InferenceComponentRollingUpdatePolicy


Specifies a rolling deployment strategy for updating a SageMaker AI inference component.

Syntax

To declare this entity in your AWS CloudFormation template, use the following syntax:

JSON


{
  "MaximumBatchSize" : InferenceComponentCapacitySize,
  "MaximumExecutionTimeoutInSeconds" : Integer,
  "RollbackMaximumBatchSize" : InferenceComponentCapacitySize,
  "WaitIntervalInSeconds" : Integer
}

YAML


  MaximumBatchSize: 
    InferenceComponentCapacitySize
  MaximumExecutionTimeoutInSeconds: Integer
  RollbackMaximumBatchSize: 
    InferenceComponentCapacitySize
  WaitIntervalInSeconds: Integer

Properties

MaximumBatchSize

The batch size for each rolling step in the deployment process. For each step, SageMaker AI provisions capacity on the new endpoint fleet, routes traffic to that fleet, and terminates capacity on the old endpoint fleet. The value must be between 5% and 50% of the copy count of the inference component.

Required: No

Type: InferenceComponentCapacitySize

Update requires: No interruption
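
As an illustration of the 5% to 50% bound, the fragment below sketches a MaximumBatchSize for an inference component assumed to run 20 copies. The Type and Value keys are taken from the InferenceComponentCapacitySize property type, which is documented on its own page; the copy count of 20 and the chosen value are hypothetical.

```yaml
# Hypothetical scenario: the inference component runs 20 copies.
# 5% of 20 = 1 copy and 50% of 20 = 10 copies,
# so a copy-count batch size should fall between 1 and 10.
MaximumBatchSize:
  Type: COPY_COUNT   # assumed alternative: CAPACITY_PERCENT
  Value: 4           # 4 of 20 copies = 20%, inside the allowed range
```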

MaximumExecutionTimeoutInSeconds

The time limit, in seconds, for the entire deployment. If the deployment runs longer than this limit, it times out.

Required: No

Type: Integer

Minimum: 600

Maximum: 28800

Update requires: No interruption

RollbackMaximumBatchSize

The batch size for a rollback to the old endpoint fleet. If this field is absent, the value is set to the default, which is 100% of the total capacity. When the default is used, SageMaker AI provisions the entire capacity of the old fleet at once during rollback.

Required: No

Type: InferenceComponentCapacitySize

Update requires: No interruption

WaitIntervalInSeconds

The length of the baking period, in seconds, during which SageMaker AI monitors alarms for each batch on the new fleet.

Required: No

Type: Integer

Minimum: 0

Maximum: 3600

Update requires: No interruption
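
Putting the four properties together, a rolling update policy might look like the following sketch. It assumes the policy is nested under a RollingUpdatePolicy key in the inference component's deployment configuration, and that InferenceComponentCapacitySize takes Type and Value keys. All numeric values are illustrative, not recommendations.

```yaml
# Illustrative values only; tune them for your own workload.
RollingUpdatePolicy:
  MaximumBatchSize:
    Type: CAPACITY_PERCENT
    Value: 25                               # within the 5% to 50% range
  WaitIntervalInSeconds: 300                # baking period per batch (0 to 3600)
  MaximumExecutionTimeoutInSeconds: 3600    # total deployment limit (600 to 28800)
  RollbackMaximumBatchSize:
    Type: CAPACITY_PERCENT
    Value: 100                              # same as the documented default: roll back all at once
```

With 25% batches, the update would proceed in roughly four steps. Four baking periods of 300 seconds add up to 1200 seconds, so the 3600-second timeout in this sketch leaves room for provisioning time in each step.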
