AWS::SageMaker::InferenceComponent InferenceComponentRollingUpdatePolicy

Specifies a rolling deployment strategy for updating a SageMaker AI inference component.

Syntax

To declare this entity in your AWS CloudFormation template, use the following syntax:

JSON


{
  "MaximumBatchSize" : InferenceComponentCapacitySize,
  "MaximumExecutionTimeoutInSeconds" : Integer,
  "RollbackMaximumBatchSize" : InferenceComponentCapacitySize,
  "WaitIntervalInSeconds" : Integer
}

YAML


  MaximumBatchSize: 
    InferenceComponentCapacitySize
  MaximumExecutionTimeoutInSeconds: Integer
  RollbackMaximumBatchSize: 
    InferenceComponentCapacitySize
  WaitIntervalInSeconds: Integer

Properties

MaximumBatchSize

The batch size for each rolling step in the deployment process. For each step, SageMaker AI provisions capacity on the new endpoint fleet, routes traffic to that fleet, and terminates capacity on the old endpoint fleet. The value must be between 5% and 50% of the copy count of the inference component.

Required: No

Type: InferenceComponentCapacitySize

Update requires: No interruption
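As a rough illustration of the batch size limits described above, the fragment below sketches one way MaximumBatchSize might be set. It assumes that InferenceComponentCapacitySize takes a Type of either COPY_COUNT or CAPACITY_PERCENT together with an integer Value; those field names and the figures shown come from outside this page, so check them against the InferenceComponentCapacitySize reference before relying on them.

```yaml
# Sketch only. Assumes an inference component running 10 copies, so the
# documented 5%-50% bound allows a batch of at most 5 copies.
MaximumBatchSize:
  Type: COPY_COUNT   # assumed alternative: CAPACITY_PERCENT
  Value: 2           # 2 of 10 copies = 20%, inside the 5%-50% range
```

With these assumed values, each rolling step would replace two copies at a time. The same batch could be expressed as a percentage by switching Type to CAPACITY_PERCENT and setting Value to 20.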

MaximumExecutionTimeoutInSeconds

The time limit, in seconds, for the entire deployment. If the deployment runs longer than this limit, it times out.

Required: No

Type: Integer

Minimum: 600

Maximum: 28800

Update requires: No interruption

RollbackMaximumBatchSize

The batch size for a rollback to the old endpoint fleet. If this field is absent, the value is set to the default, which is 100% of the total capacity. When the default is used, SageMaker AI provisions the entire capacity of the old fleet at once during rollback.

Required: No

Type: InferenceComponentCapacitySize

Update requires: No interruption

WaitIntervalInSeconds

The length of the baking period, in seconds, during which SageMaker AI monitors alarms for each batch on the new fleet.

Required: No

Type: Integer

Minimum: 0

Maximum: 3600

Update requires: No interruption
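The four properties can be combined as in the following sketch. It is illustrative rather than a recommended configuration: the parent key RollingUpdatePolicy (as it would appear under the inference component's deployment configuration) and the Type and Value fields of InferenceComponentCapacitySize are assumptions not stated on this page, and the numbers were chosen only to fall inside the documented ranges.

```yaml
# Hypothetical fragment; values are illustrative and within documented ranges.
RollingUpdatePolicy:
  MaximumBatchSize:
    Type: CAPACITY_PERCENT    # assumed field; must resolve to 5%-50% of copies
    Value: 25
  MaximumExecutionTimeoutInSeconds: 3600   # allowed range: 600-28800
  RollbackMaximumBatchSize:
    Type: CAPACITY_PERCENT    # omitting this property defaults to 100%
    Value: 100
  WaitIntervalInSeconds: 300               # allowed range: 0-3600
```

In this sketch, every batch gets a five-minute baking period and the whole deployment must finish within one hour. Setting RollbackMaximumBatchSize to 100% mirrors the documented default, so a rollback provisions the old fleet's entire capacity at once.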
