CreateComputeQuota - Amazon SageMaker

CreateComputeQuota

Create compute allocation definition. This defines how compute is allocated, shared, and borrowed for specified entities. Specifically, how to lend and borrow idle compute and assign a fair-share weight to the specified entities.

Request Syntax

{ "ActivationState": "string", "ClusterArn": "string", "ComputeQuotaConfig": { "ComputeQuotaResources": [ { "Count": number, "InstanceType": "string" } ], "PreemptTeamTasks": "string", "ResourceSharingConfig": { "BorrowLimit": number, "Strategy": "string" } }, "ComputeQuotaTarget": { "FairShareWeight": number, "TeamName": "string" }, "Description": "string", "Name": "string", "Tags": [ { "Key": "string", "Value": "string" } ] }

Request Parameters

For information about the parameters that are common to all actions, see Common Parameters.

The request accepts the following data in JSON format.

ActivationState

The state of the compute allocation being described. Use to enable or disable compute allocation.

Default is Enabled.

Type: String

Valid Values: Enabled | Disabled

Required: No

ClusterArn

ARN of the cluster.

Type: String

Length Constraints: Maximum length of 256.

Pattern: ^arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:cluster/[a-z0-9]{12}$

Required: Yes

ComputeQuotaConfig

Configuration of the compute allocation definition. This includes the resource sharing option, and the setting to preempt low priority tasks.

Type: ComputeQuotaConfig object

Required: Yes

ComputeQuotaTarget

The target entity to allocate compute resources to.

Type: ComputeQuotaTarget object

Required: Yes

Description

Description of the compute allocation definition.

Type: String

Length Constraints: Maximum length of 1024.

Pattern: [\p{L}\p{M}\p{Z}\p{S}\p{N}\p{P}]*

Required: No

Name

Name to the compute allocation definition.

Type: String

Length Constraints: Minimum length of 1. Maximum length of 63.

Pattern: ^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}$

Required: Yes

Tags

Tags of the compute allocation definition.

Type: Array of Tag objects

Array Members: Minimum number of 0 items. Maximum number of 50 items.

Required: No

Response Syntax

{ "ComputeQuotaArn": "string", "ComputeQuotaId": "string" }

Response Elements

If the action is successful, the service sends back an HTTP 200 response.

The following data is returned in JSON format by the service.

ComputeQuotaArn

ARN of the compute allocation definition.

Type: String

Length Constraints: Maximum length of 2048.

Pattern: arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:compute-quota/[a-z0-9]{12}$

ComputeQuotaId

ID of the compute allocation definition.

Type: String

Pattern: ^[a-z0-9]{12}$

Errors

For information about the errors that are common to all actions, see Common Errors.

ConflictException

There was a conflict when you attempted to modify a SageMaker entity such as an Experiment or Artifact.

HTTP Status Code: 400

ResourceLimitExceeded

You have exceeded an SageMaker resource limit. For example, you might have too many training jobs created.

HTTP Status Code: 400

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: