View a markdown version of this page

CreateAIWorkloadConfig - Amazon SageMaker

CreateAIWorkloadConfig

Creates a reusable AI workload configuration that defines datasets, data sources, and benchmark tool settings for consistent performance testing of generative AI inference deployments on Amazon SageMaker AI.

Request Syntax

{ "AIWorkloadConfigName": "string", "AIWorkloadConfigs": { "WorkloadSpec": { ... } }, "DatasetConfig": { ... }, "Tags": [ { "Key": "string", "Value": "string" } ] }

Request Parameters

For information about the parameters that are common to all actions, see Common Parameters.

The request accepts the following data in JSON format.

AIWorkloadConfigName

The name of the AI workload configuration. The name must be unique within your AWS account in the current AWS Region.

Type: String

Length Constraints: Minimum length of 1. Maximum length of 63.

Pattern: [a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

Required: Yes

AIWorkloadConfigs

The benchmark tool configuration and workload specification. Provide the specification as an inline YAML or JSON string.

Type: AIWorkloadConfigs object

Required: No

DatasetConfig

The dataset configuration for the workload. Specify input data channels with their data sources for benchmark workloads.

Type: AIDatasetConfig object

Note: This object is a Union. Only one member of this object can be specified or returned.

Required: No

Tags

The metadata that you apply to AWS resources to help you categorize and organize them. Each tag consists of a key and a value, both of which you define. For more information, see Tagging AWS Resources in the AWS General Reference.

Type: Array of Tag objects

Array Members: Minimum number of 0 items. Maximum number of 50 items.

Required: No

Response Syntax

{ "AIWorkloadConfigArn": "string" }

Response Elements

If the action is successful, the service sends back an HTTP 200 response.

The following data is returned in JSON format by the service.

AIWorkloadConfigArn

The Amazon Resource Name (ARN) of the created AI workload configuration.

Type: String

Length Constraints: Minimum length of 0. Maximum length of 256.

Pattern: arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:ai-workload-config/[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

Errors

For information about the errors that are common to all actions, see Common Error Types.

ResourceInUse

Resource being accessed is in use.

HTTP Status Code: 400

ResourceLimitExceeded

You have exceeded an SageMaker resource limit. For example, you might have too many training jobs created.

HTTP Status Code: 400

See Also

For more information about using this API in one of the language-specific AWS SDKs, see the following: