Interface ProductionVariant
- All Superinterfaces:
software.amazon.jsii.JsiiSerializable
- All Known Implementing Classes:
ProductionVariant.Jsii$Proxy
@Generated(value="jsii-pacmak/1.103.1 (build bef2dea)",
date="2024-10-25T14:21:24.593Z")
@Stability(Stable)
public interface ProductionVariant
extends software.amazon.jsii.JsiiSerializable
Identifies a model that you want to host and the resources to deploy for hosting it.
Example:
// The code below shows an example of how to instantiate this type. // The values are placeholders you should change. import software.amazon.awscdk.services.ec2.*; import software.amazon.awscdk.services.stepfunctions.tasks.*; AcceleratorType acceleratorType; InstanceType instanceType; ProductionVariant productionVariant = ProductionVariant.builder() .instanceType(instanceType) .modelName("modelName") .variantName("variantName") // the properties below are optional .acceleratorType(acceleratorType) .initialInstanceCount(123) .initialVariantWeight(123) .build();
- See Also:
-
Nested Class Summary
Modifier and TypeInterfaceDescriptionstatic final class
A builder forProductionVariant
static final class
An implementation forProductionVariant
-
Method Summary
Modifier and TypeMethodDescriptionstatic ProductionVariant.Builder
builder()
default AcceleratorType
The size of the Elastic Inference (EI) instance to use for the production variant.default Number
Number of instances to launch initially.default Number
Determines initial traffic distribution among all of the models that you specify in the endpoint configuration.The ML compute instance type.The name of the model that you want to host.The name of the production variant.Methods inherited from interface software.amazon.jsii.JsiiSerializable
$jsii$toJson
-
Method Details
-
getInstanceType
The ML compute instance type. -
getModelName
The name of the model that you want to host.This is the name that you specified when creating the model.
-
getVariantName
The name of the production variant. -
getAcceleratorType
The size of the Elastic Inference (EI) instance to use for the production variant.Default: - None
-
getInitialInstanceCount
Number of instances to launch initially.Default: - 1
-
getInitialVariantWeight
Determines initial traffic distribution among all of the models that you specify in the endpoint configuration.Default: - 1.0
-
builder
- Returns:
- a
ProductionVariant.Builder
ofProductionVariant
-