RecommendationJobContainerConfig
Specifies mandatory fields for running an Inference Recommender job directly in the
CreateInferenceRecommendationsJob
API. The fields specified in ContainerConfig
override the corresponding fields in the model package. Use
ContainerConfig
if you want to specify these fields for the recommendation job but don't want to edit them in your model package.
Contents
- DataInputConfig
-
Specifies the name and shape of the expected data inputs for your trained model with a JSON dictionary form. This field is used for optimizing your model using SageMaker Neo. For more information, see DataInputConfig.
Type: String
Length Constraints: Minimum length of 1. Maximum length of 1024.
Pattern:
[\S\s]+
Required: No
- Domain
-
The machine learning domain of the model and its components.
Valid Values:
COMPUTER_VISION | NATURAL_LANGUAGE_PROCESSING | MACHINE_LEARNING
Type: String
Required: No
- Framework
-
The machine learning framework of the container image.
Valid Values:
TENSORFLOW | PYTORCH | XGBOOST | SAGEMAKER-SCIKIT-LEARN
Type: String
Required: No
- FrameworkVersion
-
The framework version of the container image.
Type: String
Length Constraints: Minimum length of 3. Maximum length of 10.
Pattern:
[0-9]\.[A-Za-z0-9.-]+
Required: No
- NearestModelName
-
The name of a pre-trained machine learning model benchmarked by Amazon SageMaker Inference Recommender that matches your model.
Valid Values:
efficientnetb7 | unet | xgboost | faster-rcnn-resnet101 | nasnetlarge | vgg16 | inception-v3 | mask-rcnn | sagemaker-scikit-learn | densenet201-gluon | resnet18v2-gluon | xception | densenet201 | yolov4 | resnet152 | bert-base-cased | xceptionV1-keras | resnet50 | retinanet
Type: String
Required: No
- PayloadConfig
-
Specifies the
SamplePayloadUrl
and all other sample payload-related fields.Type: RecommendationJobPayloadConfig object
Required: No
- SupportedEndpointType
-
The endpoint type to receive recommendations for. By default this is null, and the results of the inference recommendation job return a combined list of both real-time and serverless benchmarks. By specifying a value for this field, you can receive a longer list of benchmarks for the desired endpoint type.
Type: String
Valid Values:
RealTime | Serverless
Required: No
- SupportedInstanceTypes
-
A list of the instance types that are used to generate inferences in real-time.
Type: Array of strings
Required: No
- SupportedResponseMIMETypes
-
The supported MIME types for the output data.
Type: Array of strings
Length Constraints: Maximum length of 1024.
Pattern:
^[-\w]+\/.+$
Required: No
- Task
-
The machine learning task that the model accomplishes.
Valid Values:
IMAGE_CLASSIFICATION | OBJECT_DETECTION | TEXT_GENERATION | IMAGE_SEGMENTATION | FILL_MASK | CLASSIFICATION | REGRESSION | OTHER
Type: String
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: