RecommendationJobStoppingConditions
Specifies conditions for stopping a job. When a job reaches a stopping condition limit, SageMaker ends the job.
Contents
- FlatInvocations
-
Stops a load test when the number of invocations (TPS) peaks and flattens, which means that the instance has reached capacity. The default value is
Stop
. If you want the load test to continue after invocations have flattened, set the value toContinue
.Type: String
Valid Values:
Continue | Stop
Required: No
- MaxInvocations
-
The maximum number of requests per minute expected for the endpoint.
Type: Integer
Required: No
- ModelLatencyThresholds
-
The interval of time taken by a model to respond as viewed from SageMaker. The interval includes the local communication time taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
Type: Array of ModelLatencyThreshold objects
Array Members: Fixed number of 1 item.
Required: No
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: