InferenceComponentRuntimeConfig
Runtime settings for a model that is deployed with an inference component.
Contents
- CopyCount
-
The number of runtime copies of the model container to deploy with the inference component. Each copy can serve inference requests.
Type: Integer
Valid Range: Minimum value of 0.
Required: Yes
See Also
For more information about using this API in one of the language-specific AWS SDKs, see the following: