Class MetricsConfig
- All Implemented Interfaces:
Serializable,SdkPojo,ToCopyableBuilder<MetricsConfig.Builder,MetricsConfig>
The configuration for Utilization metrics.
- See Also:
-
Nested Class Summary
Nested Classes -
Method Summary
Modifier and TypeMethodDescriptionstatic MetricsConfig.Builderbuilder()final BooleanIndicates whether detailed observability is enabled for the endpoint.final BooleanSpecifies whether to enable enhanced metrics for the endpoint.final booleanfinal booleanequalsBySdkFields(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.final <T> Optional<T> getValueForField(String fieldName, Class<T> clazz) final inthashCode()final IntegerThe interval, in seconds, at which metrics are published to Amazon CloudWatch.static Class<? extends MetricsConfig.Builder> Take this object and create a builder that contains all of the current property values of this object.final StringtoString()Returns a string representation of this object.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
enableEnhancedMetrics
Specifies whether to enable enhanced metrics for the endpoint. Enhanced metrics provide utilization and invocation data at instance and container granularity. Container granularity is supported for Inference Components. The default is
False.- Returns:
- Specifies whether to enable enhanced metrics for the endpoint. Enhanced metrics provide utilization and
invocation data at instance and container granularity. Container granularity is supported for Inference
Components. The default is
False.
-
enableDetailedObservability
Indicates whether detailed observability is enabled for the endpoint. When set to
True, the following metrics are published at the configured frequency:-
Container-level inference metrics scraped from the container's Prometheus endpoint (such as request latency, error counts, and throughput). Available metrics vary by framework.
-
Per-GPU metrics (utilization, memory, and temperature) attributed to individual inference components.
-
Per-instance host metrics (CPU, memory, and disk utilization).
-
Inference component placement metrics (copy count per Availability Zone).
For first-party and Deep Learning Containers (DLC), the Prometheus endpoint path is determined automatically. For Bring-Your-Own-Container (BYOC) cases, you can optionally set
ContainerMetricsConfigto specify a custom endpoint path. If not specified, the default path/metricson port8080is used.When set to
False, these additional metrics are not published. Standard invocation and utilization metrics controlled byEnableEnhancedMetricsare unaffected.The default value for new endpoint configurations is
True. For existing endpoint configurations created before this feature, the value isFalseunless explicitly set.- Returns:
- Indicates whether detailed observability is enabled for the endpoint. When set to
True, the following metrics are published at the configured frequency:-
Container-level inference metrics scraped from the container's Prometheus endpoint (such as request latency, error counts, and throughput). Available metrics vary by framework.
-
Per-GPU metrics (utilization, memory, and temperature) attributed to individual inference components.
-
Per-instance host metrics (CPU, memory, and disk utilization).
-
Inference component placement metrics (copy count per Availability Zone).
For first-party and Deep Learning Containers (DLC), the Prometheus endpoint path is determined automatically. For Bring-Your-Own-Container (BYOC) cases, you can optionally set
ContainerMetricsConfigto specify a custom endpoint path. If not specified, the default path/metricson port8080is used.When set to
False, these additional metrics are not published. Standard invocation and utilization metrics controlled byEnableEnhancedMetricsare unaffected.The default value for new endpoint configurations is
True. For existing endpoint configurations created before this feature, the value isFalseunless explicitly set. -
-
-
metricPublishFrequencyInSeconds
The interval, in seconds, at which metrics are published to Amazon CloudWatch. Defaults to
60. Valid values:10,30,60,120,180,240,300.When
EnableEnhancedMetricsis set toFalse, this interval applies to utilization metrics only. Invocation metrics continue to be published at the default 60-second interval. WhenEnableEnhancedMetricsis set toTrue, this interval applies to both utilization and invocation metrics.When
EnableDetailedObservabilityis set toTrue, this interval applies to per-GPU metrics, per-instance host metrics, container metrics, and fleet-level inference component lifecycle and placement metrics.- Returns:
- The interval, in seconds, at which metrics are published to Amazon CloudWatch. Defaults to
60. Valid values:10,30,60,120,180,240,300.When
EnableEnhancedMetricsis set toFalse, this interval applies to utilization metrics only. Invocation metrics continue to be published at the default 60-second interval. WhenEnableEnhancedMetricsis set toTrue, this interval applies to both utilization and invocation metrics.When
EnableDetailedObservabilityis set toTrue, this interval applies to per-GPU metrics, per-instance host metrics, container metrics, and fleet-level inference component lifecycle and placement metrics.
-
toBuilder
Description copied from interface:ToCopyableBuilderTake this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilderin interfaceToCopyableBuilder<MetricsConfig.Builder,MetricsConfig> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
-
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface:SdkPojoIndicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojoclass, and is generated based on a service model.If an
SdkPojoclass does not have any inherited fields,equalsBySdkFieldsandequalsare essentially the same.- Specified by:
equalsBySdkFieldsin interfaceSdkPojo- Parameters:
obj- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-
sdkFieldNameToField
- Specified by:
sdkFieldNameToFieldin interfaceSdkPojo- Returns:
- The mapping between the field name and its corresponding field.
-