Interface ScalingPolicyMetric.Builder
- All Superinterfaces:
Buildable
,CopyableBuilder<ScalingPolicyMetric.Builder,
,ScalingPolicyMetric> SdkBuilder<ScalingPolicyMetric.Builder,
,ScalingPolicyMetric> SdkPojo
- Enclosing class:
ScalingPolicyMetric
public static interface ScalingPolicyMetric.Builder
extends SdkPojo, CopyableBuilder<ScalingPolicyMetric.Builder,ScalingPolicyMetric>
-
Method Summary
Modifier and TypeMethodDescriptioninvocationsPerInstance
(Integer invocationsPerInstance) The number of invocations sent to a model, normalized byInstanceCount
in each ProductionVariant.modelLatency
(Integer modelLatency) The interval of time taken by a model to respond as viewed from SageMaker.Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFields
-
Method Details
-
invocationsPerInstance
The number of invocations sent to a model, normalized by
InstanceCount
in each ProductionVariant.1/numberOfInstances
is sent as the value on each request, wherenumberOfInstances
is the number of active instances for the ProductionVariant behind the endpoint at the time of the request.- Parameters:
invocationsPerInstance
- The number of invocations sent to a model, normalized byInstanceCount
in each ProductionVariant.1/numberOfInstances
is sent as the value on each request, wherenumberOfInstances
is the number of active instances for the ProductionVariant behind the endpoint at the time of the request.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-
modelLatency
The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.
- Parameters:
modelLatency
- The interval of time taken by a model to respond as viewed from SageMaker. This interval includes the local communication times taken to send the request and to fetch the response from the container of a model and the time taken to complete the inference in the container.- Returns:
- Returns a reference to this object so that method calls can be chained together.
-