Class ProductionVariantManagedInstanceScalingScaleInPolicy
- All Implemented Interfaces:
Serializable,SdkPojo,ToCopyableBuilder<ProductionVariantManagedInstanceScalingScaleInPolicy.Builder,ProductionVariantManagedInstanceScalingScaleInPolicy>
Configures the scale-in behavior for managed instance scaling.
- See Also:
-
Nested Class Summary
Nested ClassesModifier and TypeClassDescriptionstatic interface -
Method Summary
Modifier and TypeMethodDescriptionbuilder()final IntegerThe cooldown period, in minutes, after the last endpoint operation before the endpoint evaluates consolidation scale-in opportunities.final booleanfinal booleanequalsBySdkFields(Object obj) Indicates whether some other object is "equal to" this one by SDK fields.final <T> Optional<T> getValueForField(String fieldName, Class<T> clazz) final inthashCode()final IntegerThe maximum number of instances that the endpoint can terminate at a time during a consolidation scale-in operation.static Class<? extends ProductionVariantManagedInstanceScalingScaleInPolicy.Builder> strategy()The strategy for scaling in instances.final StringThe strategy for scaling in instances.Take this object and create a builder that contains all of the current property values of this object.final StringtoString()Returns a string representation of this object.Methods inherited from interface software.amazon.awssdk.utils.builder.ToCopyableBuilder
copy
-
Method Details
-
strategy
The strategy for scaling in instances.
- IDLE_RELEASE
-
Releases instances that have no hosted inference component copies.
- CONSOLIDATION
-
Consolidates inference component copies onto fewer instances to release more instances. Consolidation honors the scheduling configuration of each inference component. For example, if an inference component specifies Availability Zone balance, consolidation only proceeds when the resulting distribution does not increase the imbalance.
If the service returns an enum value that is not available in the current SDK version,
strategywill returnManagedInstanceScalingScaleInStrategy.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromstrategyAsString().- Returns:
- The strategy for scaling in instances.
- IDLE_RELEASE
-
Releases instances that have no hosted inference component copies.
- CONSOLIDATION
-
Consolidates inference component copies onto fewer instances to release more instances. Consolidation honors the scheduling configuration of each inference component. For example, if an inference component specifies Availability Zone balance, consolidation only proceeds when the resulting distribution does not increase the imbalance.
- See Also:
-
strategyAsString
The strategy for scaling in instances.
- IDLE_RELEASE
-
Releases instances that have no hosted inference component copies.
- CONSOLIDATION
-
Consolidates inference component copies onto fewer instances to release more instances. Consolidation honors the scheduling configuration of each inference component. For example, if an inference component specifies Availability Zone balance, consolidation only proceeds when the resulting distribution does not increase the imbalance.
If the service returns an enum value that is not available in the current SDK version,
strategywill returnManagedInstanceScalingScaleInStrategy.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available fromstrategyAsString().- Returns:
- The strategy for scaling in instances.
- IDLE_RELEASE
-
Releases instances that have no hosted inference component copies.
- CONSOLIDATION
-
Consolidates inference component copies onto fewer instances to release more instances. Consolidation honors the scheduling configuration of each inference component. For example, if an inference component specifies Availability Zone balance, consolidation only proceeds when the resulting distribution does not increase the imbalance.
- See Also:
-
maximumStepSize
The maximum number of instances that the endpoint can terminate at a time during a consolidation scale-in operation.
Default value:
1.- Returns:
- The maximum number of instances that the endpoint can terminate at a time during a consolidation scale-in
operation.
Default value:
1.
-
cooldownInMinutes
The cooldown period, in minutes, after the last endpoint operation before the endpoint evaluates consolidation scale-in opportunities.
Default value:
20.- Returns:
- The cooldown period, in minutes, after the last endpoint operation before the endpoint evaluates
consolidation scale-in opportunities.
Default value:
20.
-
toBuilder
Description copied from interface:ToCopyableBuilderTake this object and create a builder that contains all of the current property values of this object.- Specified by:
toBuilderin interfaceToCopyableBuilder<ProductionVariantManagedInstanceScalingScaleInPolicy.Builder,ProductionVariantManagedInstanceScalingScaleInPolicy> - Returns:
- a builder for type T
-
builder
-
serializableBuilderClass
public static Class<? extends ProductionVariantManagedInstanceScalingScaleInPolicy.Builder> serializableBuilderClass() -
hashCode
-
equals
-
equalsBySdkFields
Description copied from interface:SdkPojoIndicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in anSdkPojoclass, and is generated based on a service model.If an
SdkPojoclass does not have any inherited fields,equalsBySdkFieldsandequalsare essentially the same.- Specified by:
equalsBySdkFieldsin interfaceSdkPojo- Parameters:
obj- the object to be compared with- Returns:
- true if the other object equals to this object by sdk fields, false otherwise.
-
toString
-
getValueForField
-
sdkFields
-
sdkFieldNameToField
- Specified by:
sdkFieldNameToFieldin interfaceSdkPojo- Returns:
- The mapping between the field name and its corresponding field.
-