Class ProductionVariantManagedInstanceScalingScaleInPolicy

java.lang.Object
software.amazon.awssdk.services.sagemaker.model.ProductionVariantManagedInstanceScalingScaleInPolicy
All Implemented Interfaces:
Serializable, SdkPojo, ToCopyableBuilder<ProductionVariantManagedInstanceScalingScaleInPolicy.Builder,ProductionVariantManagedInstanceScalingScaleInPolicy>

@Generated("software.amazon.awssdk:codegen") public final class ProductionVariantManagedInstanceScalingScaleInPolicy extends Object implements SdkPojo, Serializable, ToCopyableBuilder<ProductionVariantManagedInstanceScalingScaleInPolicy.Builder,ProductionVariantManagedInstanceScalingScaleInPolicy>

Configures the scale-in behavior for managed instance scaling.

See Also:
  • Method Details

    • strategy

      public final ManagedInstanceScalingScaleInStrategy strategy()

      The strategy for scaling in instances.

      IDLE_RELEASE

      Releases instances that have no hosted inference component copies.

      CONSOLIDATION

      Consolidates inference component copies onto fewer instances to release more instances. Consolidation honors the scheduling configuration of each inference component. For example, if an inference component specifies Availability Zone balance, consolidation only proceeds when the resulting distribution does not increase the imbalance.

      If the service returns an enum value that is not available in the current SDK version, strategy will return ManagedInstanceScalingScaleInStrategy.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from strategyAsString().

      Returns:
      The strategy for scaling in instances.

      IDLE_RELEASE

      Releases instances that have no hosted inference component copies.

      CONSOLIDATION

      Consolidates inference component copies onto fewer instances to release more instances. Consolidation honors the scheduling configuration of each inference component. For example, if an inference component specifies Availability Zone balance, consolidation only proceeds when the resulting distribution does not increase the imbalance.

      See Also:
    • strategyAsString

      public final String strategyAsString()

      The strategy for scaling in instances.

      IDLE_RELEASE

      Releases instances that have no hosted inference component copies.

      CONSOLIDATION

      Consolidates inference component copies onto fewer instances to release more instances. Consolidation honors the scheduling configuration of each inference component. For example, if an inference component specifies Availability Zone balance, consolidation only proceeds when the resulting distribution does not increase the imbalance.

      If the service returns an enum value that is not available in the current SDK version, strategy will return ManagedInstanceScalingScaleInStrategy.UNKNOWN_TO_SDK_VERSION. The raw value returned by the service is available from strategyAsString().

      Returns:
      The strategy for scaling in instances.

      IDLE_RELEASE

      Releases instances that have no hosted inference component copies.

      CONSOLIDATION

      Consolidates inference component copies onto fewer instances to release more instances. Consolidation honors the scheduling configuration of each inference component. For example, if an inference component specifies Availability Zone balance, consolidation only proceeds when the resulting distribution does not increase the imbalance.

      See Also:
    • maximumStepSize

      public final Integer maximumStepSize()

      The maximum number of instances that the endpoint can terminate at a time during a consolidation scale-in operation.

      Default value: 1.

      Returns:
      The maximum number of instances that the endpoint can terminate at a time during a consolidation scale-in operation.

      Default value: 1.

    • cooldownInMinutes

      public final Integer cooldownInMinutes()

      The cooldown period, in minutes, after the last endpoint operation before the endpoint evaluates consolidation scale-in opportunities.

      Default value: 20.

      Returns:
      The cooldown period, in minutes, after the last endpoint operation before the endpoint evaluates consolidation scale-in opportunities.

      Default value: 20.

    • toBuilder

      Description copied from interface: ToCopyableBuilder
      Take this object and create a builder that contains all of the current property values of this object.
      Specified by:
      toBuilder in interface ToCopyableBuilder<ProductionVariantManagedInstanceScalingScaleInPolicy.Builder,ProductionVariantManagedInstanceScalingScaleInPolicy>
      Returns:
      a builder for type T
    • builder

    • serializableBuilderClass

      public static Class<? extends ProductionVariantManagedInstanceScalingScaleInPolicy.Builder> serializableBuilderClass()
    • hashCode

      public final int hashCode()
      Overrides:
      hashCode in class Object
    • equals

      public final boolean equals(Object obj)
      Overrides:
      equals in class Object
    • equalsBySdkFields

      public final boolean equalsBySdkFields(Object obj)
      Description copied from interface: SdkPojo
      Indicates whether some other object is "equal to" this one by SDK fields. An SDK field is a modeled, non-inherited field in an SdkPojo class, and is generated based on a service model.

      If an SdkPojo class does not have any inherited fields, equalsBySdkFields and equals are essentially the same.

      Specified by:
      equalsBySdkFields in interface SdkPojo
      Parameters:
      obj - the object to be compared with
      Returns:
      true if the other object equals to this object by sdk fields, false otherwise.
    • toString

      public final String toString()
      Returns a string representation of this object. This is useful for testing and debugging. Sensitive data will be redacted from this string using a placeholder value.
      Overrides:
      toString in class Object
    • getValueForField

      public final <T> Optional<T> getValueForField(String fieldName, Class<T> clazz)
    • sdkFields

      public final List<SdkField<?>> sdkFields()
      Specified by:
      sdkFields in interface SdkPojo
      Returns:
      List of SdkField in this POJO. May be empty list but should never be null.
    • sdkFieldNameToField

      public final Map<String,SdkField<?>> sdkFieldNameToField()
      Specified by:
      sdkFieldNameToField in interface SdkPojo
      Returns:
      The mapping between the field name and its corresponding field.