Interface InferenceComponentAvailabilityZoneBalance.Builder

  • Method Details

    • enforcementMode

      Determines how strictly the Availability Zone balance constraint is enforced.

      PERMISSIVE

      The endpoint attempts to balance copies across Availability Zones but proceeds with scheduling even if balance can't be achieved due to available capacity or instance distribution across Availability Zones.

      Parameters:
      enforcementMode - Determines how strictly the Availability Zone balance constraint is enforced.

      PERMISSIVE

      The endpoint attempts to balance copies across Availability Zones but proceeds with scheduling even if balance can't be achieved due to available capacity or instance distribution across Availability Zones.

      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also:
    • enforcementMode

      Determines how strictly the Availability Zone balance constraint is enforced.

      PERMISSIVE

      The endpoint attempts to balance copies across Availability Zones but proceeds with scheduling even if balance can't be achieved due to available capacity or instance distribution across Availability Zones.

      Parameters:
      enforcementMode - Determines how strictly the Availability Zone balance constraint is enforced.

      PERMISSIVE

      The endpoint attempts to balance copies across Availability Zones but proceeds with scheduling even if balance can't be achieved due to available capacity or instance distribution across Availability Zones.

      Returns:
      Returns a reference to this object so that method calls can be chained together.
      See Also:
    • maxImbalance

      The maximum allowed difference in the number of inference component copies between any two Availability Zones. This parameter applies only when the endpoint has instances across two or more Availability Zones. A copy placement is allowed if it reduces imbalance or the resulting imbalance is within this value.

      Default value: 0.

      Parameters:
      maxImbalance - The maximum allowed difference in the number of inference component copies between any two Availability Zones. This parameter applies only when the endpoint has instances across two or more Availability Zones. A copy placement is allowed if it reduces imbalance or the resulting imbalance is within this value.

      Default value: 0.

      Returns:
      Returns a reference to this object so that method calls can be chained together.