Interface CapacityLimit.Builder
- All Superinterfaces:
Buildable, CopyableBuilder<CapacityLimit.Builder,CapacityLimit>, SdkBuilder<CapacityLimit.Builder,CapacityLimit>, SdkPojo
- Enclosing class:
CapacityLimit
-
Method Summary
Modifier and Type | Method | Description
CapacityLimit.Builder | capacityUnit(String capacityUnit) | The unit of measure for the capacity limit, which defines how maxCapacity is interpreted.
CapacityLimit.Builder | maxCapacity(Integer maxCapacity) | The maximum capacity available for the service environment.
Methods inherited from interface software.amazon.awssdk.utils.builder.CopyableBuilder
copy
Methods inherited from interface software.amazon.awssdk.utils.builder.SdkBuilder
applyMutation, build
Methods inherited from interface software.amazon.awssdk.core.SdkPojo
equalsBySdkFields, sdkFieldNameToField, sdkFields
-
Method Details
-
maxCapacity
The maximum capacity available for the service environment. For a quota management enabled service environment, this value represents the maximum quantity of a particular resource type (specified by capacityUnit) that can be allocated to service jobs. For other service environments, this value represents the maximum quantity of all resources that can be allocated to service jobs.
For example, if maxCapacity=50 and capacityUnit=NUM_INSTANCES, you can run up to 50 instances concurrently. If you run 5 SageMaker Training jobs that each use 10 instances, a subsequent job requiring 10 instances waits in the queue until capacity is available. In a quota management enabled service environment with capacityUnit=ml.m5.large, only ml.m5.large instances count against this limit, and jobs requiring other instance types wait until a matching capacity limit is configured.
- Parameters:
maxCapacity - The maximum capacity available for the service environment, interpreted according to capacityUnit as described above.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
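The queueing arithmetic in the example above can be illustrated with a small stand-alone model. This is only a sketch: CapacityLimitSketch and its methods are hypothetical names, not part of the SDK, and the first-in-first-out handling of waiting jobs is an assumption made for illustration rather than a description of how the service schedules jobs.

```java
import java.util.ArrayDeque;
import java.util.Deque;

/** Hypothetical model of a NUM_INSTANCES capacity limit; not part of the SDK. */
final class CapacityLimitSketch {
    private final int maxCapacity;   // plays the role of maxCapacity
    private int inUse = 0;           // instances currently allocated to running jobs
    private final Deque<Integer> waiting = new ArrayDeque<>(); // queued instance requests

    CapacityLimitSketch(int maxCapacity) {
        this.maxCapacity = maxCapacity;
    }

    /** Returns true if the job can start now, false if it has to wait in the queue. */
    boolean submit(int instancesNeeded) {
        if (inUse + instancesNeeded <= maxCapacity) {
            inUse += instancesNeeded;
            return true;
        }
        waiting.addLast(instancesNeeded);
        return false;
    }

    /** Releases a finished job's instances, then starts queued jobs that now fit (FIFO assumption). */
    void finish(int instancesReleased) {
        inUse -= instancesReleased;
        while (!waiting.isEmpty() && inUse + waiting.peekFirst() <= maxCapacity) {
            inUse += waiting.pollFirst();
        }
    }

    int inUse() { return inUse; }

    int queued() { return waiting.size(); }
}
```

With a limit of 50, five calls to submit(10) all start, a sixth submit(10) is queued, and it starts once one of the running jobs calls finish(10).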
-
capacityUnit
The unit of measure for the capacity limit, which defines how maxCapacity is interpreted. For SAGEMAKER_TRAINING jobs in a quota management enabled service environment, specify the instance type (for example, ml.m5.large). Otherwise, use NUM_INSTANCES.
- Parameters:
capacityUnit - The unit of measure for the capacity limit: an instance type such as ml.m5.large, or NUM_INSTANCES, as described above.
- Returns:
- Returns a reference to this object so that method calls can be chained together.
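The rule for choosing a capacityUnit value can be written down as a small helper. This is a minimal sketch, assuming the caller already knows whether the service environment has quota management enabled and is submitting SAGEMAKER_TRAINING jobs; CapacityUnitSketch and chooseCapacityUnit are hypothetical names, not part of the SDK.

```java
/** Hypothetical helper that applies the capacityUnit rule; not part of the SDK. */
final class CapacityUnitSketch {
    static final String NUM_INSTANCES = "NUM_INSTANCES";

    /**
     * Returns the instance type when quota management is enabled (assumed here to
     * be a SAGEMAKER_TRAINING service environment), otherwise NUM_INSTANCES.
     */
    static String chooseCapacityUnit(boolean quotaManagementEnabled, String instanceType) {
        if (quotaManagementEnabled) {
            if (instanceType == null || instanceType.isEmpty()) {
                throw new IllegalArgumentException(
                        "instanceType is required when quota management is enabled");
            }
            return instanceType; // for example, "ml.m5.large"
        }
        return NUM_INSTANCES;
    }
}
```

The resulting string is the kind of value passed to capacityUnit(String), chained with maxCapacity(Integer) and finished with the inherited build(), for instance CapacityLimit.builder().maxCapacity(50).capacityUnit(unit).build(), assuming the static builder() factory that the SDK's model classes conventionally expose.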
-