public static interface ProductionVariant.Builder extends SdkPojo, CopyableBuilder<ProductionVariant.Builder,ProductionVariant>
Modifier and Type | Method and Description |
---|---|
ProductionVariant.Builder |
acceleratorType(ProductionVariantAcceleratorType acceleratorType)
The size of the Elastic Inference (EI) instance to use for the production variant.
|
ProductionVariant.Builder |
acceleratorType(String acceleratorType)
The size of the Elastic Inference (EI) instance to use for the production variant.
|
ProductionVariant.Builder |
initialInstanceCount(Integer initialInstanceCount)
Number of instances to launch initially.
|
ProductionVariant.Builder |
initialVariantWeight(Float initialVariantWeight)
Determines initial traffic distribution among all of the models that you specify in the endpoint
configuration.
|
ProductionVariant.Builder |
instanceType(ProductionVariantInstanceType instanceType)
The ML compute instance type.
|
ProductionVariant.Builder |
instanceType(String instanceType)
The ML compute instance type.
|
ProductionVariant.Builder |
modelName(String modelName)
The name of the model that you want to host.
|
ProductionVariant.Builder |
variantName(String variantName)
The name of the production variant.
|
copy
applyMutation, build
ProductionVariant.Builder variantName(String variantName)
The name of the production variant.
variantName
- The name of the production variant.ProductionVariant.Builder modelName(String modelName)
The name of the model that you want to host. This is the name that you specified when creating the model.
modelName
- The name of the model that you want to host. This is the name that you specified when creating the
model.ProductionVariant.Builder initialInstanceCount(Integer initialInstanceCount)
Number of instances to launch initially.
initialInstanceCount
- Number of instances to launch initially.ProductionVariant.Builder instanceType(String instanceType)
The ML compute instance type.
instanceType
- The ML compute instance type.ProductionVariantInstanceType
,
ProductionVariantInstanceType
ProductionVariant.Builder instanceType(ProductionVariantInstanceType instanceType)
The ML compute instance type.
instanceType
- The ML compute instance type.ProductionVariantInstanceType
,
ProductionVariantInstanceType
ProductionVariant.Builder initialVariantWeight(Float initialVariantWeight)
Determines initial traffic distribution among all of the models that you specify in the endpoint
configuration. The traffic to a production variant is determined by the ratio of the
VariantWeight
to the sum of all VariantWeight
values across all ProductionVariants.
If unspecified, it defaults to 1.0.
initialVariantWeight
- Determines initial traffic distribution among all of the models that you specify in the endpoint
configuration. The traffic to a production variant is determined by the ratio of the
VariantWeight
to the sum of all VariantWeight
values across all
ProductionVariants. If unspecified, it defaults to 1.0.ProductionVariant.Builder acceleratorType(String acceleratorType)
The size of the Elastic Inference (EI) instance to use for the production variant. EI instances provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon SageMaker. For more information, see Using Elastic Inference in Amazon SageMaker.
acceleratorType
- The size of the Elastic Inference (EI) instance to use for the production variant. EI instances
provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon
SageMaker. For more information, see Using Elastic Inference in Amazon
SageMaker.ProductionVariantAcceleratorType
,
ProductionVariantAcceleratorType
ProductionVariant.Builder acceleratorType(ProductionVariantAcceleratorType acceleratorType)
The size of the Elastic Inference (EI) instance to use for the production variant. EI instances provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon SageMaker. For more information, see Using Elastic Inference in Amazon SageMaker.
acceleratorType
- The size of the Elastic Inference (EI) instance to use for the production variant. EI instances
provide on-demand GPU computing for inference. For more information, see Using Elastic Inference in Amazon
SageMaker. For more information, see Using Elastic Inference in Amazon
SageMaker.ProductionVariantAcceleratorType
,
ProductionVariantAcceleratorType
Copyright © 2017 Amazon Web Services, Inc. All Rights Reserved.