InferenceComponentComputeResourceRequirements
Defines the compute resources to allocate to run a model that you assign to an inference component. These resources include CPU cores, accelerators, and memory.
Types
Properties
Link copied to clipboard
The maximum MB of memory to allocate to run a model that you assign to an inference component.
Link copied to clipboard
The minimum MB of memory to allocate to run a model that you assign to an inference component.
Link copied to clipboard
The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and Amazon Web Services Inferentia.
Link copied to clipboard
The number of CPU cores to allocate to run a model that you assign to an inference component.
Functions
Link copied to clipboard
inline fun copy(block: InferenceComponentComputeResourceRequirements.Builder.() -> Unit = {}): InferenceComponentComputeResourceRequirements