InferenceComponentComputeResourceRequirements

Defines the compute resources to allocate to run a model that you assign to an inference component. These resources include CPU cores, accelerators, and memory.

Types

Link copied to clipboard
class Builder
Link copied to clipboard
object Companion

Properties

Link copied to clipboard

The maximum MB of memory to allocate to run a model that you assign to an inference component.

Link copied to clipboard

The minimum MB of memory to allocate to run a model that you assign to an inference component.

Link copied to clipboard

The number of accelerators to allocate to run a model that you assign to an inference component. Accelerators include GPUs and Amazon Web Services Inferentia.

Link copied to clipboard

The number of CPU cores to allocate to run a model that you assign to an inference component.

Functions

Link copied to clipboard
open operator override fun equals(other: Any?): Boolean
Link copied to clipboard
open override fun hashCode(): Int
Link copied to clipboard
open override fun toString(): String