InferenceComponentSpecification
Details about the resources to deploy with this inference component, including the model, container, and compute resources.
Types
Properties
The name of an existing inference component that is to contain the inference component that you're creating with your request.
The compute resources allocated to run the model, plus any adapter models, that you assign to the inference component.
Defines a container that provides the runtime environment for a model that you deploy with an inference component.
Settings that take effect while the model container starts up.
Functions
inline fun copy(block: InferenceComponentSpecification.Builder.() -> Unit = {}): InferenceComponentSpecification
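The `copy` signature follows the usual builder-copy pattern: the current values seed a builder, the optional `block` overrides some of them, and a new immutable instance is returned. The sketch below illustrates that pattern with a hypothetical stand-in class, `SpecSketch`, and invented properties `name` and `memoryMb`. It is not the real `InferenceComponentSpecification` and does not claim to show how the SDK implements `copy`.

```kotlin
// Hypothetical stand-in type used only to illustrate the builder-copy pattern.
// The class name and its properties (name, memoryMb) are assumptions for this sketch.
class SpecSketch private constructor(builder: Builder) {
    val name: String? = builder.name
    val memoryMb: Int? = builder.memoryMb

    class Builder {
        var name: String? = null
        var memoryMb: Int? = null
        fun build(): SpecSketch = SpecSketch(this)
    }

    // Seed a fresh builder with this instance's current values.
    fun toBuilder(): Builder = Builder().also {
        it.name = name
        it.memoryMb = memoryMb
    }

    // Same shape as the documented signature: an optional block applied to a builder.
    inline fun copy(block: Builder.() -> Unit = {}): SpecSketch =
        toBuilder().apply(block).build()

    companion object {
        // DSL-style construction: SpecSketch { ... }
        operator fun invoke(block: Builder.() -> Unit): SpecSketch =
            Builder().apply(block).build()
    }
}

fun main() {
    val original = SpecSketch { name = "base"; memoryMb = 1024 }
    // Override one property; the rest carry over from the original.
    val modified = original.copy { memoryMb = 2048 }

    println(original.memoryMb) // 1024 (the original is unchanged)
    println(modified.name)     // base
    println(modified.memoryMb) // 2048
}
```

Because `block` defaults to an empty lambda, calling `copy()` with no arguments in this sketch returns a new instance with the same property values.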