provisionedConcurrency
The amount of provisioned concurrency to allocate for the serverless endpoint. Should be less than or equal to MaxConcurrency
.
This field is not supported for serverless endpoint recommendations for Inference Recommender jobs. For more information about creating an Inference Recommender job, see CreateInferenceRecommendationsJobs.