Interface InferenceMetrics.Builder

  • Method Details

    • maxInvocations

      InferenceMetrics.Builder maxInvocations(Integer maxInvocations)

      The expected maximum number of requests per minute for the instance.

      Parameters:
      maxInvocations - The expected maximum number of requests per minute for the instance.
      Returns:
      Returns a reference to this object so that method calls can be chained together.
    • modelLatency

      InferenceMetrics.Builder modelLatency(Integer modelLatency)

      The expected model latency at maximum invocations per minute for the instance.

      Parameters:
      modelLatency - The expected model latency at maximum invocations per minute for the instance.
      Returns:
      Returns a reference to this object so that method calls can be chained together.