Configurations for controlling the inference response of an InvokeAgent API call
Maximum length of the generated output, in tokens
List of stop sequences; generation stops when the model produces one of them
Controls randomness: higher values increase the diversity of the output
Sample from the k most likely next tokens
Cumulative probability cutoff for token selection
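
The settings above are easier to compare on a toy next-token distribution. The following is a minimal sketch in plain Python and does not call the InvokeAgent API. The function and parameter names (apply_temperature, top_k_filter, top_p_filter, truncate_at_stop) are hypothetical and chosen for illustration only. They are not the API's field names, and the service's exact behavior (for example, whether a stop sequence is included in the returned text) may differ by model.

```python
import math


def apply_temperature(logits, temperature):
    """Turn raw scores into probabilities; lower temperature sharpens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_filter(probs, k):
    """Keep only the k most likely token indices and renormalize."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept = order[:k]
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}


def top_p_filter(probs, p):
    """Keep the smallest set of most likely tokens whose cumulative probability reaches p."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= p:
            break
    total = sum(probs[i] for i in kept)
    return {i: probs[i] / total for i in kept}


def truncate_at_stop(text, stop_sequences):
    """Cut text at the earliest stop sequence (assumption: the stop sequence itself is dropped)."""
    cut = len(text)
    for stop in stop_sequences:
        index = text.find(stop)
        if index != -1:
            cut = min(cut, index)
    return text[:cut]
```

With token probabilities [0.5, 0.3, 0.2], top_k_filter with k=2 and top_p_filter with p=0.7 both keep the first two tokens, but for different reasons: the first counts candidates, the second accumulates probability mass until the cutoff is reached.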