Optional extra: Extra parameters to include in the payload.
Optional grammar: The GBNF grammar to use for grammar-based sampling.
Optional images: Base64-encoded image data (for multimodal models).
Optional max_: The number of predictions to return.
Optional min_: The minimum probability for a token to be considered, relative to the probability of the most likely token.
Optional model: The model configuration details for inference.
Optional repeat_: Adjusts the penalty for repeated tokens.
Optional schema: A JSON schema used to format the output.
Optional stop: List of stop words or phrases that halt predictions.
Optional stream: Indicates whether results should be streamed progressively.
Optional temperature: Adjusts randomness in sampling; higher values mean more randomness.
Optional template: The template to use, for the backends that support it.
Optional tfs: Sets the tail free sampling value.
Optional top_: Limits the result set to the top K results.
Optional top_: Filters results based on cumulative probability.
Optional ts: A TypeScript interface to be converted to a GBNF grammar for grammar-based sampling.
Describes the parameters for making an inference request.
InferenceParams
Example
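A minimal sketch of how such a parameter object might be assembled. The interface `InferenceParamsSketch` is a hypothetical local stand-in, not the library's actual declaration: it covers only a few of the properties whose full names appear above, and the field types, the `withDefaults` helper, and the default values are all assumptions made for illustration.

```typescript
// Hypothetical stand-in for InferenceParams: a subset of the documented
// properties, with assumed types. Every field is optional, as documented.
interface InferenceParamsSketch {
  stream?: boolean;                 // stream results progressively
  temperature?: number;             // higher values mean more randomness
  stop?: string[];                  // stop words or phrases
  extra?: Record<string, unknown>;  // extra parameters for the payload
}

// Hypothetical helper: fields left out of `overrides` fall back to the
// defaults below. The default values are illustrative assumptions only.
function withDefaults(overrides: InferenceParamsSketch): InferenceParamsSketch {
  const defaults: InferenceParamsSketch = { stream: false, temperature: 0.8 };
  return { ...defaults, ...overrides };
}

// Override the temperature and add a stop sequence; `stream` keeps its default.
const params = withDefaults({ temperature: 0.2, stop: ["</s>"] });
console.log(JSON.stringify(params));
```

Because every property is optional, a caller only needs to set the values that differ from what the backend would otherwise use; the object spread keeps the override logic to a single line.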