Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
The deployment key to invoke
Key-value pairs variables to replace in your prompts. If a variable is not provided that is defined in the prompt, the default variables are used.
Key-value pairs that match your data model and fields declared in your deployment routing configuration
A list of messages to include after the System message, but before the  User and Assistant pairs configured in your deployment.
- Developer message
- System message
- User message
- Assistant message
- Tool message
A list of messages to send to the deployment.
- Developer message
- System message
- User message
- Assistant message
- Tool message
A list of file IDs that are associated with the deployment request.
Key-value pairs that you want to attach to the log generated by this request.
Utilized for passing additional parameters to the model provider. Exercise caution when using this feature, as the included parameters will overwrite any parameters specified in the deployment prompt configuration.
A list of relevant documents that evaluators and guardrails can cite to evaluate the user input or the model response based on your deployment settings.
A filter to apply to the knowledge base chunk metadata when using knowledge bases in the deployment.
- Search operator
- and
- or
Response
Successful operation
A unique identifier for the response. Can be used to add metrics to the transaction.
A timestamp indicating when the object was created. Usually in a standardized format like ISO 8601
Indicates the type of model used to generate the response
chat, completion, image The model used to generate the response
The provider used to generate the response
cohere, openai, anthropic, huggingface, replicate, google, google-ai, azure, aws, anyscale, perplexity, groq, fal, leonardoai, nvidia, jina, togetherai, elevenlabs, litellm, openailike, cerebras, bytedance, mistral Indicates if the response is the final response
A list of choices generated by the model
Indicates integration id used to generate the response
A timestamp indicating when the object was finalized. Usually in a standardized format like ISO 8601
Provider backed system fingerprint.
List of documents retrieved from the knowledge base. This property is only available when the include_retrievals flag is set to true in the invoke settings. When stream is set to true, the retrievals property will be returned in the last streamed chunk where the property is_final is set to true.
Response returned by the model provider. This functionality is only supported when streaming is not used. If streaming is used, the provider_response property will be set to null.