Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.orq.ai/llms.txt

Use this file to discover all available pages before exploring further.

Datasets hold the test data that powers Experiments. Each dataset row contains up to three fields:
  • Inputs: Variables injected into the prompt at runtime, e.g. {{firstname}}.
  • Messages: The prompt template, structured with system, user, and assistant roles.
  • Expected Outputs: Reference responses evaluators compare against model outputs.
You don’t need all three fields in every dataset. A dataset with only inputs, or only messages, is valid.

Use Cases

Run the same dataset through your prompts before and after a change to verify that updates haven’t degraded performance in any area.
Use the same dataset across multiple models or prompt configurations in an Experiment to find the best combination of quality, cost, and latency.
Have domain experts review and correct model outputs, then save those verified input/output pairs as a curated dataset to use as fine-tuning reference data.
Use the Orq MCP to generate hundreds of realistic test cases programmatically and add them directly to a dataset without leaving your IDE.
Build datasets with image messages for testing vision models. Supports JPEG, PNG, GIF, and WebP via the AI Studio or API.

Create a Dataset

Use the + button on a Project folder and select Dataset. Enter a title to open the Table View.The table has three columns: Inputs, Messages, and Expected Outputs. Add as many rows as needed.

Add Datapoints

Manually: Click Add Row and fill in each cell.From CSV: Click Import and drag-and-drop a .csv file. Map each CSV column to a Dataset field (Inputs, Messages, Expected Outputs). Each row becomes a separate datapoint.

Create Image Datasets

Start by creating a dataset, then add messages with images.When editing a message cell:
  1. Click Add image in the message editor.
  1. Choose how to provide the image:
    • Upload locally: Select a file from your computer.
    • Enter URL: Paste an image URL directly.
Supported formats: JPEG, PNG, GIF, WebP.

Create Curated Datasets

Curated datasets are human-evaluated input and output sets: a prompt paired with a verified expected output. They are used for fine-tuning and as a gold-standard reference in Experiments. Within any module, open the Logs tab and select a log entry. The Feedback panel appears on the right. To add a correction, click Add correction below the assistant response:
Edit the response in the Correction message that opens, then click Save.
Click the Add to Dataset icon at the top-right of the response to save the corrected entry to a dataset:
Import a curated dataset into an Experiment, attach an Evaluator, and see which model or prompt scores best against the curated reference outputs.

List and Retrieve Datasets

List datasets:
curl --request GET \
     --url https://api.orq.ai/v2/datasets \
     --header 'accept: application/json' \
     --header 'authorization: Bearer ORQ_API_KEY'
Retrieve a dataset by ID:
curl --request GET \
     --url https://api.orq.ai/v2/datasets/DATASET_ID \
     --header 'accept: application/json' \
     --header 'authorization: Bearer ORQ_API_KEY'
See the List Datasets and Retrieve a Dataset API references.