Prerequisites
- Kilo Code extension installed in VS Code (kilocode.kilo-code)
- Active Orq.ai account
- Orq.ai API key
Setup
Open the Kilo Code panel → Settings → Providers → Custom Provider. Fill in the fields:

| Field | Value |
|---|---|
| Provider | orq (any string) |
| Display Name | orqai (any string; appears as prefix in the model selector) |
| Base URL | https://api.orq.ai/v3/router |
| API Key | <ORQ_API_KEY> |
| Model | Deselect all, then activate the models to use (e.g. anthropic/claude-sonnet-4-6, openai/gpt-4o) |

Activated models appear in the model selector prefixed with the Display Name (e.g. orqai / openai/gpt-4o).
Max output tokens
Kilo Code defaults to 32,000 max output tokens. The Custom Provider GUI does not expose per-model token limits. For models with a smaller native cap (e.g. openai/gpt-4o at 16k), open ~/.config/kilo/kilo.jsonc via Kilo Settings → Global Config and add a limit object for the model under the root models key.
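The limit entry described above might look like the following. This is a minimal sketch, not a verified schema: the text specifies only a root models key and a per-model limit object. The way the model entry is keyed (shown here as the bare model ID) and the inner field name output are assumptions, so check both against the Kilo Code configuration reference before relying on them.

```jsonc
{
  // Root-level "models" key, as described above.
  "models": {
    // Assumption: entries are keyed by the model ID activated in the provider settings.
    "openai/gpt-4o": {
      "limit": {
        // Assumption: the field is named "output"; 16384 matches the 16k cap mentioned above.
        "output": 16384
      }
    }
  }
}
```

Models whose native cap is at or above the 32,000 default should not need an entry.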
Troubleshooting
400 invalid model format
The model ID is missing the provider prefix. Change claude-sonnet-4-6 to anthropic/claude-sonnet-4-6, gpt-4o to openai/gpt-4o, and so on.
Model exceeds token limit
Add a limit object for the model in ~/.config/kilo/kilo.jsonc, as described in the Max output tokens section above. The GUI does not expose this field.
Requests not appearing in Orq.ai Traces
Verify the Base URL is set to https://api.orq.ai/v3/router. Requests sent to a provider’s own endpoint bypass the AI Gateway and produce no Traces.