What is the Orq MCP?
The Orq Model Context Protocol (MCP) server provides AI code assistants with direct access to an Orq.ai workspace. Invoke models, query traces, and search the workspace without leaving the IDE.Installation
Point your assistant at the MCP server and authenticate with your API key:| Endpoint | https://my.orq.ai/v2/mcp |
| Auth Header | Authorization: Bearer YOUR_ORQ_API_KEY |
Code Assistants
See detailed documentation for the following code assistants:Claude Code
Official Anthropic CLI for Claude with MCP integration
Claude Desktop
Use Orq MCP in Claude’s desktop application
Codex
AI coding assistant with MCP protocol support
Cursor
AI-first code editor with native MCP support
VS Code
AI-powered editor with GitHub Copilot and native MCP support
Warp
AI-powered terminal with native MCP support
Available Tools
The Orq MCP provides 8 tools across 3 categories:| Category | Tool | Description |
|---|---|---|
| Models | list_models | List available AI models by type (chat, embedding, image, tts, stt, and more) |
| Models | invoke_model | Invoke any model directly via the Responses API. Supports reasoning effort control and response content inclusion |
| Search | search_entities | Search any entity type: project, deployment, prompt, knowledge, or memory store (supports cursor pagination) |
| Search | search_directories | List directories within a project |
| Search | search_docs | Query the Orq.ai documentation for feature guidance and API reference |
| Traces | list_traces | List traces with filtering by model, type, project, thread ID, time range, and more |
| Traces | get_span | Retrieve a single span (compact or full mode) |
| Traces | list_spans | List all spans in a trace |
Examples
Invoking a Model
Invoking a Model
Use
Call an o-series model with reasoningThe assistant will:
Include encrypted reasoning contentThe assistant will:
invoke_model to call any model directly via the Responses API.Parameters| Parameter | Type | Description |
|---|---|---|
model | string | Model ID in provider/model format (e.g. openai/gpt-5, openai/o3) |
reasoning | object | Reasoning configuration. Supported on OpenAI GPT-5 and o-series models only. effort: none, low, medium, high, or xhigh. summary: auto, concise, or detailed |
include | array | Response content to include: reasoning.encrypted_content, message.output_text.logprobs |
Call an o-series model with reasoning
- Use
invoke_modelwithmodel: "openai/o3"andreasoning: { effort: "medium", summary: "concise" } - Return the model response along with the reasoning summary
Include encrypted reasoning content
- Use
invoke_modelwithmodel: "openai/gpt-5"andinclude: ["reasoning.encrypted_content"] - Return the response with the encrypted reasoning block attached
Investigating Traces
Investigating Traces
Find errors from the last 24 hoursThe assistant will:
Find the slowest tracesThe assistant will:
Filter traces by thread IDThe assistant will:
- Calculate the unix timestamp for 24 hours ago
- Use
list_traceswith filterstatus:=ERROR && timestamp:>TIMESTAMPand sort bytimestamp:desc - Display trace IDs, names, durations, and timestamps
- Summarize the most common error types and their frequency
Find the slowest traces
- Use
list_tracessorted byduration_ms:desc, filtered to today, limit 5 - Use
list_spanswith eachtrace_idto retrieve the full span tree - Surface bottlenecks and latency outliers
Filter traces by thread ID
- Use
list_traceswiththread_id: "thread_abc123" - Return all traces associated with that conversation thread
- Surface turn count, total cost, and any errors across the session