Skip to main content

What is the Orq MCP?

The Orq Model Context Protocol (MCP) server provides AI code assistants with direct access to an Orq.ai workspace. Invoke models, query traces, and search the workspace without leaving the IDE.

Installation

Point your assistant at the MCP server and authenticate with your API key:
Endpointhttps://my.orq.ai/v2/mcp
Auth HeaderAuthorization: Bearer YOUR_ORQ_API_KEY

Code Assistants

See detailed documentation for the following code assistants:
https://mintcdn.com/orqai/d-t0Z04KwFlGVsS1/images/logos/claude-code.svg?fit=max&auto=format&n=d-t0Z04KwFlGVsS1&q=85&s=1654adb689b324322f3d63dd9eef6ad0

Claude Code

Official Anthropic CLI for Claude with MCP integration
https://mintcdn.com/orqai/d-t0Z04KwFlGVsS1/images/logos/claude-desktop.svg?fit=max&auto=format&n=d-t0Z04KwFlGVsS1&q=85&s=bccf0e29820481de693393dd71aa84f3

Claude Desktop

Use Orq MCP in Claude’s desktop application
https://mintcdn.com/orqai/d-t0Z04KwFlGVsS1/images/logos/codex.svg?fit=max&auto=format&n=d-t0Z04KwFlGVsS1&q=85&s=feb92a7651c85d15d424f900821581b6

Codex

AI coding assistant with MCP protocol support
https://mintcdn.com/orqai/d-t0Z04KwFlGVsS1/images/logos/cursor.svg?fit=max&auto=format&n=d-t0Z04KwFlGVsS1&q=85&s=ba1ed7e66516eb920ae1462050041bdf

Cursor

AI-first code editor with native MCP support
https://mintcdn.com/orqai/9VWlkBlGuTGPnFjG/images/logos/vscode.svg?fit=max&auto=format&n=9VWlkBlGuTGPnFjG&q=85&s=4c770da22e84990dee79094d1fab3179

VS Code

AI-powered editor with GitHub Copilot and native MCP support
https://mintcdn.com/orqai/d-t0Z04KwFlGVsS1/images/logos/warp.svg?fit=max&auto=format&n=d-t0Z04KwFlGVsS1&q=85&s=2f4b99ef604241484380367ae742ea4d

Warp

AI-powered terminal with native MCP support

Available Tools

The Orq MCP provides 8 tools across 3 categories:
CategoryToolDescription
Modelslist_modelsList available AI models by type (chat, embedding, image, tts, stt, and more)
Modelsinvoke_modelInvoke any model directly via the Responses API. Supports reasoning effort control and response content inclusion
Searchsearch_entitiesSearch any entity type: project, deployment, prompt, knowledge, or memory store (supports cursor pagination)
Searchsearch_directoriesList directories within a project
Searchsearch_docsQuery the Orq.ai documentation for feature guidance and API reference
Traceslist_tracesList traces with filtering by model, type, project, thread ID, time range, and more
Tracesget_spanRetrieve a single span (compact or full mode)
Traceslist_spansList all spans in a trace

Examples

Use invoke_model to call any model directly via the Responses API.Parameters
ParameterTypeDescription
modelstringModel ID in provider/model format (e.g. openai/gpt-5, openai/o3)
reasoningobjectReasoning configuration. Supported on OpenAI GPT-5 and o-series models only. effort: none, low, medium, high, or xhigh. summary: auto, concise, or detailed
includearrayResponse content to include: reasoning.encrypted_content, message.output_text.logprobs

Call an o-series model with reasoning
Use invoke_model to call openai/o3 with medium reasoning effort and return a concise reasoning summary
The assistant will:
  1. Use invoke_model with model: "openai/o3" and reasoning: { effort: "medium", summary: "concise" }
  2. Return the model response along with the reasoning summary

Include encrypted reasoning content
Invoke gpt-5 and include the encrypted reasoning content in the response
The assistant will:
  1. Use invoke_model with model: "openai/gpt-5" and include: ["reasoning.encrypted_content"]
  2. Return the response with the encrypted reasoning block attached
Find errors from the last 24 hours
Show me all traces with errors from the last 24 hours
The assistant will:
  1. Calculate the unix timestamp for 24 hours ago
  2. Use list_traces with filter status:=ERROR && timestamp:>TIMESTAMP and sort by timestamp:desc
  3. Display trace IDs, names, durations, and timestamps
  4. Summarize the most common error types and their frequency

Find the slowest traces
Find the 5 slowest traces from today and show me their span details
The assistant will:
  1. Use list_traces sorted by duration_ms:desc, filtered to today, limit 5
  2. Use list_spans with each trace_id to retrieve the full span tree
  3. Surface bottlenecks and latency outliers

Filter traces by thread ID
Show me all traces for thread ID thread_abc123
The assistant will:
  1. Use list_traces with thread_id: "thread_abc123"
  2. Return all traces associated with that conversation thread
  3. Surface turn count, total cost, and any errors across the session