Models | AI Gateway - Orq.ai Documentation

Enabling new Models

To see available Models and enable them for use, navigate to the Models page in AI Gateway.

Models page listing available LLM models with columns for name, input and output pricing, feature badges, release date, max output tokens, context length, and location.

Each model displays its full name alongside a set of sortable columns:

Name: full model name and provider
Input / Output pricing: per-token cost for input and output
Features: capability badges indicating support for ZDR, BYOK, and other model-specific features
Released: the model’s release date
Max Output Tokens: maximum tokens the model can generate per response
Context Length: total token window (input + output)
Location: the region where the model is served
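Because Context Length covers input and output together, a request has to satisfy two limits at once. The following is a minimal sketch of that arithmetic; the function name and the token figures are hypothetical and are not taken from any specific model on the page.

```python
def fits_model_limits(input_tokens: int,
                      requested_output_tokens: int,
                      context_length: int,
                      max_output_tokens: int) -> bool:
    """Check a request against the two limits shown in the Models list.

    - Max Output Tokens caps the response on its own.
    - Context Length caps input and output combined.
    """
    if requested_output_tokens > max_output_tokens:
        return False
    return input_tokens + requested_output_tokens <= context_length


# Hypothetical model: 128,000-token context, 4,096 max output tokens.
print(fits_model_limits(120_000, 4_096, 128_000, 4_096))  # 124,096 <= 128,000
print(fits_model_limits(125_000, 4_096, 128_000, 4_096))  # 129,096 > 128,000
```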

Use the Sort control (labeled Sort: Newest by default) to reorder the list by Newest, Pricing (low to high or high to low), Context (low to high or high to low), or Max Output Tokens. Use Columns to show or hide individual columns.

Use the Status toggle to enable a model for use with the AI Gateway.

Filters

Use the modality tabs at the top of the list to scope models by type: All, Text, Image, Audio, Speech, Embedding, Moderation, Rerank.

The sidebar provides additional filters:

Filter	Description
Location	Filter by region: Europe, United States, Global, APAC, Australia, Singapore
Access	Toggle Zero data retention for ZDR-compliant providers, or BYOK for providers where an API key has been added
Providers	Filter by LLM provider. See Providers to configure API keys
Status	Show Enabled or Disabled models
Features	Filter by capability: Base64, Code Execution, Image Edit, JSON Mode, PDF, Reasoning, Streaming, Tool Calling, URL, Vision, Web Search
Context length	Drag the range slider to filter by context window size (512 to 2M tokens)
Owner	Filter between Public (Orq.ai-provided) and Private (onboarded) models

To enable a model, toggle it on. It will immediately be available to call with the AI Gateway.

Onboarding Private Models

Onboard private models by choosing Model at the top-right of the screen. This is useful when hosting a fine-tuned model or any model deployed on a private provider such as Azure AI Foundry or Vertex AI.

Private Models Providers

Azure AI Foundry

From the Azure AI Foundry project homepage, copy the API key and one of the two endpoints shown at the top. Orq.ai accepts the following endpoints:

Endpoint type	Format	What it imports
Azure OpenAI endpoint	`https://<resource>.openai.azure.com/openai/v1`	OpenAI-compatible deployments
Project endpoint	`https://<resource>.services.ai.azure.com/api/projects/<project>`	All deployments from publishers in the project (Anthropic, Cohere, and xAI)

Paste the endpoint URL exactly as shown in Azure AI Foundry. Orq.ai does not append any path suffix.
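Before pasting, it can help to confirm which of the two formats a copied URL matches. The sketch below is a local convenience check only, not part of Orq.ai or Azure: the function name `classify_endpoint` and the characters allowed in `<resource>` and `<project>` are assumptions, and a `None` result means only that the URL differs from the two formats in the table.

```python
import re
from typing import Optional

# Patterns mirror the two formats in the table above. The character classes
# used for <resource> and <project> are assumptions, not Azure's naming rules.
AZURE_OPENAI_ENDPOINT = re.compile(
    r"^https://[A-Za-z0-9-]+\.openai\.azure\.com/openai/v1$"
)
PROJECT_ENDPOINT = re.compile(
    r"^https://[A-Za-z0-9-]+\.services\.ai\.azure\.com/api/projects/[^/\s]+$"
)


def classify_endpoint(url: str) -> Optional[str]:
    """Return which documented format the URL matches, or None."""
    if AZURE_OPENAI_ENDPOINT.match(url):
        return "azure_openai"
    if PROJECT_ENDPOINT.match(url):
        return "project"
    return None


# Hypothetical resource and project names, for illustration only.
print(classify_endpoint("https://my-resource.openai.azure.com/openai/v1"))
print(classify_endpoint(
    "https://my-resource.services.ai.azure.com/api/projects/my-project"
))
```

The check is deliberately strict and does not modify the URL, in line with the instruction to paste it exactly as shown.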

To learn more about the Azure AI Foundry deployment, see our Provider Documentation.

Open Add Model

In the AI Gateway sidebar, go to Models, then click Model at the top-right and select Azure.

Enter credentials

Enter the Base URL and API Key.

Azure AI Foundry credentials panel showing the API key, Project endpoint, and Azure OpenAI endpoint at the top of the project homepage.

Azure private model configuration form showing endpoint URL and API Key fields.

Fetch deployments

Click Fetch deployments to automatically import all available deployments. The imported models appear in the Models list. Toggle each model Enabled before use. Enabled models are available for routing requests through Routing Rules.

Reference imported models in code using `<workspacename>@azure/<modelname>` (see Referencing Private Models in Code).

Vertex AI

In the AI Gateway sidebar, go to Models, then click Model at the top-right and select Vertex AI. Enter the JSON configuration from your Google Cloud project to make the model available on the platform.

Google Vertex AI model configuration form showing the JSON configuration input to add a private model.

For full Vertex AI setup instructions, see Google Vertex AI.

LiteLLM

To import LiteLLM models, first create an Integration for the LiteLLM instance. After creation, return to the AI Gateway and import models from the connected instance.

LiteLLM model import screen showing how to select and import models from a connected LiteLLM instance.

Referencing Private Models in Code

When calling private models through the SDKs, API, or Supported Libraries, reference the model with the following string: `<workspacename>@<provider>/<modelname>`.

Example: `corp@azure/gpt-4o-2024-05-13`
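The reference string can be assembled and taken apart with plain string handling. The sketch below is illustrative: `ModelRef`, `format_model_ref`, and `parse_model_ref` are hypothetical helper names, not part of the Orq.ai SDKs, and the parsing assumes the workspace name contains no `@` and the provider contains no `/`.

```python
from typing import NamedTuple


class ModelRef(NamedTuple):
    workspace: str
    provider: str
    model: str


def format_model_ref(workspace: str, provider: str, model: str) -> str:
    """Build a <workspacename>@<provider>/<modelname> string."""
    return f"{workspace}@{provider}/{model}"


def parse_model_ref(ref: str) -> ModelRef:
    """Split a reference string into workspace, provider, and model name."""
    workspace, at, rest = ref.partition("@")
    provider, slash, model = rest.partition("/")
    if not (at and slash and workspace and provider and model):
        raise ValueError(f"not a private model reference: {ref!r}")
    return ModelRef(workspace, provider, model)


# Reproduces the example from the text.
print(format_model_ref("corp", "azure", "gpt-4o-2024-05-13"))
print(parse_model_ref("corp@azure/gpt-4o-2024-05-13"))
```

Pass the resulting string as the model identifier wherever the SDK, API, or library in use expects a model name.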

Bring Your Own Key (BYOK)

To start using models, connect provider API keys via BYOK in the AI Gateway sidebar.
