Skip to main content

Enabling new Models

To see available Models and enable them for use, navigate to the Models page in AI Gateway.
Models page listing available LLM models with columns for name, input and output pricing, feature badges, release date, max output tokens, context length, and location.
Each model displays its full name alongside a set of sortable columns:
  • Name: full model name and provider
  • Input / Output pricing: per-token cost for input and output
  • Features: capability badges indicating support for ZDR, BYOK, and other model-specific features
  • Released: the model’s release date
  • Max Output Tokens: maximum tokens the model can generate per response
  • Context Length: total token window (input + output)
  • Location: the region where the model is served
Use Sort: Newest to reorder by Newest, Pricing (low to high or high to low), Context (low to high or high to low), or Max Output Tokens. Use Columns to show or hide individual columns.
Use the Status Toggle to Enable a model for use with the AI Gateway.

Filters

Use the modality tabs at the top of the list to scope models by type: All Text Image Audio Speech Embedding Moderation Rerank The sidebar provides additional filters:
FilterDescription
LocationFilter by region: Europe, United States, Global, APAC, Australia, Singapore
AccessToggle Zero data retention for ZDR-compliant providers, or BYOK for providers where an API key has been added
ProvidersFilter by LLM provider. See Providers to configure API keys
StatusShow Enabled or Disabled models
FeaturesFilter by capability: Base64, Code Execution, Image Edit, JSON Mode, PDF, Reasoning, Streaming, Tool Calling, URL, Vision, Web Search
Context lengthDrag the range slider to filter by context window size (512 to 2M tokens)
OwnerFilter between Public (Orq.ai-provided) and Private (onboarded) models
To enable a model, toggle it on. It will immediately be available to call with the AI Gateway.

Onboarding Private Models

Onboard private models by choosing Model at the top-right of the screen. This is useful for fine-tuned models.

Private Models Providers

Referencing Private Models in Code

When referencing private models through the SDKs, API, or Supported Libraries, the model is referenced by the following string: <workspacename>@<provider>/<modelname>.
Example: corp@azure/gpt-4o-2024-05-13

Bring Your Own Key (BYOK)

To start using models, connect provider API keys via BYOK in the AI Gateway sidebar.