This page describes features extending the AI Gateway, which provides a unified API for accessing multiple AI providers. To learn more, see AI Gateway.

List of supported models

Responses API

curl "https://api.orq.ai/v2/proxy/responses" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -d '{
    "model": "openai/gpt-4o",
    "input": "Write a one-sentence bedtime story about a unicorn."
  }'

Supported Models

Provider     Model
OpenAI       openai/gpt-3.5-turbo
OpenAI       openai/gpt-3.5-turbo-0125
OpenAI       openai/gpt-3.5-turbo-16k
OpenAI       openai/gpt-4-0125-preview
OpenAI       openai/gpt-4-turbo
OpenAI       openai/gpt-4-turbo-2024-04-09
OpenAI       openai/gpt-4.1
OpenAI       openai/gpt-4.1-2025-04-14
OpenAI       openai/gpt-4.1-mini
OpenAI       openai/gpt-4.1-mini-2025-04-14
OpenAI       openai/gpt-4.1-nano
OpenAI       openai/gpt-4.1-nano-2025-04-14
OpenAI       openai/gpt-4o
OpenAI       openai/gpt-4o-2024-05-13
OpenAI       openai/gpt-4o-2024-08-06
OpenAI       openai/gpt-4o-mini
OpenAI       openai/gpt-4o-mini-2024-07-18
OpenAI       openai/gpt-5
OpenAI       openai/gpt-5-chat-latest
OpenAI       openai/gpt-5-mini
OpenAI       openai/gpt-5-nano
OpenAI       openai/gpt-5-pro
OpenAI       openai/gpt-5.1
OpenAI       openai/gpt-5.1-chat-latest
OpenAI       openai/o1
OpenAI       openai/o1-2024-12-17
OpenAI       openai/o3
OpenAI       openai/o3-2025-04-16
OpenAI       openai/o3-mini
OpenAI       openai/o3-mini-2025-01-31
OpenAI       openai/o4-mini
OpenAI       openai/o4-mini-2025-04-16

Chat models

curl https://api.orq.ai/v2/proxy/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -d '{
    "model": "openai/gpt-4o",
    "messages": [
      {
        "role": "system",
        "content": "You are a helpful assistant."
      },
      {
        "role": "user",
        "content": "Hello!"
      }
    ]
  }'

Provider     Model
Anthropic    anthropic/claude-3-5-haiku-20241022
Anthropic    anthropic/claude-3-5-sonnet-20241022
Anthropic    anthropic/claude-3-7-sonnet-20250219
Anthropic    anthropic/claude-3-7-sonnet-latest
Anthropic    anthropic/claude-3-haiku-20240307
Anthropic    anthropic/claude-haiku-4-5-20251001
Anthropic    anthropic/claude-opus-4-1-20250805
Anthropic    anthropic/claude-opus-4-20250514
Anthropic    anthropic/claude-sonnet-4-20250514
Anthropic    anthropic/claude-sonnet-4-5-20250929
AWS Bedrock  aws/anthropic.claude-3-5-sonnet-20241022-v2:0
AWS Bedrock  aws/anthropic.claude-3-haiku-20240307-v1:0
AWS Bedrock  aws/anthropic.claude-3-opus-20240229-v1:0
AWS Bedrock  aws/anthropic.claude-3-sonnet-20240229-v1:0
AWS Bedrock  aws/eu.anthropic.claude-3-5-sonnet-20240620-v1:0
AWS Bedrock  aws/eu.anthropic.claude-3-7-sonnet-20250219-v1:0
AWS Bedrock  aws/eu.anthropic.claude-sonnet-4-20250514-v1:0
Azure        azure/gpt-4.1
Azure        azure/gpt-4.1-mini
Azure        azure/gpt-4.1-nano
Azure        azure/gpt-4o
Azure        azure/gpt-4o-mini
Azure        azure/gpt-5-chat
Azure        azure/gpt-5-mini
Azure        azure/gpt-5-nano
Azure        azure/llama-3.1-405B-instruct
Azure        azure/llama-3.1-8B
Azure        azure/o1
Azure        azure/o1-mini
Azure        azure/o3-mini
Cerebras     cerebras/gpt-oss-120b
Cerebras     cerebras/llama-3.3-70b
Cerebras     cerebras/llama-4-scout-17b-16e-instruct
Cerebras     cerebras/llama3.1-8b
Cerebras     cerebras/qwen-3-235b-a22b-instruct-2507
Cerebras     cerebras/qwen-3-32b
Cerebras     cerebras/qwen-3-coder-480b
Cohere       cohere/command-a-03-2025
Cohere       cohere/command-a-reasoning-08-2025
Cohere       cohere/command-a-translate-08-2025
Cohere       cohere/command-a-vision-07-2025
Cohere       cohere/command-r-08-2024
Cohere       cohere/command-r-plus-08-2024
Cohere       cohere/command-r7b-12-2024
Vertex AI    google/claude-3-5-haiku@20241022
Vertex AI    google/claude-3-5-sonnet-v2@20241022
Vertex AI    google/claude-3-7-sonnet@20250219
Vertex AI    google/claude-3-opus@20240229
Vertex AI    google/claude-haiku-4-5@20251001
Vertex AI    google/claude-opus-4-1@20250805
Vertex AI    google/claude-opus-4@20250514
Vertex AI    google/claude-sonnet-4-5@20250929
Vertex AI    google/claude-sonnet-4@20250514
Vertex AI    google/gemini-2.0-flash
Vertex AI    google/gemini-2.0-flash-001
Vertex AI    google/gemini-2.0-flash-lite-001
Vertex AI    google/gemini-2.5-flash
Vertex AI    google/gemini-2.5-flash-lite
Vertex AI    google/gemini-2.5-flash-lite-preview-09-2025
Vertex AI    google/gemini-2.5-flash-preview-09-2025
Vertex AI    google/gemini-2.5-pro
Vertex AI    google/gemini-3-pro-preview
Vertex AI    google/meta/llama-3.3-70b-instruct-maas
Vertex AI    google/meta/llama-4-maverick-17b-128e-instruct-maas
Vertex AI    google/meta/llama-4-scout-17b-16e-instruct-maas
Vertex AI    google/mistral-small-2503
Google AI    google-ai/gemini-2.0-flash
Google AI    google-ai/gemini-2.0-flash-001
Google AI    google-ai/gemini-2.0-flash-lite-001
Google AI    google-ai/gemini-2.0-flash-lite-preview-02-05
Google AI    google-ai/gemini-2.0-flash-thinking-exp-01-21
Google AI    google-ai/gemini-2.0-pro-exp-02-05
Google AI    google-ai/gemini-2.5-flash
Google AI    google-ai/gemini-2.5-flash-lite
Google AI    google-ai/gemini-2.5-pro
Google AI    google-ai/gemini-3-pro-preview
Groq         groq/llama-3.3-70b-versatile
Groq         groq/meta-llama/llama-4-maverick-17b-128e-instruct
Groq         groq/meta-llama/llama-4-scout-17b-16e-instruct
Groq         groq/meta-llama/llama-guard-4-12b
Groq         groq/meta-llama/llama-prompt-guard-2-86m
Groq         groq/moonshotai/kimi-k2-instruct-0905
Groq         groq/openai/gpt-oss-120b
Groq         groq/openai/gpt-oss-20b
Mistral      mistral/magistral-medium-2509
Mistral      mistral/ministral-3b-2410
Mistral      mistral/ministral-8b-2410
Mistral      mistral/mistral-large-2411
Mistral      mistral/mistral-medium-2508
Mistral      mistral/mistral-medium-latest
Mistral      mistral/mistral-small-2409
Mistral      mistral/mistral-small-latest
Mistral      mistral/pixtral-large-2411
OpenAI       openai/gpt-3.5-turbo
OpenAI       openai/gpt-3.5-turbo-0125
OpenAI       openai/gpt-3.5-turbo-16k
OpenAI       openai/gpt-4-0125-preview
OpenAI       openai/gpt-4-turbo
OpenAI       openai/gpt-4-turbo-2024-04-09
OpenAI       openai/gpt-4.1
OpenAI       openai/gpt-4.1-2025-04-14
OpenAI       openai/gpt-4.1-mini
OpenAI       openai/gpt-4.1-mini-2025-04-14
OpenAI       openai/gpt-4.1-nano
OpenAI       openai/gpt-4.1-nano-2025-04-14
OpenAI       openai/gpt-4o
OpenAI       openai/gpt-4o-2024-05-13
OpenAI       openai/gpt-4o-2024-08-06
OpenAI       openai/gpt-4o-mini
OpenAI       openai/gpt-4o-mini-2024-07-18
OpenAI       openai/gpt-5
OpenAI       openai/gpt-5-chat-latest
OpenAI       openai/gpt-5-mini
OpenAI       openai/gpt-5-nano
OpenAI       openai/gpt-5-pro
OpenAI       openai/gpt-5.1
OpenAI       openai/gpt-5.1-chat-latest
OpenAI       openai/o1
OpenAI       openai/o1-2024-12-17
OpenAI       openai/o3
OpenAI       openai/o3-2025-04-16
OpenAI       openai/o3-mini
OpenAI       openai/o3-mini-2025-01-31
OpenAI       openai/o4-mini
OpenAI       openai/o4-mini-2025-04-16
Perplexity   perplexity/sonar
Perplexity   perplexity/sonar-deep-research
Perplexity   perplexity/sonar-pro
Perplexity   perplexity/sonar-reasoning
Perplexity   perplexity/sonar-reasoning-pro
Together AI  togetherai/deepseek-ai/DeepSeek-R1
Together AI  togetherai/deepseek-ai/DeepSeek-V3
Together AI  togetherai/deepseek-ai/DeepSeek-V3.1
Together AI  togetherai/meta-llama/Llama-3.3-70B-Instruct-Turbo
Together AI  togetherai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8
Together AI  togetherai/meta-llama/Llama-4-Scout-17B-16E-Instruct
Together AI  togetherai/meta-llama/Llama-Guard-4-12B
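
Because every chat model in the table is reached through the same endpoint, switching providers should only require changing the model value. The following is a minimal sketch of that idea, not an official client: the helper names model_provider, chat_payload and ask_models are hypothetical, and it assumes the provider slug is everything before the first "/" in a model ID, as the table suggests.

```shell
#!/usr/bin/env bash
# Sketch: send one fixed prompt to several chat models via the same endpoint.
# Helper names are illustrative and not part of the gateway.

# Provider slug = everything before the first "/" in a model ID (assumption
# based on the IDs listed in the table).
model_provider() {
  printf '%s\n' "${1%%/*}"
}

# Build the JSON body for a fixed prompt. The model IDs listed above contain
# no characters that need JSON escaping, so plain substitution is safe here.
chat_payload() {
  printf '{"model": "%s", "messages": [{"role": "user", "content": "Hello!"}]}\n' "$1"
}

# POST the same request once per model ID passed as an argument.
ask_models() {
  local model
  for model in "$@"; do
    printf '== %s (provider: %s) ==\n' "$model" "$(model_provider "$model")"
    curl -sS https://api.orq.ai/v2/proxy/chat/completions \
      -H "Content-Type: application/json" \
      -H "Authorization: Bearer $ORQ_API_KEY" \
      -d "$(chat_payload "$model")"
    printf '\n'
  done
}

# Example (makes real API calls, so it is left commented out):
# ask_models openai/gpt-4o anthropic/claude-sonnet-4-20250514 groq/openai/gpt-oss-20b
```

The script only defines functions; nothing is sent until ask_models is called with ORQ_API_KEY set in the environment.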

Completion models

curl https://api.orq.ai/v2/proxy/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -d '{
    "model": "openai/gpt-3.5-turbo-instruct",
    "prompt": "Once upon a time",
    "max_tokens": 100
  }'

Provider     Model
OpenAI       openai/gpt-3.5-turbo-instruct

Embedding models

curl https://api.orq.ai/v2/proxy/embeddings \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -d '{
    "model": "openai/text-embedding-3-small",
    "input": "Hello world"
  }'

Provider     Model
Azure        azure/text-embedding-3-small
Azure        azure/text-embedding-ada-002
Cohere       cohere/embed-english-light-v3.0
Cohere       cohere/embed-english-v3.0
Cohere       cohere/embed-multilingual-light-v3.0
Cohere       cohere/embed-multilingual-v3.0
Cohere       cohere/embed-v4.0
Vertex AI    google/gemini-embedding-001
Vertex AI    google/multimodalembedding@001
Vertex AI    google/text-multilingual-embedding-002
Google AI    google-ai/text-embedding-004
Jina AI      jina/jina-clip-v1
Jina AI      jina/jina-clip-v2
Jina AI      jina/jina-embeddings-v2-base-code
Jina AI      jina/jina-embeddings-v2-base-de
Jina AI      jina/jina-embeddings-v2-base-en
Jina AI      jina/jina-embeddings-v2-base-es
Jina AI      jina/jina-embeddings-v2-base-zh
Jina AI      jina/jina-embeddings-v3
Mistral      mistral/mistral-embed
OpenAI       openai/text-embedding-3-large
OpenAI       openai/text-embedding-3-small
OpenAI       openai/text-embedding-ada-002

Image models

Image Generation

curl https://api.orq.ai/v2/proxy/images/generations \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -d '{
    "model": "openai/dall-e-3",
    "prompt": "A beautiful sunset over mountains",
    "n": 1,
    "size": "1024x1024"
  }'

Image Edit

curl https://api.orq.ai/v2/proxy/images/edits \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -F model="openai/gpt-image-1" \
  -F image="@body-lotion.png" \
  -F image="@bath-bomb.png" \
  -F image="@incense-kit.png" \
  -F image="@soap.png" \
  -F prompt="Generate a photorealistic image of a gift basket on a white background labeled 'Relax & Unwind' with a ribbon and handwriting-like font, containing all the items in the reference pictures."

Image Variations

curl https://api.orq.ai/v2/proxy/images/variations \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -F model="openai/dall-e-2" \
  -F image="@image_edit_original.png" \
  -F n=2 \
  -F size="1024x1024"

Supported Image Models

Provider     Model                                 Capabilities
Azure        azure/dall-e-3                        Generation, Edit
ByteDance    bytedance/seededit-3-0-i2i-250628     Generation, Edit
ByteDance    bytedance/seedream-3-0-t2i-250415     Generation
ByteDance    bytedance/seedream-4-0-250828         Generation, Edit
FAL          fal/flux-pro/new                      Generation
FAL          fal/flux/dev                          Generation
FAL          fal/flux/schnell                      Generation
FAL          fal/gemini-25-flash-image             Generation
Vertex AI    google/imagen-3.0-fast-generate-001   Generation
Vertex AI    google/imagen-3.0-generate-001        Generation
Vertex AI    google/imagen-4.0-fast-generate-001   Generation
Vertex AI    google/imagen-4.0-generate-001        Generation
Vertex AI    google/imagen-4.0-ultra-generate-001  Generation
Leonardo AI  leonardoai/leonard-diffusion-xl       Generation, Edit
Leonardo AI  leonardoai/leonard-kino-xl            Generation, Edit
Leonardo AI  leonardoai/leonard-lightning-xl       Generation, Edit
Leonardo AI  leonardoai/leonard-vision-xl          Generation, Edit
OpenAI       openai/dall-e-2                       Generation, Edit
OpenAI       openai/dall-e-3                       Generation
OpenAI       openai/gpt-image-1                    Generation, Edit
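
Since only some image models list Edit in the table, a script may want to check the capability before choosing between the generations and edits paths. The sketch below hardcodes the Edit-capable rows of the table; the function names image_model_supports_edit and image_endpoint are hypothetical, and the list would need updating whenever the table changes.

```shell
#!/usr/bin/env bash
# Sketch: look up Edit support from a hardcoded copy of the table above.
# Function names are illustrative and not part of the gateway.

# Returns 0 if the table lists "Edit" for the given model ID, 1 otherwise.
image_model_supports_edit() {
  case "$1" in
    azure/dall-e-3|openai/dall-e-2|openai/gpt-image-1) return 0 ;;
    bytedance/seededit-3-0-i2i-250628|bytedance/seedream-4-0-250828) return 0 ;;
    leonardoai/leonard-diffusion-xl|leonardoai/leonard-kino-xl) return 0 ;;
    leonardoai/leonard-lightning-xl|leonardoai/leonard-vision-xl) return 0 ;;
    *) return 1 ;;
  esac
}

# Print the proxy URL for the requested operation, refusing "edit" for
# models that do not list Edit support.
image_endpoint() {  # usage: image_endpoint <model> <generate|edit>
  if [ "$2" = "edit" ]; then
    if ! image_model_supports_edit "$1"; then
      printf '%s does not list Edit support\n' "$1" >&2
      return 1
    fi
    printf 'https://api.orq.ai/v2/proxy/images/edits\n'
  else
    printf 'https://api.orq.ai/v2/proxy/images/generations\n'
  fi
}
```

For example, image_endpoint openai/gpt-image-1 edit prints the edits URL, while image_endpoint fal/flux/dev edit fails with a message on stderr.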

Moderation models

curl https://api.orq.ai/v2/proxy/moderations \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -d '{
    "model": "mistral/mistral-moderation-2411",
    "input": "I want to check if this text is appropriate."
  }'

Provider     Model
Mistral      mistral/mistral-moderation-2411

Rerank models

curl https://api.orq.ai/v2/proxy/rerank \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -d '{
    "model": "cohere/rerank-english-v3.0",
    "query": "What is machine learning?",
    "documents": [
      "Machine learning is a branch of AI",
      "Machine learning uses data to improve",
      "AI is changing the world"
    ]
  }'

Provider     Model
Cohere       cohere/rerank-english-v3.0
Cohere       cohere/rerank-multilingual-v3.0
Cohere       cohere/rerank-v3.5
Jina AI      jina/jina-colbert-v2
Jina AI      jina/jina-reranker-v1-base-en
Jina AI      jina/jina-reranker-v1-tiny-en
Jina AI      jina/jina-reranker-v1-turbo-en
Jina AI      jina/jina-reranker-v2-base-multilingual

Speech-to-Text models

curl https://api.orq.ai/v2/proxy/audio/transcriptions \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -F file="@/path/to/audio.mp3" \
  -F model="openai/whisper-1"

Provider     Model
Azure        azure/whisper
Eleven Labs  elevenlabs/scribe_v1
Groq         groq/whisper-large-v3
Groq         groq/whisper-large-v3-turbo
Mistral      mistral/voxtral-mini-2507
OpenAI       openai/gpt-4o-mini-transcribe
OpenAI       openai/gpt-4o-transcribe
OpenAI       openai/whisper-1

Text-to-Speech models

curl https://api.orq.ai/v2/proxy/audio/speech \
  -H "Authorization: Bearer $ORQ_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "openai/tts-1",
    "input": "Hello world!",
    "voice": "alloy"
  }' --output speech.mp3

Provider     Model
Eleven Labs  elevenlabs/eleven_flash_v2
Eleven Labs  elevenlabs/eleven_flash_v2_5
Eleven Labs  elevenlabs/eleven_multilingual_v2
Eleven Labs  elevenlabs/eleven_turbo_v2_5
Vertex AI    google/gemini-2.5-flash-preview-tts
Vertex AI    google/gemini-2.5-pro-preview-tts
OpenAI       openai/gpt-4o-mini-tts
OpenAI       openai/tts-1
OpenAI       openai/tts-1-hd

Text-to-Speech Voices

The following voices are available for Text-to-Speech models:

OpenAI

  • alloy: Neutral, versatile voice
  • echo: Neutral, soft-spoken voice
  • fable: Expressive, narrative-focused voice
  • onyx: Deep, authoritative voice
  • nova: Warm, natural voice
  • shimmer: Clear, optimistic voice

ElevenLabs

  • aria: Neutral, versatile voice
  • roger: Deep, authoritative voice
  • sarah: Warm, friendly voice
  • laura: Soft, gentle voice
  • charlie: Casual, conversational voice
  • george: Professional, articulate voice
  • callum: Youthful, energetic voice
  • river: Calm, soothing voice
  • liam: Clear, confident voice
  • charlotte: Elegant, refined voice
  • alice: Bright, cheerful voice
  • matilda: Thoughtful, measured voice
  • will: Reliable, trustworthy voice
  • jessica: Engaging, expressive voice
  • eric: Authoritative, commanding voice
  • chris: Friendly, approachable voice
  • brian: Mature, distinguished voice
  • daniel: Versatile, balanced voice
  • lily: Sweet, melodious voice
  • bill: Grounded, authentic voice
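
To compare the voices listed above, one option is to render the same sentence once per voice. The sketch below covers the OpenAI voices only and reuses the request shape from the Text-to-Speech example; the names OPENAI_VOICES, is_openai_voice and speak are hypothetical helpers, not part of the gateway.

```shell
#!/usr/bin/env bash
# Sketch: validate a voice name and render a sample with openai/tts-1.
# Helper names are illustrative; the voice list mirrors the OpenAI list above.
OPENAI_VOICES="alloy echo fable onyx nova shimmer"

# Returns 0 if the argument is one of the OpenAI voices listed above.
is_openai_voice() {
  local v
  for v in $OPENAI_VOICES; do
    [ "$v" = "$1" ] && return 0
  done
  return 1
}

# Request speech for a fixed sentence and write the audio to a file.
speak() {  # usage: speak <voice> <output-file>
  if ! is_openai_voice "$1"; then
    printf 'unknown OpenAI voice: %s\n' "$1" >&2
    return 1
  fi
  curl -sS https://api.orq.ai/v2/proxy/audio/speech \
    -H "Authorization: Bearer $ORQ_API_KEY" \
    -H "Content-Type: application/json" \
    -d "{\"model\": \"openai/tts-1\", \"input\": \"Hello world!\", \"voice\": \"$1\"}" \
    --output "$2"
}

# Example (makes real API calls, so it is left commented out):
# for v in $OPENAI_VOICES; do speak "$v" "speech-$v.mp3"; done
```

Rejecting unknown names locally avoids spending a request on a typo; the same pattern would apply to the ElevenLabs voices with a different list and model.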
