Skip to main content
Not every request needs your most expensive model. The Auto Router automatically routes each request to the optimal model based on your optimization strategy, so you can reduce costs without sacrificing the quality that matters.

How It Works

The Auto Router sits between your application and your model pool. When a request comes in, it analyzes the complexity of the task and routes it to the most appropriate model based on your chosen optimization strategy.

Optimize for cost

Set a high-quality model as your baseline, and the Auto Router will route simpler requests to cheaper models while escalating complex ones. You save on the requests that don’t need your most powerful model.

Optimize for quality

Start with a cost-efficient model and let the Auto Router escalate to more capable models only when the task demands it. Get the best output for every request without overspending.

Configure your model pool

Pick which models the router can choose from, mixing expensive and affordable options. The router learns which requests need which level of capability. To set up the Auto Router:
  1. Navigate to the AI Router section in your workspace.
  2. Select Auto Router to configure your routing strategy.
  3. Choose your optimization goal: cost or quality.
  4. Select the models you want to include in your routing pool.
The Auto Router is available for all providers supported in the AI Router. You can combine models from different providers in a single routing pool.