added

OpenAI's new o3, o4-mini, GPT-4.1 models

OpenAI's latest models can be found in the model garden.

When to use what model

  • o-series models are purpose-built for reasoning and tool-use. Choose o3 when raw capability beats everything else; use o4-mini when you still need strong reasoning but want 10× cheaper tokens and snappier replies.
  • GPT-4.1 family is the evolution of the “generalist” GPT line. The full model gives you 1 M-token context with strong instruction following; mini trims cost/latency for mainstream workloads; nano pushes pricing and speed to the floor for simple or massive-scale tasks.
FeatureGPT-4.1GPT-4.1 miniGPT-4.1 nanoo3o4 mini
Training cutoff dateMay 2024May 2024May 2024May 2024May 2024
Context window1M1M1M200k200k
Max output tokens32.76832.76832.768100k100k
Input price$2/M Tokens$0.40/M Tokens$0.10/M Tokens$10/M tokens$1.10/M tokens
Output price$8/M Tokens$1.60/M Tokens$0.40/M Tokens$40/M tokens$4.40/M tokens
LatencyModerateFastVery fastSlowModerate
When to use?Long-context chat, knowledge-work, multimodal apps needing top GPT qualityEveryday product features, prototyping & chat where speed and price countReal-time, latency-critical or large-scale batch jobs on a tight budgetDeep research, complex multi-step reasoning, high-stakes coding/science tasksReasoning workloads where cost & throughput matter; API agents, math & data-science