added

OpenAI's new o3, o4-mini, and GPT-4.1 models

Orq.ai now supports the latest OpenAI models: o3, o4-mini, GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano.

o3

OpenAI's most advanced reasoning model, o3 excels at complex tasks across math, science, and coding. It integrates visual inputs directly into its reasoning process, enabling capabilities such as interpreting sketches or whiteboards. o3 also supports comprehensive tool use, including web browsing, Python execution, and file analysis.

o4-mini

A compact reasoning model optimized for speed and cost-efficiency, o4-mini delivers strong performance in math, coding, and visual tasks. It supports a 200,000-token context window and up to 100,000 output tokens, making it suitable for high-throughput applications.  

GPT-4.1 Series

  • GPT-4.1: Enhanced coding capabilities, improved instruction following, and support for up to 1 million tokens of context. It outperforms previous models on coding benchmarks and is optimized for real-world applications.
  • GPT-4.1 mini: Balances performance and efficiency, inheriting core strengths of the GPT-4.1 series, including enhanced coding and long-context understanding.
  • GPT-4.1 nano: The fastest and most cost-effective model in the GPT-4.1 series, ideal for tasks requiring low latency, such as classification or autocompletion. It maintains a 1 million-token context window.     

| Category | Model | Key Strengths | Context Window | Ideal Use Cases |
| --- | --- | --- | --- | --- |
| Advanced Reasoning | o3 | Complex problem-solving, math, coding, visual reasoning | 200K tokens | Complex tasks (math, coding, visual tasks, multi-step analysis) |
| Advanced Reasoning | o4-mini | Efficient reasoning, optimized for math and coding, cost-effective | 200K tokens | Scalable complex tasks, code generation, high-throughput reasoning |
| Standard Efficiency | GPT-4.1 | Enhanced coding, improved instruction-following, large context understanding | 1M tokens | Software development, document summarization, general applications |
| Standard Efficiency | GPT-4.1 mini | Balance of efficiency and performance, strong general-purpose capabilities | 1M tokens | Moderate complexity tasks, general coding, efficient deployments |
| Standard Efficiency | GPT-4.1 nano | Fast, cost-effective, optimized for low-latency tasks | 1M tokens | Real-time applications, classification, autocompletion, scaling |
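
As a quick illustration of the model IDs above (not Orq.ai-specific code), here is a minimal sketch that calls one of them with OpenAI's official Python SDK. The prompt, the output-token cap, and the choice of o4-mini are assumptions made for the example; within Orq.ai you would instead pick the same model from the model garden.

```python
# Minimal sketch: invoking one of the new model IDs via OpenAI's Python SDK.
# Assumes `pip install openai` and an OPENAI_API_KEY in the environment.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="o4-mini",  # or "o3", "gpt-4.1", "gpt-4.1-mini", "gpt-4.1-nano"
    messages=[
        {"role": "user", "content": "Summarize the trade-offs between o3 and o4-mini."}
    ],
    # Illustrative cap; o4-mini supports up to 100,000 output tokens.
    max_completion_tokens=1000,
)

print(response.choices[0].message.content)
```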

Find them in the model garden