GPT 5.4
GPT 5.4 is the standard tier of the GPT-5.4 model family, extending the agentic and reasoning capabilities of GPT-5.3 Codex to all domains including knowledge work, multi-step workflows, and analysis.
import { streamText } from 'ai'
const result = streamText({ model: 'openai/gpt-5.4', prompt: 'Why is the sky blue?'})What To Consider When Choosing a Provider
- Configuration: GPT 5.4 brings the agentic and reasoning leaps from GPT-5.3 Codex into all domains, not just coding.
- Configuration: It's faster and more token-efficient than GPT-5.2, meaning lower costs and shorter latencies on comparable tasks.
- Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
When to Use GPT 5.4
Best For
- Complex multi-step workflows: Tasks involving tools, research, and pulling from multiple sources
- Knowledge work: Reports, spreadsheets, presentations, and analysis across business domains
- Advanced code generation: Strong coding performance with GPT-5.4 generation reasoning improvements
- Agentic applications: Autonomous agents that coordinate tools, research, and multi-source workflows
- General-purpose AI: New projects that benefit from GPT-5.4 generation capability
Consider Alternatives When
- Cost optimization: GPT-5.4 mini for production workloads where cost efficiency matters
- High-volume lightweight tasks: GPT-5.4 nano for classification, routing, and sub-agent workflows
- Extended reasoning: GPT-5.4 pro for maximum performance on the most complex tasks
- Pure chain-of-thought: O3 for mathematical and scientific reasoning tasks
Conclusion
GPT 5.4 brings agentic reasoning to all domains with improved speed and efficiency over GPT-5.2, available through AI Gateway. It is the standard tier of the GPT-5.4 family.
Frequently Asked Questions
How does GPT 5.4 improve over GPT-5.2?
It extends the agentic and reasoning leaps from GPT-5.3 Codex to all domains. It's also faster and more token-efficient, reducing cost per task.
What context window does GPT 5.4 support?
1.1M tokens, supporting extensive document and codebase processing.
What types of workflows does GPT 5.4 handle well?
Complex multi-step workflows involving tools, research, and pulling from multiple sources. It also handles knowledge work like reports, presentations, and analysis.
Does GPT 5.4 support function calling and structured outputs?
Yes. It supports the full API feature set including function calling, structured outputs, vision, and system messages.
Should I migrate from GPT-5.2?
If your application benefits from improved agentic reasoning and you want better speed and token efficiency, yes. Test with your specific workloads to confirm the improvement.
How does AI Gateway handle authentication for GPT 5.4?
AI Gateway accepts a single API key or OIDC token for all requests. You don't embed OpenAI credentials in your application; AI Gateway routes and authenticates on your behalf.
What are typical latency characteristics?
This page shows live throughput and time-to-first-token metrics measured across real AI Gateway traffic.