Skip to content

GPT 5.4

GPT 5.4 is the standard tier of the GPT-5.4 model family, extending the agentic and reasoning capabilities of GPT-5.3 Codex to all domains including knowledge work, multi-step workflows, and analysis.

ReasoningTool UseVision (Image)File InputImplicit CachingWeb Search
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/gpt-5.4',
prompt: 'Why is the sky blue?'
})

What To Consider When Choosing a Provider

  • Configuration: GPT 5.4 brings the agentic and reasoning leaps from GPT-5.3 Codex into all domains, not just coding.
  • Configuration: It's faster and more token-efficient than GPT-5.2, meaning lower costs and shorter latencies on comparable tasks.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use GPT 5.4

Best For

  • Complex multi-step workflows: Tasks involving tools, research, and pulling from multiple sources
  • Knowledge work: Reports, spreadsheets, presentations, and analysis across business domains
  • Advanced code generation: Strong coding performance with GPT-5.4 generation reasoning improvements
  • Agentic applications: Autonomous agents that coordinate tools, research, and multi-source workflows
  • General-purpose AI: New projects that benefit from GPT-5.4 generation capability

Consider Alternatives When

  • Cost optimization: GPT-5.4 mini for production workloads where cost efficiency matters
  • High-volume lightweight tasks: GPT-5.4 nano for classification, routing, and sub-agent workflows
  • Extended reasoning: GPT-5.4 pro for maximum performance on the most complex tasks
  • Pure chain-of-thought: O3 for mathematical and scientific reasoning tasks

Conclusion

GPT 5.4 brings agentic reasoning to all domains with improved speed and efficiency over GPT-5.2, available through AI Gateway. It is the standard tier of the GPT-5.4 family.

Frequently Asked Questions

  • How does GPT 5.4 improve over GPT-5.2?

    It extends the agentic and reasoning leaps from GPT-5.3 Codex to all domains. It's also faster and more token-efficient, reducing cost per task.

  • What context window does GPT 5.4 support?

    1.1M tokens, supporting extensive document and codebase processing.

  • What types of workflows does GPT 5.4 handle well?

    Complex multi-step workflows involving tools, research, and pulling from multiple sources. It also handles knowledge work like reports, presentations, and analysis.

  • Does GPT 5.4 support function calling and structured outputs?

    Yes. It supports the full API feature set including function calling, structured outputs, vision, and system messages.

  • Should I migrate from GPT-5.2?

    If your application benefits from improved agentic reasoning and you want better speed and token efficiency, yes. Test with your specific workloads to confirm the improvement.

  • How does AI Gateway handle authentication for GPT 5.4?

    AI Gateway accepts a single API key or OIDC token for all requests. You don't embed OpenAI credentials in your application; AI Gateway routes and authenticates on your behalf.

  • What are typical latency characteristics?

    This page shows live throughput and time-to-first-token metrics measured across real AI Gateway traffic.