Skip to content
Dashboard

GPT-4.1 mini

GPT-4.1 mini delivers GPT-4o-class intelligence at reduced cost with nearly half the latency, making it a cost-performance option in the GPT-4.1 family for high-volume production workloads.

File InputImplicit CachingTool UseVision (Image)Web Search
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/gpt-4.1-mini',
prompt: 'Why is the sky blue?'
})

More models by OpenAI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
1.4s
46tps
$5.00/M
$30.00/M
Read:
$0.5/M
Write:
$10.00/K
+ input costs
+4
azure logo
bedrock logo
openai logo
04/24/2026
400K
1.2s
271tps
$0.75/M$4.50/M
Read:$0.07/M
Write:
$10.00/K
+ input costs
+4
azure logo
openai logo
03/17/2026
400K
0.5s
153tps
$0.20/M$1.25/M
Read:$0.02/M
Write:
$10.00/K
+ input costs
+4
azure logo
openai logo
03/17/2026
1.1M
1.7s
94tps
$2.50/M
$15.00/M
Read:
$0.25/M
Write:
$10.00/K
+ input costs
+4
azure logo
openai logo
03/05/2026
400K
3.0s
158tps
$0.25/M$2.00/M
Read:$0.03/M
Write:
$14/K
+ input costs
+4
azure logo
openai logo
08/07/2025
128K
0.5s
87tps
$0.15/M$0.60/M
Read:$0.07/M
Write:
$14/K
+ input costs
+3
azure logo
openai logo
07/18/2024