Skip to content
Dashboard

gpt-realtime-2

GPT Realtime 2 is our most capable realtime voice model. It supports speech-to-speech interactions with configurable reasoning effort, stronger instruction following, and more reliable tool use for complex voice-agent workflows.

index.ts
import { gateway } from '@ai-sdk/gateway';
export async function POST() {
const { token, url } = await gateway.experimental_realtime.getToken({
model: 'openai/gpt-realtime-2',
});
return Response.json({ token, url, tools: [] });
}

More models by OpenAI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
1.1s
60tps
$5.00/M
$30.00/M
Read:
$0.5/M
Write:
$10.00/K
+ input costs
+4
azure logo
bedrock logo
openai logo
04/24/2026
400K
1.3s
162tps
$0.75/M$4.50/M
Read:$0.07/M
Write:
$10.00/K
+ input costs
+4
azure logo
openai logo
03/17/2026
400K
0.5s
55tps
$0.20/M$1.25/M
Read:$0.02/M
Write:
$10.00/K
+ input costs
+4
azure logo
openai logo
03/17/2026
1.1M
2.1s
91tps
$2.50/M
$15.00/M
Read:
$0.25/M
Write:
$10.00/K
+ input costs
+4
azure logo
openai logo
03/05/2026
400K
3.6s
159tps
$0.25/M$2.00/M
Read:$0.03/M
Write:
$14/K
+ input costs
+4
azure logo
openai logo
08/07/2025
1M
0.6s
97tps
$0.40/M$1.60/M
Read:$0.1/M
Write:
$14/K
+ input costs
+3
azure logo
openai logo
05/14/2025