What does 'non-reasoning' mean for Grok 4 Fast Non-Reasoning?

The model produces direct answers without generating chain-of-thought reasoning traces. This reduces latency and output token consumption compared to the reasoning variant.

How does Grok 4 Fast Non-Reasoning differ from Grok 4 Fast Reasoning?

Both share the same Grok 4 Fast foundation. The reasoning variant generates chain-of-thought traces for analytical tasks, while Grok 4 Fast Non-Reasoning produces direct responses optimized for speed.

What does Grok 4 Fast Non-Reasoning cost?

Rates are listed on this page. They reflect the prices of the providers that serve this model through AI Gateway and change when those providers update their pricing.

How do I authenticate with Grok 4 Fast Non-Reasoning through Vercel AI Gateway?

Use your Vercel AI Gateway API key with `xai/grok-4-fast-non-reasoning` as the model identifier. No separate xAI account is required for gateway-managed access.
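
As a rough sketch of what this looks like at the HTTP level, the helper below builds (but does not send) a chat request that carries the gateway key as a bearer token. The endpoint URL and the `buildGatewayRequest` helper are assumptions for illustration and do not come from this page; check the AI Gateway documentation for the exact values.

```typescript
// Assumed OpenAI-compatible endpoint; verify against the AI Gateway docs.
const GATEWAY_URL = 'https://ai-gateway.vercel.sh/v1/chat/completions';
const MODEL_ID = 'xai/grok-4-fast-non-reasoning';

interface GatewayRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Hypothetical helper: builds the request without sending it.
function buildGatewayRequest(apiKey: string, prompt: string): GatewayRequest {
  if (!apiKey) {
    throw new Error('Missing AI Gateway API key');
  }
  return {
    url: GATEWAY_URL,
    init: {
      method: 'POST',
      headers: {
        // The gateway key is sent as a bearer token.
        Authorization: `Bearer ${apiKey}`,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({
        model: MODEL_ID,
        messages: [{ role: 'user', content: prompt }],
      }),
    },
  };
}
```

To send the request, pass the result to `fetch(request.url, request.init)`, reading the key from an environment variable rather than hard-coding it.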

Can Grok 4 Fast Non-Reasoning call tools and functions?

Yes. Grok 4 Fast Non-Reasoning supports tool calling and function invocation, making it suitable for agentic workflows that need fast decision-making.
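
To make the tool-calling flow concrete, here is a minimal sketch of the application side: a tool described in the JSON-schema style common to OpenAI-compatible chat APIs, plus a dispatcher that runs whichever call the model returns. The `get_weather` tool, the `ToolCall` shape, and the canned result are hypothetical; this page does not specify the exact wire format.

```typescript
// Hypothetical tool definition in the common JSON-schema "function" style.
const tools = [
  {
    type: 'function',
    function: {
      name: 'get_weather',
      description: 'Look up the current weather for a city',
      parameters: {
        type: 'object',
        properties: { city: { type: 'string' } },
        required: ['city'],
      },
    },
  },
];

// Assumed shape of a tool call returned by the model (arguments as JSON text).
interface ToolCall {
  name: string;
  arguments: string;
}

// Local implementations, keyed by tool name.
const handlers: Record<string, (args: Record<string, unknown>) => string> = {
  get_weather: (args) => `Sunny in ${String(args.city)}`, // canned result
};

// Runs one tool call and returns the text to send back to the model.
function dispatchToolCall(call: ToolCall): string {
  const handler = handlers[call.name];
  if (!handler) {
    throw new Error(`Unknown tool: ${call.name}`);
  }
  return handler(JSON.parse(call.arguments));
}
```

In an agent loop, the `tools` array would accompany each request, and each dispatcher result would go back to the model as a tool message before the next turn.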

Does Vercel AI Gateway support Zero Data Retention for Grok 4 Fast Non-Reasoning?

Zero Data Retention is not currently available for this model. ZDR on AI Gateway applies to direct gateway requests; BYOK flows aren't covered. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.

Grok 4 Fast Non-Reasoning

Grok 4 Fast Non-Reasoning is the speed-optimized, non-reasoning variant of xAI's Grok 4 Fast. It delivers fast inference without chain-of-thought overhead and targets high-throughput applications, with a context window of 2M tokens.

Tool Use, Implicit Caching, Tiered Cost, Vision (Image), File Input, Web Search

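import { streamText } from 'ai'

// Start a streaming completion against the gateway model identifier.
const result = streamText({
  model: 'xai/grok-4-fast-non-reasoning',
  prompt: 'Why is the sky blue?'
})

// Print each chunk of text as it arrives.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}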


Frequently Asked Questions

What is the context window?

The context window is 2M tokens.
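
For a quick pre-flight check against that limit, a sketch like the following can help. It assumes roughly 4 characters per token, which is only a rule of thumb for English text and not the model's real tokenizer, and it assumes that reserved output tokens count against the same window. The function names are hypothetical.

```typescript
const CONTEXT_WINDOW_TOKENS = 2_000_000; // 2M tokens

// Assumption: ~4 characters per token, a rough heuristic for English text.
const CHARS_PER_TOKEN = 4;

// Rough token estimate based on character count only.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// True if the estimated prompt size plus a reserved output budget fits in the window.
function fitsContextWindow(prompt: string, reservedOutputTokens = 0): boolean {
  return estimateTokens(prompt) + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS;
}
```

Treat the result as an estimate and leave headroom, since real token counts vary with language and content.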
