Seed 1.6
Seed 1.6 is a sparse Mixture-of-Experts (MoE) model with 23B active parameters out of 230B total, a context window of 256K tokens, and three reasoning modes including adaptive chain-of-thought (CoT) that calibrates thinking depth to question complexity.
```typescript
import { streamText } from 'ai'

const result = streamText({
  model: 'bytedance/seed-1.6',
  prompt: 'Why is the sky blue?',
})
```
What To Consider When Choosing a Provider
- Configuration: If your workload relies on extended thinking, confirm that your chosen provider exposes the full reasoning token budget without truncation. Compare per-token pricing across providers ($0.25 per million input tokens, $2 per million output tokens, where listed).
- Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
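At the rates quoted above ($0.25 per million input tokens, $2 per million output tokens), a back-of-envelope cost estimate per request is simple arithmetic. The sketch below assumes those listed rates; actual provider pricing may differ:

```typescript
// Estimate request cost from the listed per-million-token rates.
// These are the rates quoted above, not a pricing guarantee.
const INPUT_RATE_PER_M = 0.25 // USD per 1M input tokens
const OUTPUT_RATE_PER_M = 2.0 // USD per 1M output tokens

function requestCost(inputTokens: number, outputTokens: number): number {
  return (
    (inputTokens / 1_000_000) * INPUT_RATE_PER_M +
    (outputTokens / 1_000_000) * OUTPUT_RATE_PER_M
  )
}

// Example: a 10K-token prompt with a 2K-token response
console.log(requestCost(10_000, 2_000).toFixed(4)) // → "0.0065"
```

Note that thinking tokens from the reasoning modes are billed as output, so FullCoT requests can cost noticeably more than the visible answer length suggests.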
When to Use Seed 1.6
Best For
- Long-document analysis: The context window of 256K tokens lets you ingest entire contracts, research papers, or codebases in one prompt
- Mixed-difficulty workloads: AdaCoT calibrates compute spend automatically, routing complex queries to full reasoning and simple ones to direct response
- Multimodal pipelines: Integrated VLM capability combines text and visual data in the same request
- GUI-based interaction: Tasks that require understanding screenshots or interface layouts alongside natural language instructions
- Competitive academic domains: Mathematics, science, and humanities workloads where benchmark results confirm consistent performance
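For long-document work, it helps to check whether a document plausibly fits the 256K-token window before sending it. The sketch below uses the common ~4-characters-per-token heuristic, which is an approximation, not Seed 1.6's actual tokenizer:

```typescript
const CONTEXT_WINDOW = 256_000 // tokens
const CHARS_PER_TOKEN = 4      // rough heuristic, not the real tokenizer

// Reserve room for the response (the model's output cap is 32K tokens)
// and check the estimated prompt size against the window.
function fitsInContext(text: string, reservedForOutput = 32_000): boolean {
  const estimatedTokens = Math.ceil(text.length / CHARS_PER_TOKEN)
  return estimatedTokens + reservedForOutput <= CONTEXT_WINDOW
}

// A long contract (~400K characters ≈ 100K tokens) fits comfortably
console.log(fitsInContext('x'.repeat(400_000))) // → true
```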
Consider Alternatives When
- Pure text generation: A dense model may offer more predictable routing behavior when no visual input is involved
- Output length limits: Requests needing more than 32K tokens in a single response exceed the model's maximum output length
- Deterministic latency: Pipelines can't tolerate variable thinking token overhead from FullCoT or AdaCoT
- Cost-sensitive workloads: A smaller distilled model may meet your quality bar at lower cost per token
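One practical workaround for the 32K-token output cap is to split large generation tasks into multiple requests. The batch planner below is an illustrative sketch: the per-item token estimates are caller-supplied, and no API call is made here.

```typescript
const MAX_OUTPUT_TOKENS = 32_000

// Group work items into batches whose combined expected output stays
// under the per-response cap. An oversized single item still gets its
// own batch; handling that case upstream is left to the caller.
function planBatches(itemTokenEstimates: number[]): number[][] {
  const batches: number[][] = []
  let current: number[] = []
  let currentTotal = 0
  for (const tokens of itemTokenEstimates) {
    if (currentTotal + tokens > MAX_OUTPUT_TOKENS && current.length > 0) {
      batches.push(current)
      current = []
      currentTotal = 0
    }
    current.push(tokens)
    currentTotal += tokens
  }
  if (current.length > 0) batches.push(current)
  return batches
}

// Three sections estimated at 20K, 20K, and 5K output tokens → 2 requests
console.log(planBatches([20_000, 20_000, 5_000]).length) // → 2
```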
Conclusion
Seed 1.6 combines a context window of 256K tokens, sparse MoE efficiency, and three reasoning modes in one deployment. If you previously needed separate fast and slow models, you can consolidate onto a single endpoint and let AdaCoT manage the tradeoff at inference time.
Frequently Asked Questions
What are the three reasoning modes in Seed 1.6 and when should I use each?
The three modes are FullCoT, NoCoT, and AdaCoT. FullCoT generates an extended chain-of-thought trace before answering, best for complex multi-step problems. NoCoT skips the reasoning trace for direct, low-latency responses. AdaCoT selects between the two based on estimated question difficulty, making it the practical default for mixed workloads.
Does Seed 1.6 support image inputs?
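Conceptually, AdaCoT behaves like a difficulty-based router between the other two modes. The toy heuristic below only illustrates that idea; the real selection is learned inside the model, and the mode labels here are just strings, not API parameters:

```typescript
type ReasoningMode = 'FullCoT' | 'NoCoT'

// Toy stand-in for AdaCoT's learned difficulty estimate: route prompts
// that look multi-step to FullCoT and everything else to NoCoT.
function routeByDifficulty(prompt: string): ReasoningMode {
  const looksHard = /prove|derive|step[- ]by[- ]step|optimi[sz]e/i.test(prompt)
  return looksHard || prompt.length > 500 ? 'FullCoT' : 'NoCoT'
}

console.log(routeByDifficulty('What is the capital of France?'))         // → "NoCoT"
console.log(routeByDifficulty('Prove the triangle inequality step by step')) // → "FullCoT"
```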
Yes. Seed 1.6 incorporates Vision-Language Model (VLM) capabilities. You can process both text and visual data in the same request.
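A mixed text-and-image request in the AI SDK's message shape looks like the sketch below. The message is only constructed here, not sent; whether the Gateway route for this model accepts image parts exactly this way is an assumption to verify against the documentation, and the URL is a placeholder:

```typescript
// Build (but don't send) an AI SDK-style multimodal user message that
// pairs an instruction with a screenshot URL. The image URL is a
// placeholder, and acceptance of image parts by this model's Gateway
// route is an assumption to confirm in the docs.
const messages = [
  {
    role: 'user' as const,
    content: [
      { type: 'text' as const, text: 'Describe the chart in this screenshot.' },
      { type: 'image' as const, image: 'https://example.com/screenshot.png' },
    ],
  },
]

console.log(messages[0].content.length) // → 2
```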
What is parallel decoding in the context of Seed 1.6?
It's a training-free inference enhancement that generates additional thinking tokens without changing the base model weights. It deepens reasoning capacity at inference time, yielding an eight-point improvement on the BeyondAIME benchmark.
How does the sparse MoE architecture affect cost?
Only 23B of the 230B total parameters activate per forward pass. Inference compute stays closer to a 23B dense model than a 230B one. This lowers per-token cost while retaining the representational capacity of the full parameter count.
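The compute saving follows directly from the activation ratio: only 23B of 230B parameters run per token, so per-token FLOPs scale with the active count. A back-of-envelope comparison, using the common rule of thumb of roughly 2 FLOPs per active parameter per generated token (an approximation, not a measured figure):

```typescript
const TOTAL_PARAMS = 230e9  // full parameter count
const ACTIVE_PARAMS = 23e9  // parameters active per forward pass

// Rule of thumb: ~2 FLOPs per active parameter per generated token.
const flopsPerToken = (activeParams: number) => 2 * activeParams

const moeFlops = flopsPerToken(ACTIVE_PARAMS)  // sparse MoE path
const denseFlops = flopsPerToken(TOTAL_PARAMS) // hypothetical dense 230B model

console.log(moeFlops / denseFlops) // → 0.1 (≈10% of the dense compute)
```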
What academic benchmarks were used to evaluate Seed 1.6?
ByteDance evaluated Seed 1.6 on China's 2025 Gaokao (683/750 in humanities, ranked first; 648/750 in science, ranked second, rising to 676/750 with higher-resolution images) and India's JEE Advanced entrance exam (top-10 placement, 100% math accuracy across five sampling rounds). See https://console.byteplus.com/ark/region:ark+ap-southeast-1/model/detail?Id=seed-1-6 for the full tables.
Is Seed 1.6 available for commercial use through AI Gateway?
Yes. You can access Seed 1.6 from ByteDance via AI Gateway with an API key or OIDC token. You don't manage upstream provider credentials yourself.