Kimi K2.5

Kimi K2.5 is Moonshot AI's successor to the K2 family. It offers multimodal inputs, upgraded frontend coding, and a context window of up to 262.1K tokens, and it is available through AI Gateway via moonshotai, fireworks, novita, togetherai, and bedrock.

Reasoning, Vision (Image), Tool Use, Implicit Caching

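import { streamText } from 'ai'

// A plain model string is resolved through AI Gateway.
const result = streamText({
  model: 'moonshotai/kimi-k2.5',
  prompt: 'Why is the sky blue?'
})

// Print the response as it streams in.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}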


Playground

Try out Kimi K2.5 by Moonshot AI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

About Kimi K2.5

Kimi K2.5, released on January 26, 2026, is the generation after the K2 line. Moonshot AI's release materials position K2.5 across agent tasks, coding, visual understanding, and general intelligence benchmarks. K2.5 handles both text-based and visual tasks.

Frontend code generation is a highlighted change. Moonshot AI documents more capable frontend coding that goes beyond syntactically correct markup to interactive UI with dynamic layouts and animations.

Access K2.5 through AI Gateway by setting the model string to moonshotai/kimi-k2.5. No extra provider accounts are required for gateway-managed access. AI Gateway's observability layer tracks token usage and costs across requests, which helps when usage patterns vary.

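Kimi K2.5 pricing through AI Gateway starts at $0.50 per million input tokens and $2.80 per million output tokens. Most providers listed below charge $0.60 per million input tokens and $3.00 per million output tokens.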
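To make the per-million-token rates concrete, here is a minimal sketch that estimates the cost of a single request. The rates are copied from this page ($0.50 and $2.80 at the lowest listed tier, $0.60 and $3.00 for most listed providers). The `Rates` type, the `estimateCostUSD` helper, and the token counts are hypothetical and exist only for illustration; cache reads and any other charges are ignored.

```typescript
// Per-million-token rates in USD, copied from this page's listing.
interface Rates {
  inputPerMillion: number
  outputPerMillion: number
}

const LOWEST_LISTED: Rates = { inputPerMillion: 0.5, outputPerMillion: 2.8 }
const COMMON_LISTED: Rates = { inputPerMillion: 0.6, outputPerMillion: 3.0 }

// Estimate the cost of one request from its token counts (hypothetical helper).
function estimateCostUSD(inputTokens: number, outputTokens: number, rates: Rates): number {
  return (
    (inputTokens / 1_000_000) * rates.inputPerMillion +
    (outputTokens / 1_000_000) * rates.outputPerMillion
  )
}

// Hypothetical workload: 200,000 input tokens and 50,000 output tokens.
const low = estimateCostUSD(200_000, 50_000, LOWEST_LISTED)
const common = estimateCostUSD(200_000, 50_000, COMMON_LISTED)
console.log(low.toFixed(2), common.toFixed(2))
```

Actual billing depends on the provider that serves the request, so treat the result as a rough planning figure rather than an invoice.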

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
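As a rough sketch of how a provider preference might be expressed in code, the helper below builds an options fragment from the provider slugs listed on this page. It assumes the gateway reads an ordered list from `providerOptions.gateway.order`; that key and the `preferProviders` helper are assumptions for illustration, so confirm the exact option name in the AI Gateway docs before relying on it.

```typescript
// Provider slugs as they appear on this page.
const LISTED_PROVIDERS = ['moonshotai', 'fireworks', 'novita', 'togetherai', 'bedrock'] as const
type ProviderSlug = (typeof LISTED_PROVIDERS)[number]

// Build the options fragment that expresses a provider preference.
// ASSUMPTION: the gateway reads `providerOptions.gateway.order`; verify in the docs.
function preferProviders(order: ProviderSlug[]) {
  // Drop repeated slugs while keeping the first-seen order.
  const unique = Array.from(new Set(order))
  if (unique.length === 0) {
    throw new Error('order must name at least one provider')
  }
  return { providerOptions: { gateway: { order: unique } } }
}

const options = preferProviders(['fireworks', 'novita', 'fireworks'])
console.log(JSON.stringify(options))

// Intended use (not executed here):
// streamText({ model: 'moonshotai/kimi-k2.5', prompt: '...', ...options })
```

Keeping the preference in one helper makes it easy to change the order later without touching each call site.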

Provider	Context	Latency	Throughput	Input	Output	Cache Read	Cache Write	Release Date
Legal: Terms • Privacy	262K	0.8s	55tps	$0.60/M	$3.00/M	$0.1/M	—	01/26/2026
Legal: Terms • Privacy	256K	3.6s	55tps	$0.60/M	$3.00/M	$0.1/M	—	01/26/2026
Legal: Terms • Privacy	262K	7.7s	47tps	$0.60/M	$3.00/M	$0.1/M	—	01/26/2026
Legal: Terms • Privacy	256K	1.3s	9tps	$0.50/M	$2.80/M	—	—	01/26/2026
Legal: Terms • Privacy	256K	0.3s	62tps	$0.60/M	$3.00/M	$0.1/M	—	01/26/2026

More models by Moonshot AI

Context	Latency	Throughput	Input	Output	Cache Read	Cache Write	Release Date
262K	0.8s	104tps	$0.95/M	$4.00/M	$0.16/M	—	04/20/2026
262K	0.7s	35tps	$0.60/M	$2.50/M	$0.15/M	—	11/06/2025
262K	0.6s	108tps	$1.15/M	$8.00/M	$0.15/M	—	11/06/2025
131K	0.8s	39tps	$0.57/M	$2.30/M	—	—	09/05/2025
256K	0.7s	55tps	$1.15/M	$8.00/M	$0.15/M	—	09/05/2025

What To Consider When Choosing a Provider

Configuration: Evaluate Kimi K2.5 against your specific use case. The expanded capabilities may not justify the cost relative to K2 variants for every workload.
Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
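The authentication point above can be sketched as a request that carries only a gateway credential and no provider keys. This example builds the request without sending it. The base URL, the OpenAI-style body shape, and the `buildChatRequest` helper are assumptions for illustration and should be checked against the AI Gateway documentation.

```typescript
// ASSUMPTION: illustrative base URL; confirm it in the AI Gateway documentation.
const GATEWAY_BASE_URL = 'https://ai-gateway.vercel.sh/v1'

interface GatewayRequest {
  url: string
  init: { method: 'POST'; headers: Record<string, string>; body: string }
}

// Build (but do not send) a chat request authenticated with a gateway credential.
// The credential is an API key or OIDC token; no provider credentials are involved.
function buildChatRequest(credential: string, prompt: string): GatewayRequest {
  if (!credential) {
    throw new Error('missing API key or OIDC token')
  }
  return {
    url: `${GATEWAY_BASE_URL}/chat/completions`,
    init: {
      method: 'POST',
      headers: {
        Authorization: `Bearer ${credential}`,
        'Content-Type': 'application/json',
      },
      // ASSUMPTION: OpenAI-style chat body; the model slug comes from this page.
      body: JSON.stringify({
        model: 'moonshotai/kimi-k2.5',
        messages: [{ role: 'user', content: prompt }],
      }),
    },
  }
}

const request = buildChatRequest('example-key', 'Why is the sky blue?')
console.log(request.url, Object.keys(request.init.headers))

// To send it (not executed here): await fetch(request.url, request.init)
```

In practice the credential would come from an environment variable rather than a string literal.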

When to Use Kimi K2.5

Best For

Interactive frontend code: Generating UI with dynamic layouts, animations, and interactive components
Multi-capability pipelines: Workloads spanning agent tasks, coding, visual understanding, and general intelligence in one pipeline
Multimodal or frontend gaps: Kimi-family projects where earlier K2 variants lack the visual or frontend scope you need
General-purpose assistants: Teams building AI assistants that must handle diverse task types from a single model
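For the frontend and visual use cases in the list above, a request typically pairs an instruction with an image. The sketch below builds such a message using local types that mirror the text and image message parts of the AI SDK. The types, the `buildVisionMessage` helper, and the example URL are assumptions for illustration; check the AI SDK documentation and the provider's input limits before building on this shape.

```typescript
// Local types mirroring AI SDK text and image message parts.
// ASSUMPTION: this mirrors the SDK's shape; confirm against the AI SDK docs.
type TextPart = { type: 'text'; text: string }
type ImagePart = { type: 'image'; image: URL }
type UserMessage = { role: 'user'; content: Array<TextPart | ImagePart> }

// Combine an instruction and an image URL into one user message.
function buildVisionMessage(instruction: string, imageUrl: string): UserMessage {
  return {
    role: 'user',
    content: [
      { type: 'text', text: instruction },
      { type: 'image', image: new URL(imageUrl) }, // throws on a malformed URL
    ],
  }
}

const message = buildVisionMessage(
  'Recreate this layout as an interactive component with animations.',
  'https://example.com/mockup.png', // hypothetical image location
)
console.log(message.content.map((part) => part.type))

// Intended use (not executed here):
// streamText({ model: 'moonshotai/kimi-k2.5', messages: [message] })
```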

Consider Alternatives When

Extended reasoning traces: Kimi K2 Thinking is built for explicit chain-of-thought output
Sufficient K2 checkpoint: The September 2025 K2 checkpoint covers your workload and K2.5's added scope isn't needed
Speed-first workloads: Kimi K2 Turbo or K2 Thinking Turbo are better fits when you don't need K2.5's broader capabilities
Cost-sensitive deployments: K2 variants may meet your quality bar at lower cost per token
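The guidance in the two lists above can be sketched as a small routing function. Only the `moonshotai/kimi-k2.5` slug appears on this page; the other slugs, the `Workload` type, and the precedence order are hypothetical placeholders chosen for illustration, so substitute the identifiers and priorities that apply to your setup.

```typescript
// Workload traits drawn from the "Best For" and "Consider Alternatives When" lists.
interface Workload {
  needsVision: boolean
  needsFrontendCode: boolean
  needsReasoningTrace: boolean
  speedFirst: boolean
}

// Only the K2.5 slug is taken from this page; the rest are HYPOTHETICAL placeholders.
const MODELS = {
  k25: 'moonshotai/kimi-k2.5',
  thinking: 'moonshotai/kimi-k2-thinking',
  turbo: 'moonshotai/kimi-k2-turbo',
  thinkingTurbo: 'moonshotai/kimi-k2-thinking-turbo',
} as const

function chooseModel(w: Workload): string {
  // K2.5's added scope is needed whenever vision or frontend work is involved.
  if (w.needsVision || w.needsFrontendCode) return MODELS.k25
  // Explicit chain-of-thought output points to a Thinking variant.
  if (w.needsReasoningTrace) return w.speedFirst ? MODELS.thinkingTurbo : MODELS.thinking
  // Speed-first text workloads without the broader scope point to Turbo.
  if (w.speedFirst) return MODELS.turbo
  // Otherwise default to the general-purpose model.
  return MODELS.k25
}

const choice = chooseModel({
  needsVision: false,
  needsFrontendCode: false,
  needsReasoningTrace: true,
  speedFirst: false,
})
console.log(choice)
```

The precedence here (scope first, then reasoning traces, then speed) is one reasonable reading of the lists, not a rule stated on this page. Cost is left out because it depends on the provider serving each model.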

Conclusion

Kimi K2.5 brings multimodal inputs and stronger frontend coding to the Kimi line on AI Gateway, covering agent, coding, and vision workloads in one model. As of January 26, 2026, it is the K2 successor listed on AI Gateway for those combined use cases.

Frequently Asked Questions

What makes Kimi K2.5 different from earlier K2 models?
It is the generation that follows K2. Moonshot AI's documentation highlights frontend coding and visual inputs, which earlier K2 releases did not emphasize.
Does Kimi K2.5 support visual or image inputs?
Yes. Moonshot AI's materials describe vision-style tasks. Confirm input modalities and limits on https://platform.moonshot.ai/docs/pricing/chat#product-pricing before you build a vision pipeline.
What kind of frontend code can Kimi K2.5 generate?
Moonshot AI documents interactive user interfaces with dynamic layouts and animations, not only static markup.
Is Kimi K2.5 open source?
Yes. Moonshot AI ships K2.5 as open source in the same lineage as other open-weight Kimi models.
When was Kimi K2.5 released on AI Gateway?
Kimi K2.5 became available through AI Gateway on January 26, 2026. Timing and scope are documented in the K2.5 on AI Gateway changelog post and on https://platform.moonshot.ai/docs/pricing/chat#product-pricing.
Should I use K2.5 or K2 Thinking for complex reasoning tasks?
Use K2 Thinking when you need extended chain-of-thought traces, such as math proofs or step-by-step algorithm design. K2.5 covers broad tasks including reasoning, but K2 Thinking is the better match when explicit deliberation is the main requirement.
How do I use Kimi K2.5 with the AI SDK?
Set the model to moonshotai/kimi-k2.5 in your AI SDK call. No other configuration changes are required.
