Kimi K2.5

Kimi K2.5 is Moonshot AI's successor to the K2 family. It adds multimodal inputs and upgraded frontend coding, with a context window of 262.1K tokens. It is available through AI Gateway via the moonshotai, fireworks, novita, togetherai, and bedrock providers.

Reasoning, Vision (Image), Tool Use, Implicit Caching
index.ts
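import { streamText } from 'ai'

// Stream a response from Kimi K2.5 through AI Gateway.
const result = streamText({
  model: 'moonshotai/kimi-k2.5',
  prompt: 'Why is the sky blue?'
})
// Print each text chunk as it arrives.
for await (const chunk of result.textStream) process.stdout.write(chunk)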

What To Consider When Choosing a Provider

  • Configuration: Evaluate Kimi K2.5 against your specific use case. The expanded capabilities may not justify the cost relative to K2 variants for every workload.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
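
Because the model is served by several providers, a request can state which ones it prefers. The following is a minimal sketch, not a definitive implementation. It assumes AI Gateway accepts a `providerOptions.gateway.order` array of provider slugs as a routing preference; that option is recalled from the AI Gateway provider-options documentation and is not stated on this page, so verify it before relying on it. The helper name `buildKimiRequest` is hypothetical.

```typescript
// Provider slugs listed on this page for Kimi K2.5.
const KIMI_K25_PROVIDERS = ['moonshotai', 'fireworks', 'novita', 'togetherai', 'bedrock'] as const
type KimiProvider = (typeof KIMI_K25_PROVIDERS)[number]

interface KimiRequest {
  model: string
  prompt: string
  providerOptions: { gateway: { order: KimiProvider[] } }
}

// Hypothetical helper: builds call options with a preferred provider order.
// ASSUMPTION: the gateway honors providerOptions.gateway.order.
function buildKimiRequest(prompt: string, preferred: KimiProvider[]): KimiRequest {
  // Drop duplicates while keeping the caller's order.
  const order = preferred.filter((p, i) => preferred.indexOf(p) === i)
  if (order.length === 0) {
    throw new Error('Provide at least one preferred provider')
  }
  return {
    model: 'moonshotai/kimi-k2.5',
    prompt,
    providerOptions: { gateway: { order } },
  }
}

const request = buildKimiRequest('Why is the sky blue?', ['fireworks', 'novita', 'fireworks'])
console.log(JSON.stringify(request.providerOptions))
// prints {"gateway":{"order":["fireworks","novita"]}}
```

The returned object has the same shape as the options passed to streamText, so it could be passed to that call directly if the assumption about `order` holds.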

When to Use Kimi K2.5

Best For

  • Interactive frontend code: Generating UI with dynamic layouts, animations, and interactive components
  • Multi-capability pipelines: Workloads spanning agent tasks, coding, visual understanding, and general intelligence in one pipeline
  • Multimodal or frontend gaps: Kimi-family projects where earlier K2 variants lack the visual or frontend scope you need
  • General-purpose assistants: Teams building AI assistants that must handle diverse task types from a single model

Consider Alternatives When

  • Extended reasoning traces: Kimi K2 Thinking is built for explicit chain-of-thought output
  • Sufficient K2 checkpoint: The September 2025 K2 checkpoint covers your workload and K2.5's added scope isn't needed
  • Speed-first workloads: Kimi K2 Turbo or K2 Thinking Turbo are better fits when you don't need K2.5's broader capabilities
  • Cost-sensitive deployments: K2 variants may meet your quality bar at lower cost per token
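
The guidance in the two lists above can be sketched as a small decision helper. This is an illustration only: the function name `suggestKimiModel`, the `WorkloadNeeds` fields, and the priority order (K2.5-only capabilities first, then reasoning, then speed) are assumptions made for the example, and the return values are the display names used on this page, not gateway model IDs.

```typescript
// Display names as used on this page; these are NOT gateway model IDs.
type KimiModel =
  | 'Kimi K2.5'
  | 'Kimi K2 Thinking'
  | 'Kimi K2 Turbo'
  | 'Kimi K2 Thinking Turbo'
  | 'Kimi K2 (September 2025 checkpoint)'

// Hypothetical description of what a workload needs.
interface WorkloadNeeds {
  vision?: boolean // image inputs
  frontendCoding?: boolean // interactive UI generation
  explicitReasoning?: boolean // extended chain-of-thought traces
  speedFirst?: boolean // latency matters more than breadth
}

// ASSUMPTION: needs that only K2.5 covers outrank every other preference.
function suggestKimiModel(needs: WorkloadNeeds): KimiModel {
  if (needs.vision || needs.frontendCoding) return 'Kimi K2.5'
  if (needs.explicitReasoning) {
    return needs.speedFirst ? 'Kimi K2 Thinking Turbo' : 'Kimi K2 Thinking'
  }
  if (needs.speedFirst) return 'Kimi K2 Turbo'
  // Nothing K2.5-specific is required, so the earlier checkpoint may suffice.
  return 'Kimi K2 (September 2025 checkpoint)'
}

console.log(suggestKimiModel({ explicitReasoning: true, speedFirst: true }))
// prints Kimi K2 Thinking Turbo
```

Cost is left out of the sketch on purpose, since this page gives no per-token prices to compare.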

Conclusion

Kimi K2.5 adds multimodal inputs and a frontend coding emphasis to the Kimi line on AI Gateway, covering agent, coding, and vision workloads from a single model. As of January 26, 2026, it is the K2 successor listed on AI Gateway for those combined use cases.

Frequently Asked Questions

  • What makes Kimi K2.5 different from earlier K2 models?

    It's the successor generation after K2. Moonshot AI's documentation highlights frontend coding and visual inputs for K2.5, which earlier K2-focused releases did not emphasize.

  • Does Kimi K2.5 support visual or image inputs?

    Yes. Moonshot AI's materials describe K2.5 as supporting vision-style tasks. Confirm input modalities and limits on https://platform.moonshot.ai/docs/pricing/chat#product-pricing before you build a vision pipeline.

  • What kind of frontend code can Kimi K2.5 generate?

    Moonshot AI's documentation describes K2.5 generating interactive user interfaces with dynamic layouts and animations, not only static markup.

  • Is Kimi K2.5 open source?

    Yes. Moonshot AI ships K2.5 as open source in the same lineage as other open-weight Kimi models.

  • When was Kimi K2.5 released on AI Gateway?

    Kimi K2.5 became available through AI Gateway on January 26, 2026. Timing and scope are documented in the K2.5 on AI Gateway changelog post and on https://platform.moonshot.ai/docs/pricing/chat#product-pricing.

  • Should I use K2.5 or K2 Thinking for complex reasoning tasks?

    Use K2 Thinking when you need extended chain-of-thought traces (math proofs, step-by-step algorithm design). K2.5 covers broad tasks including reasoning, but K2 Thinking is the better match when explicit deliberation is the main requirement.

  • How do I use Kimi K2.5 with the AI SDK?

    Set the model to moonshotai/kimi-k2.5 in your AI SDK call. No other configuration changes are required.
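
For the image-input case raised in the questions above, the request carries a user message with both a text part and an image part, rather than a plain prompt string. The sketch below builds such a message without importing the SDK. The local types mirror the AI SDK's text and image message parts as best recalled, the helper name `buildVisionRequest` is hypothetical, and the image URL is a placeholder. Whether a given provider accepts a particular image format or size is not covered here.

```typescript
// Minimal local types mirroring the AI SDK's text and image message parts.
type TextPart = { type: 'text'; text: string }
type ImagePart = { type: 'image'; image: string } // http(s) URL or base64 data
type UserMessage = { role: 'user'; content: Array<TextPart | ImagePart> }

// Hypothetical helper: builds call options for a question about one image.
function buildVisionRequest(question: string, imageUrl: string) {
  // Reject anything that is not an http(s) URL early.
  if (!/^https?:\/\//.test(imageUrl)) {
    throw new Error('imageUrl must start with http:// or https://')
  }
  const message: UserMessage = {
    role: 'user',
    content: [
      { type: 'text', text: question },
      { type: 'image', image: imageUrl },
    ],
  }
  return { model: 'moonshotai/kimi-k2.5', messages: [message] }
}

const visionRequest = buildVisionRequest(
  'Describe the layout in this screenshot.',
  'https://example.com/screenshot.png' // placeholder URL
)
console.log(visionRequest.messages[0].content.map((part) => part.type).join(', '))
// prints text, image
```

The returned object is meant to be passed to streamText or generateText in place of the prompt-based options shown in the index.ts example.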