MoonshotAI: Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in Kimi K2, it activates 32 billion parameters per forward pass and supports 256k-token context windows. The model is optimized for persistent step-by-step thought, dynamic tool invocation, and complex reasoning workflows that span hundreds of turns. It interleaves step-by-step reasoning with tool use, enabling autonomous research, coding, and writing that can persist for hundreds of sequential actions without drift. It sets new open-source benchmarks on HLE, BrowseComp, SWE-Multilingual, and LiveCodeBench, while maintaining stable multi-agent behavior through 200–300 tool calls. Built on a large-scale MoE architecture with MuonClip optimization, it combines strong reasoning depth with high inference efficiency for demanding agentic and analytical tasks. With a 262K token context window and 4K token max output, it's priced at $0.40/1.75 per 1M tokens (input/output) and offers functions, code, and streaming capabilities.
✅ Best For
🚀 Capabilities
❌ Limitations
Specifications
| Provider | moonshotai |
| Context Window | 262,144 tokens |
| Max Output | 4,096 tokens |
| Minimum Plan | Premium |
Pricing
| Input Price | $0.4000 / 1M tokens |
| Output Price | $1.7500 / 1M tokens |
💡 With PRO subscription, cost is reduced by 20%