Moonshot AI Unveils Kimi K2, an “Open Agentic Intelligence” Model Trained on 15.5 Trillion Tokens

Moonshot AI, a Chinese AI startup founded in 2023, has launched Kimi K2, a groundbreaking open-source language model designed to rival leading proprietary models like OpenAI’s GPT-4.1 and Anthropic’s Claude 4 Sonnet. With a massive one trillion total parameters and 32 billion activated parameters, Kimi K2 leverages a Mixture-of-Experts (MoE) architecture to deliver exceptional performance in coding, reasoning, and agentic tasks, setting a new benchmark for open-source AI.

Kimi K2, released on July 11, 2025, comes in two variants: Kimi-K2-Base, a foundational model for researchers and developers seeking customization, and Kimi-K2-Instruct, a post-trained version optimized for general-purpose chat and agentic workflows. Unlike traditional chatbots, Kimi K2 is engineered for “agentic intelligence,” enabling it to autonomously execute complex tasks, use tools, and orchestrate workflows with minimal human oversight. The model was pre-trained on 15.5 trillion tokens using the MuonClip optimizer, a custom solution that ensured zero training instability, a significant achievement for a model of this scale.

The model excels across multiple benchmarks, particularly in coding and reasoning. On SWE-bench Verified, a software engineering benchmark testing real-world code error patching, Kimi K2 scores 65.8% in agent mode, surpassing GPT-4.1’s 54.6% and trailing just behind Claude 4 Sonnet. It achieves 53.7% on LiveCodeBench, outperforming DeepSeek-V3 (46.9%) and GPT-4.1 (44.7%), and scores 97.4% on MATH-500, beating GPT-4.1’s 92.4%. Additionally, Kimi K2 supports a 128,000-token context window, enabling it to handle lengthy documents and complex multi-step tasks effectively.

Kimi K2’s agentic capabilities stem from its training on millions of synthetic tool-use scenarios and reinforcement learning, allowing it to perform tasks like analyzing salary data, generating interactive webpages, and executing shell commands. The model’s API, available at platform.moonshot.ai, is compatible with OpenAI and Anthropic standards, priced at $0.15 per million input tokens for cache hits and $2.50 per million output tokens—significantly lower than competitors. Developers can also access the model’s weights on Hugging Face for local deployment, though it requires substantial hardware, such as multiple GPUs or a strong cluster.

Moonshot AI’s decision to open-source Kimi K2, licensed under a Modified MIT License, has sparked enthusiasm in the AI community. Posts on X highlight its competitive edge, with some users noting it outperforms DeepSeek-V3 and rivals top proprietary models. However, running the full model locally remains impractical for most due to its size, positioning it as a “deploy your own” solution for enterprises and researchers.

Kimi K2’s release marks a pivotal moment in open-source AI, offering unmatched accessibility and performance. Moonshot AI invites developers to explore its capabilities via the API or Hugging Face, with comprehensive documentation available to support integration and deployment