MoonshotAI's Kimi k1.5: A Breakthrough in RL and LLMs
2025-01-21
MoonshotAI has unveiled Kimi k1.5, a new multi-modal large language model trained with reinforcement learning, achieving state-of-the-art results across various benchmarks. Key to Kimi k1.5's success is its 128k context window and improved policy optimization, enabling strong reasoning capabilities without complex techniques like Monte Carlo tree search. It outperforms GPT-4o and Claude Sonnet 3.5 on tests like AIME, MATH-500, and Codeforces, also showing significant improvements in short-context reasoning. Kimi k1.5 will soon be available at https://kimi.ai.
AI