DeepCoder-14B: Open-Source Code Reasoning Model Matches OpenAI's o3-mini
2025-04-09

Agentica and Together AI have released DeepCoder-14B-Preview, a code reasoning model fine-tuned via distributed RL from Deepseek-R1-Distilled-Qwen-14B. Achieving an impressive 60.6% Pass@1 accuracy on LiveCodeBench, it rivals OpenAI's o3-mini, using only 14B parameters. The project open-sources its dataset, code, training logs, and system optimizations, showcasing a robust training recipe built on high-quality data and algorithmic improvements to GRPO. This advancement democratizes access to high-performing code-generation models.