Tencent's Hunyuan-T1: Redefining Reasoning Efficiency with the First Mamba-Powered Ultra-Large Model
Tencent has unveiled Hunyuan-T1, the latest addition to its Hunyuan large model series. Built on TurboS, the world's first ultra-large-scale Hybrid-Transformer-Mamba MoE model, Hunyuan-T1 gains significantly stronger reasoning capabilities and better alignment with human preferences from extensive post-training. Compared with its preview version, Hunyuan-T1 shows a substantial performance boost and doubles decoding speed. It achieves comparable or slightly better results than DeepSeek-R1 on various public benchmarks and outperforms R1 in internal human evaluations, particularly in cultural and creative instruction following, text summarization, and agent capabilities. The release marks a significant advance in the use of reinforcement learning for post-training optimization of large language models.