Tencent's HunyuanWorld-Voyager: World-Consistent 3D Video Generation from a Single Image

2025-09-03
Tencent's HunyuanWorld-Voyager: World-Consistent 3D Video Generation from a Single Image

Tencent's AI team introduces HunyuanWorld-Voyager, a novel video diffusion framework generating world-consistent 3D point cloud sequences from a single image with user-defined camera paths. Voyager produces 3D-consistent scene videos for exploring virtual worlds along custom trajectories, also generating aligned depth and RGB video for efficient 3D reconstruction. Trained on over 100,000 video clips combining real-world and Unreal Engine synthetic data, Voyager achieves state-of-the-art results on the WorldScore benchmark. Code and pre-trained models are publicly available.