FramePack: A Revolutionary Next-Frame Prediction Model for AI Video Generation
2025-04-20
FramePack is a groundbreaking next-frame prediction neural network architecture that compresses input contexts to a fixed length, making the generation workload independent of video length. This achieves O(1) computational complexity for streaming, setting a new benchmark in AI video generation. It generates high-quality videos using only 6GB of GPU memory on laptops with RTX 3060. Generation speed reaches 1.5-2.5 seconds per frame on an RTX 4090, but is 4-8 times slower on laptops with 3070ti/3060. Its bi-directional sampling method effectively eliminates the common drifting problem in video generation.