/ Introduction Wan 2.1 Models (1.3B and 14B)
By Wan 2.1
07 Mar 2025
09
1203
Wan 2.1 represents a groundbreaking suite of open-source video foundation models that sets new standards in video generation technology. This article explores its key features and capabilities.
There is a new leader in open source video generation! Alibaba's new Wan 2.1 model is now the leading open weights model in the Artificial Analysis Video Arena, surpassing former titleholder Mochi 1
Wan 2.1 is a 14B parameter model (1.3B variant also released) and stands out for its ability to generate realistic looking video with high-fidelity motion.
Key details regarding Wan 2.1:
Wan 2.1 consistently outperforms both existing open-source models and commercial solutions across multiple benchmarks. Its comprehensive evaluation across 14 major dimensions and 26 sub-dimensions demonstrates superior capabilities in motion quality, visual quality, style rendering, and multi-targeting scenarios.
One of the most remarkable aspects of Wan 2.1 is its accessibility. The T2V-1.3B model requires only 8.19 GB VRAM, making it compatible with consumer-grade GPUs. On an RTX 4090, it can generate a 5-second 480P video in approximately 4 minutes without any optimization techniques.
Wan 2.1 excels in multiple tasks including:
A unique feature of Wan 2.1 is its ability to generate both Chinese and English text within videos, making it the first video model with bilingual text generation capabilities.
The Wan-VAE component delivers exceptional efficiency in:
Wan 2.1 represents a significant advancement in video generation technology, offering state-of-the-art performance while maintaining accessibility for consumer-grade hardware. Its comprehensive feature set and multiple model variants make it a versatile solution for various video generation needs.
Explore how our models redefine the boundaries of high-resolution video rendering.
A comprehensive user guide on how to generate AI videos with Wan 2.1, including prompt formula, components, examples, and advanced techniques.
Discover how VACE , the All-in-One Video Creation and Editing framework, redefines AI-powered video generation with its groundbreaking capabilities and the Wan 2.1 model.