How to Generate AI Videos with Wan 2.1

  • Home
  • / How to Generate AI Videos with Wan 2.1

image

07 Mar 2025

5

5765

Wan 2.1 is a powerful text-to-video (T2V) and image-to-video (I2V) AI model developed by Alibaba Cloud. It supports generating high-quality videos from text prompts or images, with features like complex motion generation, physical law simulation, and multilingual text rendering. Below is a guide to crafting effective prompts and examples for Wan 2.1.


1. Prompt Guidelines

1.1 Key Principles

  • Be Specific: Clearly describe the scene, characters, actions, and environment. The more detailed the prompt, the better the output.
  • Use Descriptive Language: Include adjectives, adverbs, and sensory details to enhance the visual and emotional impact.
  • Incorporate Motion: Describe dynamic actions (e.g., "a bird flying gracefully") to leverage Wan 2.1's strength in motion generation.
  • Leverage Physical Laws: Mention realistic physical interactions (e.g., "water splashing") for more authentic results.
  • Specify Style and Mood: Indicate the desired artistic style (e.g., "cinematic," "cartoonish") or mood (e.g., "serene," "intense").

1.2 Advanced Techniques

  • Prompt Enhancement: Use tools like Dashscope API to expand and refine your prompts for better results.
  • Negative Prompts: Specify what you don’t want in the video (e.g., "no blurry backgrounds") to guide the model.
  • Multilingual Support: Wan 2.1 supports both English and Chinese prompts, making it versatile for global users.

2. Prompt Examples

2.1 Text-to-Video (T2V) Examples

  1. Dynamic Action Scene
    "Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage, with dramatic camera angles and slow-motion effects."

  2. Nature and Wildlife
    "A serene forest at sunrise, with sunlight filtering through the trees, a deer drinking from a crystal-clear stream, and birds chirping in the background."

  3. Urban Life
    "A bustling city street at night, with neon lights reflecting on wet pavement, people walking briskly, and a food vendor serving steaming hot noodles."

  4. Fantasy and Creativity
    "A magical castle floating in the clouds, with dragons flying around, glowing orbs of light, and a rainbow arching across the sky."

  5. Sports and Adventure
    "A professional snowboarder performing a backflip off a snowy cliff, with snow spraying in slow motion and a breathtaking mountain backdrop."

2.2 Image-to-Video (I2V) Examples

  1. Historical Scene
    "A medieval knight standing on a hill, overlooking a battlefield with armies clashing below, and a storm brewing in the distance."

  2. Sci-Fi Setting
    "A futuristic city with towering skyscrapers, flying cars zooming through the air, and holographic advertisements lighting up the night."

  3. Artistic Rendering
    "A watercolor painting of a tranquil countryside, with rolling hills, a small cottage, and a winding river under a pastel-colored sky."

  4. Product Showcase
    "A sleek smartphone rotating in mid-air, showcasing its design and features, with glowing particles swirling around it."

  5. Cultural Theme
    "A traditional Chinese festival with lanterns lighting up the night, people dancing in colorful costumes, and fireworks exploding in the sky."


By following these guidelines and examples, you can unlock the full potential of Tongyi Wanxiang Wan 2.1 for your video generation projects. For more details, refer to the official documentation and community resources.

Related Articles

Introducing Qwen-Image - Advanced Text Rendering and Image Editing Model image
05 Aug 2025

Introducing Qwen-Image - Advanced Text Rendering and Image Editing Model

A comprehensive overview of Qwen-Image, a 20B MMDiT image foundation model that excels in complex text rendering and precise image editing
Wan 2.2 is Live Today! image
29 Jul 2025

Wan 2.2 is Live Today!

Easy Creation with One Click - AI Videos with Major Upgrade Announcement
Introducing Wan 2.1 FLF2V - First-Last-Frame Video Generation Model image
18 Apr 2025

Introducing Wan 2.1 FLF2V - First-Last-Frame Video Generation Model

A deep dive into Wan 2.1 FLF2V, an innovative video generation model that creates seamless transitions between start and end frames