User Guide: How to Generate AI Videos with Wan 2.1

  • Home
  • / User Guide: How to Generate AI Videos with Wan 2.1

image

07 Mar 2025

09

1203

Wan 2.1 is a powerful text-to-video (T2V) and image-to-video (I2V) AI model developed by Alibaba Cloud. It supports generating high-quality videos from text prompts or images, with features like complex motion generation, physical law simulation, and multilingual text rendering. Below is a guide to crafting effective prompts and examples for Wan 2.1.


1. Prompt Guidelines

1.1 Key Principles

  • Be Specific: Clearly describe the scene, characters, actions, and environment. The more detailed the prompt, the better the output.
  • Use Descriptive Language: Include adjectives, adverbs, and sensory details to enhance the visual and emotional impact.
  • Incorporate Motion: Describe dynamic actions (e.g., "a bird flying gracefully") to leverage Wan 2.1's strength in motion generation.
  • Leverage Physical Laws: Mention realistic physical interactions (e.g., "water splashing") for more authentic results.
  • Specify Style and Mood: Indicate the desired artistic style (e.g., "cinematic," "cartoonish") or mood (e.g., "serene," "intense").

1.2 Advanced Techniques

  • Prompt Enhancement: Use tools like Dashscope API to expand and refine your prompts for better results.
  • Negative Prompts: Specify what you don’t want in the video (e.g., "no blurry backgrounds") to guide the model.
  • Multilingual Support: Wan 2.1 supports both English and Chinese prompts, making it versatile for global users.

2. Prompt Examples

2.1 Text-to-Video (T2V) Examples

  1. Dynamic Action Scene
    "Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage, with dramatic camera angles and slow-motion effects."

  2. Nature and Wildlife
    "A serene forest at sunrise, with sunlight filtering through the trees, a deer drinking from a crystal-clear stream, and birds chirping in the background."

  3. Urban Life
    "A bustling city street at night, with neon lights reflecting on wet pavement, people walking briskly, and a food vendor serving steaming hot noodles."

  4. Fantasy and Creativity
    "A magical castle floating in the clouds, with dragons flying around, glowing orbs of light, and a rainbow arching across the sky."

  5. Sports and Adventure
    "A professional snowboarder performing a backflip off a snowy cliff, with snow spraying in slow motion and a breathtaking mountain backdrop."

2.2 Image-to-Video (I2V) Examples

  1. Historical Scene
    "A medieval knight standing on a hill, overlooking a battlefield with armies clashing below, and a storm brewing in the distance."

  2. Sci-Fi Setting
    "A futuristic city with towering skyscrapers, flying cars zooming through the air, and holographic advertisements lighting up the night."

  3. Artistic Rendering
    "A watercolor painting of a tranquil countryside, with rolling hills, a small cottage, and a winding river under a pastel-colored sky."

  4. Product Showcase
    "A sleek smartphone rotating in mid-air, showcasing its design and features, with glowing particles swirling around it."

  5. Cultural Theme
    "A traditional Chinese festival with lanterns lighting up the night, people dancing in colorful costumes, and fireworks exploding in the sky."


By following these guidelines and examples, you can unlock the full potential of Tongyi Wanxiang Wan 2.1 for your video generation projects. For more details, refer to the official documentation and community resources.

Related Articles

image
07 Mar 2025

Introduction to Wan 2.1 Models

A comprehensive overview of Wan 2.1 video foundation models

image
07 Mar 2025

User Guide: How to Generate AI Videos with Wan 2.1

A comprehensive guide on how to generate AI videos with Wan 2.1

image
07 Mar 2025

Wan 2.1 vs Sora: A Comprehensive Comparison

An in-depth analysis of two leading video generation models