How to Generate AI Videos with Wan 2.1

07 Mar 2025

Wan 2.1 is a text-to-video (T2V) and image-to-video (I2V) AI model developed by Alibaba Cloud. It generates videos from text prompts or images, and its features include complex motion generation, physical law simulation, and multilingual text rendering. This guide covers how to craft effective prompts for Wan 2.1 and gives example prompts for both modes.


1. Prompt Guidelines

1.1 Key Principles

  • Be Specific: Clearly describe the scene, characters, actions, and environment. Anything the prompt leaves out is left for the model to fill in, so a detailed prompt usually gives a more predictable result.
  • Use Descriptive Language: Include adjectives, adverbs, and sensory details to enhance the visual and emotional impact.
  • Incorporate Motion: Describe dynamic actions (e.g., "a bird flying gracefully") to leverage Wan 2.1's strength in motion generation.
  • Leverage Physical Laws: Mention realistic physical interactions (e.g., "water splashing") for more authentic results.
  • Specify Style and Mood: Indicate the desired artistic style (e.g., "cinematic," "cartoonish") or mood (e.g., "serene," "intense").
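
One way to apply these principles consistently is to assemble each prompt from named parts. The sketch below is a minimal illustration, not part of Wan 2.1: the `PromptSpec` class, its field names, and `build_prompt` are hypothetical helpers invented for this example, and the comma-separated ordering is only an assumption about what reads well.

```python
from dataclasses import dataclass


@dataclass
class PromptSpec:
    """Hypothetical container for the parts of a video prompt."""
    subject: str        # who or what is in the scene
    action: str         # the motion to generate
    environment: str    # where and when it happens
    physics: str = ""   # optional physical interaction, e.g. "water splashing"
    style: str = ""     # optional artistic style, e.g. "cinematic"
    mood: str = ""      # optional mood, e.g. "serene"


def build_prompt(spec: PromptSpec) -> str:
    """Join the non-empty parts into one comma-separated prompt string."""
    core = f"{spec.subject} {spec.action} {spec.environment}".strip()
    extras = [spec.physics]
    if spec.style:
        extras.append(f"{spec.style} style")
    if spec.mood:
        extras.append(f"{spec.mood} mood")
    # Keep the core sentence first, then append only the extras that were set.
    return ", ".join([core] + [e for e in extras if e])


example = PromptSpec(
    subject="A deer",
    action="drinking from a crystal-clear stream",
    environment="in a forest at sunrise",
    physics="water rippling around its muzzle",
    style="cinematic",
    mood="serene",
)
print(build_prompt(example))
```

Filling in every field forces a decision on subject, motion, setting, physics, style, and mood before the prompt is submitted, which is the point of the checklist above.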

1.2 Advanced Techniques

  • Prompt Enhancement: Use a tool such as the DashScope API to expand and refine a short prompt before generation.
  • Negative Prompts: List what you don’t want in the video (e.g., "blurry backgrounds") as a negative prompt to steer the model away from it.
  • Multilingual Support: Wan 2.1 accepts prompts in both English and Chinese.
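
As a rough sketch of how a prompt and a negative prompt might travel together, the helper below bundles them into one dictionary. Everything here is an assumption for illustration: `build_request` is a hypothetical function, the key names `prompt` and `negative_prompt` are placeholders rather than a documented Wan 2.1 or DashScope schema, and stripping a leading "no " assumes the negative prompt should name the unwanted trait itself.

```python
def build_request(prompt: str, avoid: list[str]) -> dict[str, str]:
    """Bundle a prompt with a comma-separated negative prompt.

    The key names "prompt" and "negative_prompt" are assumptions;
    rename them to match whatever interface you actually call.
    """
    cleaned = []
    for item in avoid:
        text = item.strip()
        # Assumption: the negative prompt names the unwanted trait directly,
        # so a leading "no " is dropped ("no blurry backgrounds" -> "blurry backgrounds").
        if text.lower().startswith("no "):
            text = text[3:].strip()
        if text:  # skip entries that are empty after cleaning
            cleaned.append(text)
    return {"prompt": prompt.strip(), "negative_prompt": ", ".join(cleaned)}


request = build_request(
    "A bird flying gracefully over a lake, cinematic style",
    ["no blurry backgrounds", "static camera"],
)
print(request["negative_prompt"])
```

Keeping the unwanted traits in a list makes it easy to reuse the same set across many prompts and to adjust one entry at a time when a result still shows an artifact.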

2. Prompt Examples

2.1 Text-to-Video (T2V) Examples

  1. Dynamic Action Scene
    "Two anthropomorphic cats in comfy boxing gear and bright gloves fight intensely on a spotlighted stage, with dramatic camera angles and slow-motion effects."

  2. Nature and Wildlife
    "A serene forest at sunrise, with sunlight filtering through the trees, a deer drinking from a crystal-clear stream, and birds chirping in the background."

  3. Urban Life
    "A bustling city street at night, with neon lights reflecting on wet pavement, people walking briskly, and a food vendor serving steaming hot noodles."

  4. Fantasy and Creativity
    "A magical castle floating in the clouds, with dragons flying around, glowing orbs of light, and a rainbow arching across the sky."

  5. Sports and Adventure
    "A professional snowboarder performing a backflip off a snowy cliff, with snow spraying in slow motion and a breathtaking mountain backdrop."

2.2 Image-to-Video (I2V) Examples

  1. Historical Scene
    "A medieval knight standing on a hill, overlooking a battlefield with armies clashing below, and a storm brewing in the distance."

  2. Sci-Fi Setting
    "A futuristic city with towering skyscrapers, flying cars zooming through the air, and holographic advertisements lighting up the night."

  3. Artistic Rendering
    "A watercolor painting of a tranquil countryside, with rolling hills, a small cottage, and a winding river under a pastel-colored sky."

  4. Product Showcase
    "A sleek smartphone rotating in mid-air, showcasing its design and features, with glowing particles swirling around it."

  5. Cultural Theme
    "A traditional Chinese festival with lanterns lighting up the night, people dancing in colorful costumes, and fireworks exploding in the sky."


By following these guidelines and adapting the examples, you can get more consistent results from Wan 2.1 (Tongyi Wanxiang) in your video generation projects. For more details, refer to the official documentation and community resources.
