WAN 2.1 I2V Plus

Video

Description

Qwen WAN2.1-i2v-plus is an advanced AI video generation model by Alibaba that revolutionizes multimedia content creation. Based on Diffusion Transformer and innovative Wan-VAE architecture, the model supports various tasks including Image-to-Video generation.

Key Features:

Transforms static images into fluid video sequences, preserving original aspect ratio and visual elements. Accepts source images and text prompts to guide video generation.
High Performance: Outperforms existing alternatives, scoring 84.7%+ in authoritative benchmarks, especially adept at handling complex dynamic scenes and multi-object interactions.
Unique ability to generate text in both Chinese and English directly within videos.
Accessibility: Lightweight 1.3B model runs on consumer GPUs, requiring only 8.19 GB of RAM to create a 5-second 480P video.
Powerful Video VAE capable of encoding and decoding videos up to 1080P of any length while preserving temporal information.

Pricing

Pricing depends on the model type. For text models, prices are shown per 1 million tokens, with example request estimates below.

Video price

$2.00

per 5 seconds

Request price

800 sparks

per 5 seconds

Video cost examples

5-second video

Estimated cost of a short video.

≈ $2.00

20-second video

Estimated cost of a video around 20 seconds long.

≈ $8.00

Actual cost may vary depending on prompt length, output length, generation settings, and selected model.