⌘K

Wan 2.6 - Image to Video

wan26_i2v

Alibaba Wanx 26 Image to Video converts static images into dynamic video content with advanced AI capabilities.

Authentication

authorization string required

All APIs require authentication via Bearer Token.

Get API Key:

Visit API Key Management Page to get your API Key

Usage:

Add to request header:

Authorization: Bearer YOUR_API_KEY

Parameters

model string required

Model ID to use for the request

Value: wan26_i2v


input object required

Input parameters for the generation request

prompt string

Text prompt used to describe the expected elements and visual characteristics in the generated video. Supports Chinese and English. Each Chinese character/letter counts as one character, and excess content will be automatically truncated.

Maximum length: 1500 characters for wan2.6-i2v

Note: When using video effects parameters (i.e., template is not empty), the prompt parameter is invalid and does not need to be filled.

Example: A kitten running on the grass

img_url string required

URL or Base64 encoded data of the first frame image.

Image Limitations:
Supported formats: JPEG, JPG, PNG (no transparency), BMP, WEBP
Image resolution: Width and height range [360, 2000] pixels
File size: Maximum 10MB

Input Methods:
Method 1: Publicly accessible URL
Supports HTTP or HTTPS protocol
Example: https://cdn.translate.alibaba.com/r/wanx-demo-1.png

Method 2: Base64 encoded string
Format: data:{MIME_type};base64,{base64_data}
Example: data:image/png;base64,GDU7MtCZzEbTbmRZ...

audio_url string

URL of the audio file. The model will use this audio to generate the video.

Supports HTTP or HTTPS protocol. Local files can obtain temporary URLs by uploading files.

Audio Limitations:
Supported formats: wav, mp3
Duration: 3 - 30 seconds
File size: Maximum 15MB

Handling Excess:
If audio length exceeds the duration value (5 or 10 seconds), the first 5 or 10 seconds are automatically extracted, and the rest is discarded.
If audio length is less than video duration, the portion beyond audio length will be silent. For example, if audio is 3 seconds and video duration is 5 seconds, the output video has sound for the first 3 seconds and is silent for the last 2 seconds.

Example: https://help-static-aliyun-doc.aliyuncs.com/file-manage-files/zh-CN/20250925/ozwpvi/rap.mp3


parameters object

Generation parameters

resolution string

Important: The resolution parameter directly affects billing costs. For the same model: 1080P > 720P > 480P. Please confirm the model pricing before calling.

Specifies the video resolution level to adjust video clarity (total pixels). The model automatically scales to similar total pixels based on the selected resolution level. The video aspect ratio will try to match the aspect ratio of the input image img_url.

Options: 720P, 1080P

Default: 1080P

duration integer

Important: The duration parameter directly affects billing costs. Billing is per second - longer duration means higher cost. Please confirm the model pricing before calling.

Duration of the generated video in seconds (integers only).

Options: 5, 10, 15

Range: 2 - 15

Default: 5

prompt_extend boolean

Whether to enable intelligent prompt rewriting.

When enabled, uses a large model to intelligently rewrite the input prompt. This significantly improves generation results for shorter prompts but adds processing time.

Options: true, false

Default: true

watermark boolean

Whether to add a watermark identifier. The watermark is located in the lower right corner of the video with fixed text “AI Generated”.

Options: true, false

Default: false

audio boolean

Whether to generate video with audio.

Parameter priority: audio > audio_url. When audio=false, even if audio_url is provided, the output will still be a silent video, and billing will be calculated as silent video.

Options: true, false

Default: true

shot_type string

Specifies the shot type of the generated video, i.e., whether the video consists of one continuous shot or multiple switching shots.

Effective condition: Only takes effect when prompt_extend: true.

Parameter priority: shot_type > prompt. For example, if shot_type is set to single, even if the prompt contains “generate multi-shot video”, the model will still output a single-shot video.

Note: When strict control over video narrative structure is needed (e.g., single shot for product demonstrations, multi-shot for short stories), this parameter can be specified.

Options: single, multi

Default: single

negative_prompt string

Negative prompt describing content you don’t want to appear in the video

seed integer

Random seed. If not specified, the system automatically generates a random seed. To improve reproducibility of generation results, it is recommended to fix the seed value.

Note: Due to the probabilistic nature of model generation, even with the same seed, it cannot guarantee that generation results will be completely consistent every time.

Example: 12345

Range: 0 - 2147483647


Polling

Since video generation takes time, you need to poll the task status after creation.

The initial response returns the task ID and initial status. The actual generation results must be obtained through polling the task status endpoint.


Response Format

error object

Error information. Only present when status is failed.

code string

Error code

message string

Detailed error message


output array

Generation results. Only present when status is completed.

status string

Task status

Options: in_queue, processing, done, failed

content array

List of generated video content

type string

Resource type

Value: video

url string

Processed video URL (CDN address)

jobId string

Remote task ID


usage object

Usage statistics. Only present when status is completed.

cost string

Total cost in USD

discount number

Discount amount


metadata object

Metadata information.


Error Codes

Error CodeDescription
001027095Internal generation error
001027096Result parsing error
001027097HTTP error response
001027098Status check error
001027099Task creation error