⌘K

Spark Avatar Picture Omni V2

spark_avatar_picture_omni_v2

Spark Avatar Picture Omni V2 digital human avatar generation model. Generates digital human animation videos from images and audio through audio-driven lip-sync technology.

API Tips

Image and Audio Requirements

Image formats: JPG (JPEG), PNG, JFIF and other common formats (JPG format recommended)

Image requirements: Less than 5 MB, less than 4096×4096

Input image: Input image should contain a single person with a large face proportion and frontal view for best results. Other types of images may produce poor results

Audio duration: Recommended less than 15 seconds to ensure generation quality. Audio that is too long may cause quality degradation


Authentication

authorization string required

All APIs require authentication via Bearer Token.

Get API Key:

Visit API Key Management Page to get your API Key.

Usage:

Add to request header:

Authorization: Bearer YOUR_API_KEY

Parameters

model string required

Model ID to use for the request

Value: spark_avatar_picture_omni_v2


image_url string required

Input image URL


audio_url string required

Audio URL for digital human animation


Polling

Since video generation takes time, you need to poll the task status after creation

The initial response returns the task ID and initial status. The actual generation results must be obtained through polling the task status endpoint

Response Format

error object

Error information. Only present when status is failed

code string

Error code

error_message string

Detailed error message


output array

Generation results. Only present when status is completed

content array

List of generated content

type string

Resource type

url string

Content URL

size integer

File size

jobId string

Remote job ID


usage object

Usage statistics. Only present when status is completed

cost string

Total cost in USD

discount number

Discount amount

input_tokens integer

Number of input tokens (fixed at 0)

output_tokens integer

Number of output tokens (fixed at 0)

total_tokens integer

Total number of tokens (actual comfyui_cost)

generated_videos integer

Number of generated videos

generated_images integer

Number of generated images


metadata object

Metadata information


Error Codes

Error CodeDescription
003016001Image URL missing
003016002Audio URL missing
003016095Internal generation error
003016096Result parsing error
003016097HTTP error response
003016098Status check error
003016099Task creation error