Avatar

curl --request POST \
  --url https://api.cometapi.com/kling/v1/videos/avatar/image2video \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "image": "example"
}
'

Task flow

Create the avatar task

Submit the image and one audio source, then save the returned task id.

Poll the task

Continue with Individual Queries until the task reaches a terminal state.

Store the finished result

Copy the final asset into your own storage if you need retention beyond the provider delivery URL.
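The polling step above can be sketched as a small loop. This is a minimal sketch under stated assumptions, not the provider's client: the `task_status` field name and the terminal values `succeed` and `failed` are assumed, so check them against the Individual Queries reference. The `fetch_status` callable is a hypothetical stand-in for whatever function performs the actual query.

```python
import time
from typing import Any, Callable, Dict

# Assumed terminal status names; confirm against the Individual Queries reference.
TERMINAL_STATES = {"succeed", "failed"}


def poll_until_terminal(
    fetch_status: Callable[[], Dict[str, Any]],
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call fetch_status repeatedly until it reports a terminal state."""
    deadline = time.monotonic() + timeout_s
    while True:
        task = fetch_status()
        # "task_status" is an assumed field name for the task state.
        if task.get("task_status") in TERMINAL_STATES:
            return task
        if time.monotonic() >= deadline:
            raise TimeoutError("task did not reach a terminal state in time")
        sleep(interval_s)
```

Injecting `sleep` keeps the loop easy to exercise in tests without real delays. Once the loop returns a successful task, copy the asset to your own storage promptly.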

Authorizations

Authorization

string

header

required

Bearer token authentication. Use your CometAPI key.

Headers

Content-Type

string

Content type of the request body. Optional; use application/json when you send it.

Body

application/json


image

string

default:example

required

Avatar reference image. Accepts an image URL or raw Base64 string (no data: prefix). Supported formats: JPG, JPEG, PNG. Max file size 10 MB. Minimum dimension 300 px on each side; aspect ratio between 1:2.5 and 2.5:1.
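The image limits above can be checked locally before uploading. The following is a minimal sketch, assuming you already know the image's pixel dimensions, byte size, and file extension (for example from an imaging library). The function names are illustrative, and 10 MB is assumed to mean 10 × 1024 × 1024 bytes.

```python
import base64
from typing import List

ALLOWED_EXTENSIONS = {"jpg", "jpeg", "png"}
MAX_IMAGE_BYTES = 10 * 1024 * 1024  # assumption: 1 MB = 1024 * 1024 bytes
MIN_SIDE_PX = 300
MAX_ASPECT = 2.5  # longer side divided by shorter side may not exceed 2.5


def check_image(width: int, height: int, size_bytes: int, extension: str) -> List[str]:
    """Return a list of problems; an empty list means the image passes."""
    problems = []
    if extension.lower().lstrip(".") not in ALLOWED_EXTENSIONS:
        problems.append("format must be JPG, JPEG, or PNG")
    if size_bytes > MAX_IMAGE_BYTES:
        problems.append("file exceeds 10 MB")
    if min(width, height) < MIN_SIDE_PX:
        problems.append("each side must be at least 300 px")
    elif max(width, height) / min(width, height) > MAX_ASPECT:
        problems.append("aspect ratio must be between 1:2.5 and 2.5:1")
    return problems


def to_raw_base64(data: bytes) -> str:
    """Encode image bytes as plain Base64, with no data: URI prefix."""
    return base64.b64encode(data).decode("ascii")
```

For example, `check_image(300, 900, 500_000, "png")` reports an aspect ratio problem because 900 / 300 is 3, which exceeds 2.5.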

audio_id

string

required

Audio ID returned by the Kling TTS API. Only audio clips between 2 and 60 seconds generated within the last 30 days are accepted. Mutually exclusive with sound_file — exactly one must be provided.

prompt

string

required

Text prompt to guide avatar actions, emotions, and camera movements. Max 2500 characters. Required — the API rejects requests without this field.

sound_file

string

Audio file as a URL or Base64 string. Accepted formats: MP3, WAV, M4A, AAC. Max 5 MB, duration 2–60 seconds. Mutually exclusive with audio_id — exactly one must be provided.
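The sound_file limits above lend themselves to a quick local check. This sketch assumes the caller supplies the file extension, byte size, and duration (measuring duration needs an audio library, which is out of scope here). `check_sound_file` is a hypothetical helper name, and 5 MB is assumed to mean 5 × 1024 × 1024 bytes.

```python
from typing import List

ALLOWED_AUDIO_EXTENSIONS = {"mp3", "wav", "m4a", "aac"}
MAX_AUDIO_BYTES = 5 * 1024 * 1024  # assumption: 1 MB = 1024 * 1024 bytes
MIN_SECONDS = 2.0
MAX_SECONDS = 60.0


def check_sound_file(extension: str, size_bytes: int, duration_s: float) -> List[str]:
    """Return a list of problems; an empty list means the file passes."""
    problems = []
    if extension.lower().lstrip(".") not in ALLOWED_AUDIO_EXTENSIONS:
        problems.append("format must be MP3, WAV, M4A, or AAC")
    if size_bytes > MAX_AUDIO_BYTES:
        problems.append("file exceeds 5 MB")
    if not MIN_SECONDS <= duration_s <= MAX_SECONDS:
        problems.append("duration must be between 2 and 60 seconds")
    return problems
```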

mode

string

Generation mode. std (standard, faster and more cost-effective) or pro (professional, higher quality output).

callback_url

string

Webhook URL for task status notifications. The server sends a callback when the task status changes.

external_task_id

string

Optional user-defined task ID for your own tracking. Does not replace the system-generated task ID. Must be unique per account.
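Taken together, the body fields above can be assembled and validated before the request leaves your machine. The sketch below uses only the Python standard library and returns an unsent request object. The function name `build_avatar_request` is illustrative, and the checks mirror the documented rules rather than the server's actual validation.

```python
import json
import urllib.request
from typing import Optional

ENDPOINT = "https://api.cometapi.com/kling/v1/videos/avatar/image2video"
MAX_PROMPT_CHARS = 2500
VALID_MODES = {"std", "pro"}


def build_avatar_request(
    api_key: str,
    image: str,
    prompt: str,
    audio_id: Optional[str] = None,
    sound_file: Optional[str] = None,
    mode: Optional[str] = None,
    callback_url: Optional[str] = None,
    external_task_id: Optional[str] = None,
) -> urllib.request.Request:
    """Apply the documented field rules and return an unsent POST request."""
    if not prompt or len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError("prompt is required and limited to 2500 characters")
    if (audio_id is None) == (sound_file is None):
        raise ValueError("provide exactly one of audio_id or sound_file")
    if mode is not None and mode not in VALID_MODES:
        raise ValueError("mode must be 'std' or 'pro'")

    body = {"image": image, "prompt": prompt}
    optional = {
        "audio_id": audio_id,
        "sound_file": sound_file,
        "mode": mode,
        "callback_url": callback_url,
        "external_task_id": external_task_id,
    }
    # Leave out fields the caller did not set instead of sending nulls.
    body.update({key: value for key, value in optional.items() if value is not None})

    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` sends it. Keeping construction separate from sending makes the validation easy to test offline.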

Response

200 - application/json

Task accepted.

code

integer

required

message

string

required

data

object

required
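A response with these three fields can be unpacked as follows. This is a sketch under two assumptions that this page does not confirm: that a `code` of 0 indicates success, and that the task id is returned as `data.task_id`. Adjust both to match the actual child attributes of `data`.

```python
import json


def extract_task_id(response_text: str) -> str:
    """Parse the JSON response and return the task id for later polling."""
    payload = json.loads(response_text)
    # Assumption: code 0 means the task was accepted.
    if payload.get("code") != 0:
        raise RuntimeError(
            f"request failed: code={payload.get('code')} message={payload.get('message')}"
        )
    # Assumption: the task id lives at data.task_id.
    task_id = (payload.get("data") or {}).get("task_id")
    if not task_id:
        raise RuntimeError("response did not include data.task_id")
    return task_id
```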
