Create a Kling image-to-video task
Convert a single image into a Kling video with CometAPI's POST /kling/v1/videos/image2video endpoint. The call creates an asynchronous task; track its progress by task ID.
Before you call it
- Provide one public image URL or one base64 image string
- Use an image that meets Kling pixel requirements; tiny thumbnails are rejected by the generation task
- Start with the default kling-v2-6 example, then choose another model_name from the OpenAPI enum when you need a different model track
- Keep the first request simple: one input image, one prompt, no tail frame, no motion masks
- Use dynamic_masks as an array of mask-and-trajectory objects when you need controlled local motion
- Use sound: off for a deterministic no-audio first request on model tracks that support generated sound
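The checklist above can be sketched as a minimal first request. This is a hedged sketch, not an official client: the base URL https://api.cometapi.com, the environment variable name COMETAPI_KEY, and the field names image and prompt are assumptions that this page does not confirm, while model_name and sound come from the checklist. The request is built but not sent, so it can be inspected first.

```python
import json
import os
import urllib.request

# Assumption: the API host is not stated on this page; replace it if yours differs.
BASE_URL = "https://api.cometapi.com"
ENDPOINT = "/kling/v1/videos/image2video"


def build_image2video_request(api_key, image, prompt,
                              model_name="kling-v2-6", sound="off"):
    """Build (but do not send) a minimal image-to-video request."""
    payload = {
        "model_name": model_name,
        "image": image,    # assumed field name: public URL or base64 string
        "prompt": prompt,  # assumed field name: desired motion
        "sound": sound,    # "off" keeps the first request free of generated audio
    }
    return urllib.request.Request(
        BASE_URL + ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_image2video_request(
        api_key=os.environ.get("COMETAPI_KEY", "<your-key>"),
        image="https://example.com/input.png",
        prompt="The camera slowly pushes in",
    )
    # Inspect the request before sending it.
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Once the payload looks right, passing the request to urllib.request.urlopen would submit it; the response is expected to carry the task ID used for progress tracking.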
Model naming
Use ordinary Kling video model IDs on this endpoint. Keep Omni model IDs for Omni Video.
Task flow
Poll the task
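Polling can be sketched as a loop that checks the task status, pauses, and repeats until a terminal state or a timeout. This is a minimal sketch under assumptions: this page does not show the query endpoint or the response shape, so the fetch_status callable, the status strings succeed and failed, and the default intervals are hypothetical placeholders to replace with values from the task-query reference.

```python
import time


def poll_task(task_id, fetch_status, interval_s=5.0, timeout_s=600.0,
              done_states=("succeed",), failed_states=("failed",),
              sleep=time.sleep, clock=time.monotonic):
    """Call fetch_status(task_id) until a terminal status or the timeout.

    fetch_status is a hypothetical callable you supply; it should query the
    task and return its status string. The state names are assumptions.
    """
    deadline = clock() + timeout_s
    while True:
        status = fetch_status(task_id)
        if status in done_states:
            return status
        if status in failed_states:
            raise RuntimeError(f"task {task_id} ended with status {status!r}")
        if clock() >= deadline:
            raise TimeoutError(f"task {task_id} still {status!r} after {timeout_s}s")
        # Wait before the next check to avoid hammering the API.
        sleep(interval_s)
```

Injecting sleep and clock keeps the loop testable without real delays; in production the defaults can stay as they are.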
Authorizations
Bearer token authentication. Use your CometAPI key.
Body
Source image URL or base64 image string. Use an image that meets Kling pixel requirements; very small thumbnails are rejected. For base64 input, send the encoded image string as the field value.
Text prompt describing the desired motion. Maximum 500 characters.
Elements to exclude from the video. Maximum 200 characters.
Webhook URL to receive task status updates when the task completes.
Generation mode. std for standard (faster), pro for professional (higher quality).
std, pro
Model ID for this image-to-video request. Use kling-v3 for new requests. Use Omni model IDs only with the Omni Video endpoint.
kling-v1, kling-v1-5, kling-v1-6, kling-v2-master, kling-v2-1, kling-v2-1-master, kling-v2-5-turbo, kling-v2-6, kling-v3
Tail-frame reference image as a Base64 string or public URL. Same format requirements as image. Controls the last frame of the generated video.
Prompt adherence strength. Higher values follow the prompt more closely. Range: 0–1.
Output video length in seconds. Use 5 or 10; omit to use 5.
Static brush mask image as a Base64 string or public URL. White areas are frozen in place during video generation. Must match the aspect ratio and resolution of the input image.
Optional motion masks. Each entry contains a mask image and an ordered trajectory list.
Custom task ID for your own tracking. It does not replace the system-generated task ID, but you can use it to query tasks. Must be unique per user.
Optional generated-audio switch for models that support video sound. Use on or off, or omit the field for the model default.
on, off
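The limits listed for the body fields can be checked client-side before submitting. The following is a minimal sketch, not an official validator: the key names prompt, negative_prompt, mode, cfg_scale, and duration are assumed because this page does not show them, while model_name and sound appear above. Whether duration is sent as a number or a string is also an assumption, so the sketch accepts both.

```python
MODEL_NAMES = {
    "kling-v1", "kling-v1-5", "kling-v1-6", "kling-v2-master", "kling-v2-1",
    "kling-v2-1-master", "kling-v2-5-turbo", "kling-v2-6", "kling-v3",
}


def validate_body(body):
    """Return a list of problems found; an empty list means none were found."""
    problems = []
    # Length limits from the field descriptions (key names are assumed).
    if len(body.get("prompt", "")) > 500:
        problems.append("prompt exceeds 500 characters")
    if len(body.get("negative_prompt", "")) > 200:
        problems.append("negative_prompt exceeds 200 characters")
    # Enumerated values; optional fields are only checked when present.
    if "mode" in body and body["mode"] not in ("std", "pro"):
        problems.append("mode must be std or pro")
    if "model_name" in body and body["model_name"] not in MODEL_NAMES:
        problems.append("model_name is not in the documented list")
    if "cfg_scale" in body and not 0 <= body["cfg_scale"] <= 1:
        problems.append("cfg_scale must be between 0 and 1")
    if "duration" in body and str(body["duration"]) not in ("5", "10"):
        problems.append("duration must be 5 or 10")
    if "sound" in body and body["sound"] not in ("on", "off"):
        problems.append("sound must be on or off")
    return problems
```

For example, validate_body({"prompt": "x" * 501}) reports the prompt length problem, while a body that stays within every limit yields an empty list. The server remains the authority; a check like this only catches obvious mistakes early.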