Omni Video (Beta)
Create Kling O1 videos via the beta CometAPI Omni Video endpoint (POST /kling/v1/videos/omni-video) and track task status with Omni Query.
Choose the input mode
- Text-to-video: send prompt, mode, aspect_ratio, and duration
- Image-referenced generation: add image_list
- Element-guided generation: add element_list
- Video transformation: add video_list with refer_type: base
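
The four input modes differ only in which optional lists accompany the core fields, so one request-body builder can cover all of them. The following is a minimal sketch, not the official client: the model_name key and the video_url key inside the video entry are assumptions (this page names the model value kling-video-o1 but not its field name), and whether duration is sent as a number or a string is not stated here.

```python
import json

OMNI_VIDEO_PATH = "/kling/v1/videos/omni-video"  # endpoint path given on this page


def build_omni_payload(prompt, mode="pro", aspect_ratio="16:9", duration=5,
                       image_list=None, element_list=None, video_list=None):
    """Assemble a request body for one of the four input modes."""
    payload = {
        "model_name": "kling-video-o1",  # ASSUMED key name; the value is from this page
        "prompt": prompt,
        "mode": mode,
        "aspect_ratio": aspect_ratio,
        "duration": duration,  # number vs. string encoding is not specified here
    }
    # Reference inputs are optional: include each list only when it is supplied.
    for key, value in (("image_list", image_list),
                       ("element_list", element_list),
                       ("video_list", video_list)):
        if value:
            payload[key] = value
    return payload


# Text-to-video: only the four core fields plus the model.
text_to_video = build_omni_payload("A paper boat drifting down a rainy street",
                                   duration=10)

# Video transformation: add video_list with refer_type set to base.
# "video_url" is a hypothetical key name used for illustration.
video_edit = build_omni_payload(
    "Turn the scene in <<<video_1>>> into a snowy evening",
    video_list=[{"video_url": "https://example.com/clip.mp4", "refer_type": "base"}],
)

print(json.dumps(video_edit, indent=2))
```

The resulting dictionary would be sent as the JSON body of a POST to the path above, with the CometAPI key as a Bearer token.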
Duration and aspect ratio
| Setting | Supported values | Default | Boundary behavior |
|---|---|---|---|
| duration for text or first-frame video | 5, 10 | 5 | Use one of the two fixed generation lengths. |
| duration with reference video input | input video length from 3 to 10 seconds | input length | The generated duration follows the base video and ignores the request value. |
| aspect_ratio | 16:9, 9:16, 1:1 | 16:9 | Required when no first-frame image or base video controls the frame. |
| mode | std, pro | pro | pro is the default for Omni Video. |

The endpoint does not expose an exact size field. Treat aspect_ratio as a frame-shape request; a 16:9 text-to-video request can render as full-HD 1920x1080.
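
The boundary rules in the table can be checked on the client before a request is submitted. This sketch only mirrors the values listed above; the function names are hypothetical, and the server remains the authority on what it accepts.

```python
ASPECT_RATIOS = {"16:9", "9:16", "1:1"}
FIXED_DURATIONS = {5, 10}


def resolve_duration(requested=None, base_video_seconds=None):
    """Return the duration the generation is expected to use."""
    if base_video_seconds is not None:
        # With a base video, the output follows the input length
        # and the requested value is ignored.
        if not 3 <= base_video_seconds <= 10:
            raise ValueError("base video must be 3 to 10 seconds long")
        return base_video_seconds
    if requested is None:
        return 5  # documented default
    if requested not in FIXED_DURATIONS:
        raise ValueError("text or first-frame video supports only 5 or 10 seconds")
    return requested


def resolve_aspect_ratio(requested=None):
    """Return a supported aspect ratio, falling back to the documented default."""
    if requested is None:
        return "16:9"
    if requested not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect_ratio: {requested}")
    return requested
```

For example, resolve_duration(5, base_video_seconds=7) returns 7, because the base video length overrides the request value.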
Task flow
Poll the Omni task
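
A polling loop for the Omni task might look like the sketch below. It is written against a caller-supplied fetch_status function because this page does not give the Omni Query path or response shape; the terminal status names succeed and failed are likewise assumptions and should be replaced with the values the query endpoint actually returns.

```python
import time


def poll_until_done(fetch_status, terminal=("succeed", "failed"),
                    interval=5.0, timeout=600.0,
                    sleep=time.sleep, clock=time.monotonic):
    """Call fetch_status() until it returns a terminal status or time runs out.

    fetch_status: hypothetical callable that queries the task and returns
                  its status string.
    terminal:     ASSUMED status names that end polling.
    """
    deadline = clock() + timeout
    while True:
        status = fetch_status()
        if status in terminal:
            return status
        if clock() >= deadline:
            raise TimeoutError(f"task still '{status}' after {timeout} seconds")
        sleep(interval)  # wait before the next query to avoid hammering the API
```

If a webhook URL is supplied with the request, polling can serve as a fallback rather than the primary notification path.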
Authorizations
Bearer token authentication. Use your CometAPI key.
Headers
Content type of the request body.
Body
Text prompt describing the desired video. Max 2500 characters. Reference images, elements, or videos inline using <<<image_1>>>, <<<element_1>>>, <<<video_1>>> syntax.
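
As an illustration of the placeholder syntax, the sketch below checks a prompt against the length limit and confirms that each placeholder has a matching entry. It assumes placeholders are numbered from 1 in list order, which the examples above suggest but this page does not state outright; the function name is hypothetical.

```python
import re

# Matches <<<image_1>>>, <<<element_2>>>, <<<video_1>>> and similar.
PLACEHOLDER = re.compile(r"<<<(image|element|video)_(\d+)>>>")


def check_placeholders(prompt, image_count=0, element_count=0, video_count=0):
    """Return the (kind, index) placeholders found, or raise on a mismatch."""
    if len(prompt) > 2500:
        raise ValueError("prompt exceeds 2500 characters")
    available = {"image": image_count, "element": element_count, "video": video_count}
    found = []
    for kind, index in PLACEHOLDER.findall(prompt):
        n = int(index)
        # ASSUMPTION: indices are 1-based and refer to positions in the lists.
        if not 1 <= n <= available[kind]:
            raise ValueError(f"<<<{kind}_{n}>>> has no matching {kind} entry")
        found.append((kind, n))
    return found
```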
Model to use for Omni Video. Use kling-video-o1 for this endpoint.
Reference images for the generation. Each entry can optionally include a type field set to first_frame or end_frame to pin the image as the video's start or end frame. Accepts URLs or Base64 strings. Formats: JPG, JPEG, PNG. Max 10 MB per image, minimum 300 px per side, aspect ratio 1:2.5-2.5:1. When used with reference videos, image + element count must not exceed 4; without reference videos, the limit is 7.
Pre-registered element references by ID. Combined with images, the total count must not exceed 4 (with reference video) or 7 (without).
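
The combined limit on images and elements depends on whether a reference video is present. A small client-side check, sketched here with a hypothetical function name, can catch an oversized request early:

```python
def check_reference_count(image_list=(), element_list=(), has_reference_video=False):
    """Enforce the combined image + element limit described above."""
    # 4 references when a reference video is attached, 7 otherwise.
    limit = 4 if has_reference_video else 7
    total = len(image_list) + len(element_list)
    if total > limit:
        raise ValueError(f"{total} image/element references exceed the limit of {limit}")
    return total
```

For instance, three images plus two elements pass without a reference video but are rejected once one is attached.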
Generation mode. std (standard, faster and more cost-effective) or pro (professional, higher quality output). The default is pro.
Aspect ratio request for beta Kling Omni generation. The endpoint does not expose an exact size field; a 16:9 text-to-video request can render as 1920x1080.
Video length in seconds. Range: 3-10. Text-to-video and first-frame-to-video support only 5 and 10. When editing an existing video, the output duration matches the input and this parameter is ignored.
Reference video for style transfer or the base video to edit. Use refer_type to distinguish: feature for style reference, base for the video to modify. Set keep_original_sound to yes or no to control audio retention. Only MP4/MOV, 3-10 seconds, 720-2160 px, 24-60 fps, max 200 MB. Only one video is accepted.
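
The video entry could be assembled as in the sketch below. The refer_type and keep_original_sound names and values come from the description above; the video_url key and the function name are assumptions. The extension check is only a heuristic on the URL path, and duration, resolution, frame rate, and file size still need to be verified against the file itself.

```python
from urllib.parse import urlparse

ALLOWED_EXTENSIONS = (".mp4", ".mov")


def build_video_list(url, refer_type, keep_original_sound="yes"):
    """Return a one-entry video list; only one video is accepted."""
    if refer_type not in ("feature", "base"):
        raise ValueError("refer_type must be 'feature' (style reference) or 'base' (video to edit)")
    if keep_original_sound not in ("yes", "no"):
        raise ValueError("keep_original_sound must be 'yes' or 'no'")
    # Heuristic: inspect the URL path for an MP4/MOV extension.
    if not urlparse(url).path.lower().endswith(ALLOWED_EXTENSIONS):
        raise ValueError("only MP4 and MOV videos are supported")
    return [{
        "video_url": url,  # ASSUMED key name, not confirmed by this page
        "refer_type": refer_type,
        "keep_original_sound": keep_original_sound,
    }]
```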
Webhook URL for task status notifications. The server sends a callback when the task status changes.
Optional user-defined task ID for your own tracking. Does not replace the system-generated task ID. Must be unique per account.
Response
Success
The response is of type object.