API Doc-CometAPI
HomeDashBoardModel_Price
HomeDashBoardModel_Price
Discord_Support
  1. OpenAI Compatiable Endpoint
  • GET START
    • Model New Release Announcement
    • Platform notification
    • Help Center
    • Quick Start
    • About Pricing
    • About Grouping
    • Interface Stability
    • Privacy policy
    • Terms of service
    • Error code description
    • Code example
    • Must see for use
    • Common Misconceptions
    • Confusion about use
    • Best Practices
      • CometAPI Account Balance Query API Usage Instructions
      • Retry Logic Documentation for CometAPI and OpenAI Official API
      • Midjourney Best Practices
      • Runway Best Practices
  • OpenAI Compatiable Endpoint
    • gpt-4o-image generates image
      POST
    • Chat
      POST
    • Recognizing Images
      POST
    • Models
      GET
    • Embeddings
      POST
    • Images
      POST
    • Realtime
      POST
    • Image Editing (gpt-image-1)
      POST
  • Audio
    • Create speech
    • Create transcription
    • Create translation
  • Anthropic Compatiable Endpoint
    • Anthropic Claude
  • Music Generation Endpoint
    • Suno
      • Setting suno Version
      • Generate lyrics
      • Generate music clip
      • Upload clip
      • Submit concatenation
      • Single task query
      • Batch query tasks
    • Udio(Temporarily unavailable)
      • Generate music
      • Task query
  • Image Generation Endpoint
    • Midjourney(images)
      • Quick Tutorial - Complete Process in One Go
      • Task Fetching API
        • List by Condition
        • Fetch Single Task (most recommended)
      • Imagine
      • Action (UPSCALE; VARIATION; REROLL; ZOOM, etc.)
      • Blend (image -> image)
      • Describe (image -> text)
      • Modal (Area Redesign & Zoom)
    • Ideogram(images)
      • Official documentation (updated in real time)
      • Generate 3.0 (text to image)
      • Remix 3.0 (hybrid image)
      • Reframe 3.0(Reconstruction)
      • Replace Background 3.0(Background replacement)
      • Edit 3.0(Editing images)
      • ideogram Text Raw Image
      • ideogram Hybrid image
      • ideogram enlargement HD
      • ideogram describes the image
      • ideogram Edit image((legacy))
    • Flux(images)
      • Generate image (replicate format)
      • flux fine-tune images(Temporarily unavailable)
      • flux generate image(Temporarily unavailable)
      • flux query
    • Replicate(image)
      • replicate Generate
      • replicate query
    • Recraft(images)
      • Appendix
      • Recraft Generate Image
      • Recraft Vectorize Image
      • Recraft Remove Background
      • Recraft Clarity Upscale
      • Recraft Create style
      • Recraft Generative Upscale
  • Video Generation Ednpoint
    • runway(video)
      • official format
        • runway images raw video
        • runway to get task details
      • Reverse Format
        • generate(text)
        • generate(Reference images)
        • Video to Video Style Redraw
        • Act-one Expression Migration
        • feed-get task
    • kling (video)
      • callback_url
      • Generating images
      • Text Generation Video
      • Image Generation Video
      • Video Extension
      • virtual try-on
      • lip sync
      • effects
      • Individual queries (videos)
    • MiniMax Conch(video)
      • MiniMax Conch Official Documentation
      • MiniMax Conch Generation
      • MiniMax Conch Query
      • MiniMax Conch Download
    • luma (video)
      • Official api interface format
        • luma generate
        • luma search
    • PIKA(video)
      • pika feed
      • PIKA Reference Video Generation
      • PIKA Reference Image Generation
      • PIKA reference text generation
    • sora
      • Reverse Format
        • Create Video
        • Query Video Task
        • Create Video
  • Software Integration Guide
    • cometapi Site API Call Testing
    • OpenManus
    • Chatbox
    • CherryStudio
    • Cursor
    • ChatHub
    • COZE
    • FastGPT
    • cline
    • dify
    • gptme
    • Immersive Translation
    • Lobe-Chat
    • Zotero
    • LangChain
    • AnythingLLM
    • Eudic Translation
    • OpenAI Translator
    • ChatAll Translation
    • Pot Translation
    • GPT Academic Optimization (gpt_academic)
    • NEXT CHAT (ChatGPT Next Web)
    • Obsidian's Text Generator Plugin
    • Open WebUI
    • avante.nvim
    • librechat
    • Lazy Customer Service
    • utools-ChatGPT Friend
    • IntelliJ Translation Plugin
    • n8n
  1. OpenAI Compatiable Endpoint

Image Editing (gpt-image-1)

Developing
POST
https://api.comet.com/v1/images/edits
Maintainer:Not configured
official docs:

Image Editing Request Parameters#

ParameterTypeRequiredDescription
imagefile or file array✅The image(s) to edit. Must be a supported image file or array.
• gpt-image-1: PNG, WEBP, or JPG, each <25MB.
• dall-e-3: 1 square PNG <4MB.
promptstring✅A text description of the desired image(s).
• Max length: 1000 chars for dall-e-3, 32000 chars for gpt-image-1.
maskfileOptional PNG mask image. Transparent areas (alpha = 0) indicate regions to edit.
Must match size of image and be <4MB.
Applies to first image.
modelstringModel to use: "dall-e-3" or "gpt-image-1".
ninteger or nullNumber of images to generate.
Must be between 1 and 10.
Default: 1.
qualitystring or nullFor gpt-image-1 only.
Options: "high", "medium", "low".
Default: "auto".
response_formatstring or nullResponse format.
Options: "url" or "b64_json".
Only dall-e-3 supports "url" (valid for 60 minutes).
sizestring or nullImage size:
• gpt-image-1: "1024x1024", "1536x1024", "1024x1536", "auto" (default)
• dall-e-3: "256x256", "512x512", "1024x1024"

Request

Header Params
Authorization
string 
optional
Example:
Bearer {{api-key}}
Body Params multipart/form-data
image
file 
required

The image(s) to edit. Must be a supported image file or array.

  • gpt-image-1: PNG, WEBP, or JPG, each <25MB.
  • dall-e-3: 1 square PNG <4MB.
Example:
MQ==/test.png
prompt
string 
required
A text description of the desired image(s).
Max length: 1000 chars for dall-e-3, 32000 chars for gpt-image-1.
Example:
Put on glasses
model
string 
optional
Model to use: "dall-e-3" or "gpt-image-1".
Example:
gpt-image-1
mask
file 
optional

Optional PNG mask image. Transparent areas (alpha = 0) indicate regions to edit.
Must match size of image and be <4MB. Applies to first image.

n
string 
optional
Number of images to generate. Must be between 1 and 10.
Default:
1
quality
enum<string> 
optional
For gpt-image-1 only. Options: "high", "medium", "low".
Default: "auto".
Allowed values:
highmediumlow
Default:
auto
response_format
enum<string> 
optional
Response format. Options: "url" or "b64_json".
Only "dall-e-3" supports "url" (valid for 60 minutes).
Allowed values:
urlb64_json
size
string 
optional
Image size:
gpt-image-1: "1024x1024", "1536x1024", "1024x1536", "auto" (default)
dall-e-3: "256x256", "512x512", "1024x1024"

Request samples

Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl --location --request POST 'https://api.comet.com/v1/images/edits' \
--header 'Authorization: Bearer {{api-key}}' \
--form 'image=@"MQ==/test.png"' \
--form 'prompt="Put on glasses"' \
--form 'model="gpt-image-1"' \
--form 'mask=@""' \
--form 'n=""' \
--form 'quality=""' \
--form 'response_format=""' \
--form 'size=""'

Responses

🟢200success
application/json
Body
data
array [object {1}] 
required
Contains a list of response data objects.
b64_json
string 
required
Base64 encoded image data
created
integer 
required
The Unix timestamp (in seconds) of when the chat completion was created.
usage
object 
required
Usage statistics for the completion request.
completion_tokens
integer 
required
Number of tokens in the generated completion.
completion_tokens_details
object 
required
Breakdown of tokens used in a completion.
prompt_tokens
integer 
required
Number of tokens in the prompt. It equals prompt_cache_hit_tokens + prompt_cache_miss_tokens.
prompt_tokens_details
object 
required
Breakdown of tokens used in a promt.
total_tokens
integer 
required
Total number of tokens used in the request (prompt + completion).
Example
{
    "created": 1713833628,
    "data": [
        {
            "b64_json": "..."
        }
    ],
    "usage": {
        "total_tokens": 100,
        "input_tokens": 50,
        "output_tokens": 50,
        "input_tokens_details": {
            "text_tokens": 10,
            "image_tokens": 40
        }
    }
}
Previous
Realtime
Next
Create speech
Built with