API Doc-CometAPI

Release Notes

🌟 2025-11-27

🚨 [URGENT] Announcement: Deprecation and Upgrade of Claude 3 Series & Gemini 2.5 Preview Models

According to the latest official notifications from Anthropic and Google, our platform will officially deprecate the legacy Claude 3 Series and Gemini 2.5 Preview Series models on December 1st at 00:00. To avoid API call failures, please ensure you switch to the following Model IDs before the deadline:

1. Claude Series (Upgrade to 4.5)

| Version | Please replace with new Model ID |
| --- | --- |
| Intelligent (Sonnet) | claude-sonnet-4-5-20250929 |
| Most Powerful (Opus) | claude-opus-4-5-20251101 |
| Fastest (Haiku) | claude-haiku-4-5-20251001 |

2. Gemini Series (Upgrade to 2.5 Stable / 3.0 Preview)

| Version | Please replace with new Model ID |
| --- | --- |
| Standard (Flash) | gemini-2.5-flash |
| Image Enhanced | gemini-2.5-flash-image or gemini-3-pro-image-preview |
| Professional (Pro) | gemini-2.5-pro or gemini-3-pro-preview |

⚠️ Note: The old models will cease to function immediately after December 1st. Please migrate as soon as possible to ensure business continuity.
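The migration above can be sketched as a small lookup helper. This is only an illustration: the replacement IDs are copied from the tables, but the keyword matching, the function name `suggest_replacement`, and the sample legacy IDs are assumptions, not part of CometAPI. Where the table offers two options, the sketch picks the first.

```python
from typing import Optional

# Replacement IDs copied from the tables above.
CLAUDE_REPLACEMENTS = {
    "sonnet": "claude-sonnet-4-5-20250929",
    "opus": "claude-opus-4-5-20251101",
    "haiku": "claude-haiku-4-5-20251001",
}
# Order matters: "flash-image" must be checked before "flash".
GEMINI_REPLACEMENTS = [
    ("flash-image", "gemini-2.5-flash-image"),
    ("flash", "gemini-2.5-flash"),
    ("pro", "gemini-2.5-pro"),
]


def suggest_replacement(model_id: str) -> Optional[str]:
    """Return a suggested new model ID, or None if the ID is not covered.

    The prefix and keyword matching is an assumption made for this sketch.
    """
    name = model_id.lower()
    if name.startswith("claude-3"):
        for tier, new_id in CLAUDE_REPLACEMENTS.items():
            if tier in name:
                return new_id
    if name.startswith("gemini-2.5") and "preview" in name and "lite" not in name:
        for keyword, new_id in GEMINI_REPLACEMENTS:
            if keyword in name:
                return new_id
    return None


# Illustrative legacy IDs; check your own configuration for the real ones.
print(suggest_replacement("claude-3-opus-20240229"))
print(suggest_replacement("gemini-2.5-flash-preview-09-2025"))
```

IDs that the tables do not cover (for example Flash-Lite previews) return `None` so they can be reviewed by hand.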
📅 2025-11-26
🌟 CometAPI Major Launch: FLUX.2 Series - Limited Time Offer 🎉
🚀 Now Supporting Asynchronous Format Models:
🔹 black-forest-labs/flux-2-pro
🔹 black-forest-labs/flux-2-dev
🔹 black-forest-labs/flux-2-flex
💰 Limited Time Promotion: Lower than Official Pricing!
💡 FLUX.2 Key Highlights:
🖼️ Multi-Reference Editing: Supports 8-10 reference images to satisfy complex character generation needs.
📸 Ultra-High Quality: Up to 4MP resolution for ultimate natural realism.
⚡ Flexible Selection:
• Pro: Designed for high-efficiency production and fast delivery.
• Flex: Maximizes image quality with adjustable parameters.
• Dev: Developer-friendly optimization.
👇 Start Building Now
Create Predictions - API Doc
🌟 2025-11-25
🎉 CometAPI Launches Claude Opus 4.5 Series!
🚀 Available Models:
🔹 claude-opus-4-5-20251101-thinking
🔹 claude-opus-4-5-20251101
🔹 cometapi-opus-4-5-20251101-thinking
🔹 cometapi-opus-4-5-20251101
💡 Why Claude Opus 4.5?
Top choice for intensive reasoning, code automation, and complex Agent systems.
✨ Key Highlights:
🧠 Superior Reasoning: Handles complex logic.
📝 Automation: Enterprise-grade efficiency.
🤖 Agents: Advanced tool integration.
⚡ Stability: Reliable long-context performance.
📖 Documentation:
👉 Chat - API Doc-CometAPI
👉 Anthropic Messages - API Doc-CometAPI

🌟 2025-11-20

🎉 CometAPI Launches Nano Banana Pro! 🎉

🔹 gemini-3-pro-image-preview, gemini-3-pro-image
Gemini 3 Pro Image (also known as Nano Banana Pro) is Google's flagship image generation model designed for high-fidelity professional workflows. This release introduces "Deep-Context" understanding for highly complex prompts, perfects in-image typography generation, offers distinct object editing without manual masking, and significantly enhances photorealism and lighting physics.
Follows the Google standard format. See details: CometAPI Chat Documentation https://apidoc.cometapi.com/gemini-generates-image-20873272e0
GUIDE: https://apidoc.cometapi.com/guide-to-calling-gemini-2-5-flash-image-1425263m0

🎉 CometAPI Launches Grok 4.1 Fast Series Models! 🎉

🚀 Available Models:

🔹 grok-4-1-fast-reasoning, grok-4-1-fast-non-reasoning
A cutting-edge multimodal model designed specifically for high-performance tool calling and complex interaction scenarios. It delivers exceptional logical processing capabilities while maintaining ultra-fast response speeds. Supports a maximum context of 2M tokens.
Flexible Dual Modes:
reasoning: Enhanced logical reasoning, ideal for complex problem-solving.
non-reasoning: Optimized for extreme speed, ideal for high-concurrency tasks.
Format Support: Chat format
Documentation: 👉 Check out the Chat API documentation

🌟 2025-11-19

🎉 CometAPI Launches Gemini 3 Pro Model! 🎉

🔹 gemini-3-pro-preview, gemini-3-pro-preview-thinking
gemini-3-pro-preview: Google's most intelligent model with SOTA (state-of-the-art) reasoning and multimodal understanding capabilities, featuring powerful agentic and vibe coding abilities. Max context: 2M tokens; Knowledge cutoff: January 1, 2025.

Key Features:

Unified Multimodal: Text and image processing with real-time analysis
Million-Token Context: Handle massive documents and codebases
Advanced Reasoning: Multi-step problem-solving with RL optimization
High Performance: Sparse MoE architecture + Google TPU v6
Best For: code generation, multimodal understanding
Format Support: Chat format
Documentation: 👉 Check out the Chat API documentation

🌟 2025-11-14

🎉 Major Update! CometAPI Now Supports the Full GPT-5.1 Model Series! 🎉

🚀 Available Models & Usage Guide

GPT-5.1 is OpenAI's latest flagship model, designed for advanced coding and agent tasks.
General Specs: 400k context window, 128k max output, with a knowledge cutoff of September 30, 2024.

🔹 gpt-5.1 & gpt-5.1-chat-latest (For Dialogue & General Tasks)

Model IDs: gpt-5.1, gpt-5.1-chat-latest
Description: OpenAI's flagship models, ideal for building multi-turn conversational applications that demand powerful reasoning and comprehension.
Recommended Endpoint: /v1/chat
Documentation: 👉 Check out the Chat API documentation

🔹 gpt-5.1-codex (For Professional Code Tasks)

Model ID: gpt-5.1-codex
Description: Optimized for coding tasks like code generation, completion, and analysis to leverage its best-in-class coding capabilities.
Required Endpoint: /v1/responses (Note: This endpoint must be used for this model.)
Documentation: 👉 Check out the Responses API documentation
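The endpoint rules above can be sketched as a lookup. The model IDs and endpoint paths are taken as written in this note; the function name `endpoint_for` and the decision to raise on unknown IDs are assumptions made for illustration.

```python
# Model IDs and paths as written in the GPT-5.1 note above.
RESPONSES_ONLY_MODELS = {"gpt-5.1-codex"}
CHAT_MODELS = {"gpt-5.1", "gpt-5.1-chat-latest"}


def endpoint_for(model_id: str) -> str:
    """Return the endpoint path this note lists for a GPT-5.1 model ID."""
    if model_id in RESPONSES_ONLY_MODELS:
        return "/v1/responses"  # required for gpt-5.1-codex
    if model_id in CHAT_MODELS:
        return "/v1/chat"  # recommended for the dialogue models
    raise ValueError(f"model not covered by this note: {model_id}")


for model in ("gpt-5.1", "gpt-5.1-chat-latest", "gpt-5.1-codex"):
    print(model, "->", endpoint_for(model))
```

Raising on unknown IDs keeps a typo in a model name from being silently routed to the wrong endpoint.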

🎉 CometAPI Grand Launch of qwen-image and qwen-image-edit! 🎉

🚀 Available Models:

🔹 qwen-image

🔹 qwen-image-edit

qwen-image: A general-purpose image generation model that creates entirely new images from text prompts. It suits creative generation, stylized drawing, and similar scenarios.
It is built on large-scale vision-language models and supports multi-language prompts, but its core focus is generation rather than editing.
qwen-image-edit: An optimized version of Qwen-Image tailored for image editing tasks, with stronger local modification and consistency preservation. It can generate new images and also make precise edits to existing ones.
Both models are called using the OpenAI standard image generation format. For details, refer to: Text-to-Image, Image-to-Image

🌟 2025-11-12

🎉 CometAPI proudly launches the new gpt-image-1-mini model! 🎉

🚀 Available Models:

🔹 gpt-image-1-mini

gpt-image-1-mini: OpenAI's cost-effective image generation model. It accepts text and image input and outputs images, and suits large-scale, cost-sensitive generation scenarios.
This model is called using the OpenAI standard image generation format. For details, refer to: Text-to-Image, Image-to-Image

📢 Additional Announcements:

CometAPI Partners with Bria! Throughout November 2025, all Bria interfaces are open to all users free of charge. Give them a try!
Sora Asynchronous Format Update: CometAPI has completed the switch to Sora's asynchronous format and no longer supports the OpenAI chat format.
Use the sora-2-pro or sora-2 models to call this interface (official per-second billing): Sora API.
Use the sora-2-all or sora-2-pro-all models to call this interface (billed per item, after discount: sora-2-all: 0.08, sora-2-pro-all: 0.8): Sora All API.
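As a rough worked example of the per-item billing above, the sketch below multiplies the listed discounted prices by a number of generated items. The note does not state the currency unit, so the result is in whatever unit the price list uses; the function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

# Discounted per-item prices quoted in the announcement above (unit not stated there).
PER_ITEM_PRICE = {
    "sora-2-all": Decimal("0.08"),
    "sora-2-pro-all": Decimal("0.8"),
}


def estimate_cost(model: str, items: int) -> Decimal:
    """Estimate the charge for `items` generations on a per-item billed model."""
    if items < 0:
        raise ValueError("items must be non-negative")
    return PER_ITEM_PRICE[model] * items


print(estimate_cost("sora-2-all", 25))
print(estimate_cost("sora-2-pro-all", 3))
```

`Decimal` is used instead of `float` so that prices such as 0.08 are not subject to binary rounding error.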

🌟 2025-11-10

🎉 CometAPI Launches K2-Thinking Series New Models! 🎉

🚀 Available Models:

🔹 k2-thinking

🔹 k2-thinking-turbo

k2-thinking: Moonshot AI's most advanced open reasoning model, extending the K2 series. It is a thinking model with universal Agentic capabilities and reasoning abilities. Supports 256K tokens context window.
k2-thinking-turbo: Based on k2-thinking, it provides faster response speeds and higher concurrency capabilities, supporting the same 256K context and reasoning functions, suitable for high-efficiency scenarios.
The above models follow the OpenAI Chat standard format for invocation. For details, refer to: https://apidoc.cometapi.com/chat
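A minimal sketch of a request body in the OpenAI Chat format for these models. Only the `model` and `messages` fields of the standard chat format are used; the helper name `build_chat_request` and the sample prompt are assumptions, and sending the request (URL, authentication) is left to the linked Chat documentation.

```python
import json
from typing import Optional


def build_chat_request(model: str, user_text: str,
                       system_text: Optional[str] = None) -> dict:
    """Build an OpenAI-Chat-style request body (hypothetical helper)."""
    messages = []
    if system_text:
        messages.append({"role": "system", "content": system_text})
    messages.append({"role": "user", "content": user_text})
    return {"model": model, "messages": messages}


payload = build_chat_request("k2-thinking", "Summarize the K2 release in one sentence.")
print(json.dumps(payload, indent=2))
```

Switching to the faster variant only requires passing `"k2-thinking-turbo"` as the model name.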

🌟 2025-11-07

Comet Major Update Announcement: Sora-2 Invocation Method Optimization

To improve efficiency and stability, we will optimize the Sora-2 invocation method starting from 2025-11-11 08:00 UTC.

Key Changes

No longer supported: Using the OpenAI reverse-engineered Chat format for invocation.
New asynchronous format: Model name switches to sora-2-all or sora-2-pro-all to call the asynchronous interface format (notification will be sent as soon as it's live).
Pricing remains unchanged.

Recommended Actions

Please complete the interface switch by the update time to avoid service interruption.
We will provide the new format as soon as possible for testing to ensure a smooth transition.
Currently, you can continue using the official format (billed per second). For details, see the documentation: https://apidoc.cometapi.com/create-video-22425640e0.
If you have any questions, please contact customer service. We are committed to providing a better experience. Thank you for your support!

🌟 2025-10-27

📢 Comet Launches Suno Music Style Tags Expansion Feature!

🎵 Feature Introduction
Struggling to describe music styles? Now simply input keywords, and the system will automatically generate professional, detailed music style descriptions!
Feature Highlights:
🚀 One-click Expansion - Input simple words, get professional style descriptions
🎯 Precise Description - Includes multi-dimensional descriptions of instruments, rhythm, atmosphere, etc.
⚡ Instant Response - Quickly obtain expansion results, boost creative efficiency
💡 Creative Inspiration - Provide more inspiration for music creation
Request Example:
https://apidoc.cometapi.com/add-style-tags-23376169e0

🌟 2025-10-17

🎉 CometAPI New Models Announcement 🎉

🚀 New Models
🔹 claude-haiku-4-5-20251001 / cometapi-haiku-4-5-20251001
⚡ Low Latency & High Throughput: Optimized for real-time, high-concurrency scenarios.
🧠 Configurable Reasoning Depth: Supports "extended thinking" mode.
📄 Massive Context: Up to 200K input tokens, 8K output tokens.
💻 Strong Code Capabilities: Code generation, debugging, tool calling.
💰 Cost Advantage: ~1/3 the cost of Sonnet 4.
🔧 Format Support: Claude native message format + chat format.
🔹 glm-4.6
Zhipu AI's latest flagship model with 355B total params, 32B active.
💻 Coding Excellence: Aligns with Claude Sonnet 4, best in China.
📚 Extended Context: Expanded from 128K to 200K tokens.
🧠 Enhanced Reasoning: Supports tool calling during inference.
🔍 Search Optimization: Improved tool calling and agent performance.
✍️ Better Writing: Enhanced style, readability, and role-playing alignment.
🌍 Multilingual: Boosted cross-language translation capabilities.
🔧 Format Support: Chat format.
🔹 veo3.1 / veo3.1-pro
Google's latest AI video generation models for high-quality video creation.
🎬 High Resolution: 1080p video generation.
🎵 Synchronized Audio: Dialogue, ambient sounds, effects with native lip-sync.
⏱️ Video Length: Generate seamless clips up to 8 seconds.
🎨 Creative Control: Reference image support, first/last frame setting, cinematic presets.
⚡ Dual Variants: Veo3.1 (standard quality) + Veo3.1-Pro (maximum quality).
🔧 Format Support: Async calls + chat format.
All models support chat format calls, with Claude models additionally supporting native message format for maximum integration flexibility!
Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025-10-10

🎉 Major Model Update - 3 New AI Services! 🎉

🔥 GPT-5 Complete Series (7 Models)
World's most advanced reasoning models with 400k context window.
gpt-5-minimal: Lightning-fast for simple tasks.
gpt-5-low: Speed-optimized (212 tokens/sec).
gpt-5-medium: Balanced performance for general use.
gpt-5-high: Maximum "deep thinking" mode.
gpt-5-codex-low / gpt-5-codex-medium / gpt-5-codex-high: Specialized for coding & software engineering.
✨ Features: State-of-the-art coding, mathematics, visual perception & complex reasoning.
📋 API Documentation: GPT-5 Series
🔹 suno-midi
🎵 Suno MIDI API: Convert AI-generated music to MIDI files.
Perfect for DAW workflows, remixing & music production.
📋 API Documentation: Suno MIDI
🔹 sora-2 / sora-2-pro
🎬 Official OpenAI video generation with synced audio.
Realistic physics & object interactions.
Professional-grade cinematic quality.
Same pricing as OpenAI official rates.
📋 API Documentation: Sora-2
🚀 All models live now!
Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025-10-06

🎉 CometAPI Now Supports GPT-5 Pro! 🎉

Enhanced reasoning for complex tasks.
Advanced problem solving with high accuracy.
Strong multi-domain performance.

🛠 Usage

Use either model name: gpt-5-pro or gpt-5-pro-2025-10-06.
Call Details: https://apidoc.cometapi.com/response-18535147e0

🌟 2025-10-01

🎉 CometAPI Now Supports Sora 2! 🎉

Physics-accurate, highly realistic video generation.
Synchronized audio & dialogue.
Strong temporal consistency.
Multi-style output (cinematic, anime, etc.).
Real-world cameo injection and precise re-render controls.
Built-in safety and moderation.
Note: Initial capacity may be limited; use streaming with chat format.

🛠 Usage

Use Chat Completions with model set to sora-2 and stream enabled.
Call Details: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025-09-30

🎉 CometAPI Now Supports Claude Sonnet 4.5, DeepSeek-V3.2-Exp, and Gemini 2.5 Flash New Versions! 🎉

🚀 Claude Sonnet 4.5

Available Model Names: claude-sonnet-4-5-20250929-thinking, claude-sonnet-4-5-20250929, claude-sonnet-4-5, cometapi-sonnet-4-5-20250929-thinking, cometapi-sonnet-4-5-20250929, cometapi-sonnet-4-5
Claude Sonnet 4.5 has world-leading coding capabilities (SOTA-Level Coding). It achieved an astonishing 77.2% accuracy on the authoritative SWE-bench benchmark, which measures real-world software engineering abilities, making it the world's strongest coding model. This means it has made a qualitative leap in handling complex programming tasks, debugging, and even architectural design.

🚀 DeepSeek-V3.2-Exp Highlights

The DeepSeek-V3.2-Exp model is an experimental version. As an intermediate step towards the next-generation architecture, V3.2-Exp builds on V3.1 by introducing DeepSeek Sparse Attention (a sparse attention mechanism), exploring and validating optimizations to training and inference efficiency on long texts.

🚀 Gemini 2.5 Flash Highlights

gemini-2.5-flash-preview-09-2025: A model that excels in cost-effectiveness and provides comprehensive features. 2.5 Flash is best suited for large-scale processing of low-latency, high-data-volume tasks that require thinking, as well as agent application scenarios.
gemini-2.5-flash-lite-preview-09-2025: The fastest Flash model, specially optimized for cost-benefit and high throughput.
Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025-09-24

🎉 CometAPI Now Supports Suno v5! 🎉

Superior Audio Quality: Significantly enhanced audio clarity, vocal performance, and mixing precision.
Immersive Experience: Delivers lifelike vocals and powerful creative control.
Professional Creation: Generates emotionally rich, genre-accurate, high-quality songs.

🛠 Usage

Set the request parameter mv to chirp-crow.
Call Details: https://apidoc.cometapi.com/generate-music-clip-13851480e0
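A minimal sketch of selecting Suno v5 on a request body. Only the `mv` parameter and its `chirp-crow` value come from this note; the `prompt` field and the helper name `with_suno_v5` are hypothetical placeholders, and the real request fields are described in the linked call details.

```python
SUNO_V5_MV = "chirp-crow"  # value given in the usage note above


def with_suno_v5(payload: dict) -> dict:
    """Return a copy of a music-generation payload with Suno v5 selected."""
    updated = dict(payload)  # shallow copy so the caller's dict is untouched
    updated["mv"] = SUNO_V5_MV
    return updated


# "prompt" is a placeholder field name used only for this illustration.
base = {"prompt": "an upbeat synthwave track"}
request_body = with_suno_v5(base)
print(request_body)
```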

🌟 2025-09-23

🚀 New and Updated Models:

🔹 grok-4-fast-non-reasoning

🔹 grok-4-fast-reasoning

🔹 grok-code-fast-1

grok-4-fast-non-reasoning: The non-reasoning variant of xAI's Grok-4 Fast series, with a unified architecture for handling fast responses, suitable for real-time search and simple queries. It possesses extremely powerful technical parameters and ecosystem capabilities: context window supports up to 2,000,000 tokens, cost-efficient (input $0.20/million tokens), leading mainstream models.
grok-4-fast-reasoning: The reasoning variant of xAI's Grok-4 Fast series, supporting long-chain thinking and tool calls, suitable for complex tasks such as mathematical reasoning and agent workflows. Ranked first in the LMArena search arena (1163 Elo), it possesses extremely powerful technical parameters and ecosystem capabilities: context window supports up to 2,000,000 tokens, leading mainstream models.
grok-code-fast-1: xAI's fast model specifically designed for agent coding, optimized for tool integration such as grep and file editing, achieving 70.8% performance on SWE-Bench-Verified, suitable for automated code generation and debugging. Currently supports text modality, with vision and other features coming soon. It possesses extremely powerful technical parameters and ecosystem capabilities: context window supports up to 256,000 tokens, leading coding-specific models.
Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025-09-11

🚀 New and Updated Models: minimax-hailuo-02, bytedance-seedream-4-0-250828, VEO3 Updated!

🔹 minimax-hailuo-02

Added support for minimax-hailuo-02, MiniMax's latest AI video generation model, which aims to transform the video creation process. It inherits the strengths of the previous-generation Hailuo 01 and makes significant advances in core technology and user experience.
Click the link to experience it now: https://apidoc.cometapi.com/minimax-conch-generation-14660582e0

🔹 bytedance-seedream-4-0-250828

Added support for bytedance-seedream-4-0-250828. As a new-generation image creation model, Seedream 4.0 integrates image generation and image editing capabilities into a unified architecture. This enables it to flexibly handle complex multimodal tasks, including knowledge-based generation, complex reasoning, and reference consistency. Compared to its predecessor, it has faster inference speed and can produce high-definition images up to 4K resolution.
Click the link to experience it now: https://apidoc.cometapi.com/bytedance-image-generation-19773064e0

🔹 VEO3

The entire VEO3 series follows the official price reduction: CometAPI prices are now half of the original.
VEO3 now supports asynchronous interfaces for task processing, optimizing the calling efficiency of long-duration tasks and enhancing the overall experience.
Click the link to experience it now: https://apidoc.cometapi.com/submit-video-generation-task-18941528e0

🌟 2025-09-07

🎉 CometAPI Heavyweight Launch: kimi-k2-250905 and qwen3-max-preview! 🎉

🔹 kimi-k2-250905

kimi-k2-250905: Moonshot AI's Kimi K2 series 0905 version, supporting ultra-long context (up to 256k tokens), frontend development, and tool calling.
🧠 Enhanced Tool Calling: 100% accuracy, seamless integration, suitable for complex tasks and integration optimization.
⚡️ More Efficient Performance: TPS up to 60-100 (standard API), up to 600-100 in Turbo mode, providing faster responses and improved reasoning capabilities, with a knowledge cutoff of mid-2025.

🔹 qwen3-max-preview

qwen3-max-preview: The latest model from Alibaba's Tongyi Qianwen team, positioned as the highest-performing model in the series.
🧠 Powerful Multimodal and Reasoning: Supports ultra-long context (up to 128k tokens) and multimodal input, excels in complex reasoning, code generation, translation, and creative content.
⚡️ Breakthrough Improvements: Significant optimization in multiple technical indicators, faster response speed, a knowledge cutoff in 2025, suitable for enterprise-level high-precision AI applications.

✅ All models belong to the default group, with seamless integration. It is recommended to choose the most suitable version based on your specific business scenarios (performance, speed, cost) to maximize application value.

Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025-08-27

🔹 gemini-2.5-flash-image-preview, gemini-2.5-flash-image

Gemini 2.5 Flash Image (also known as nano-banana) is Google's most advanced image generation and editing model. This update enables you to blend multiple images into a single image, maintain character consistency to tell richer stories, perform targeted transformations using natural language, and use Gemini's world knowledge to generate and edit images.
Follows the OpenAI chat standard format. See details: CometAPI Chat Documentation https://apidoc.cometapi.com/gemini-generates-image-20873272e0
GUIDE: https://apidoc.cometapi.com/guide-to-calling-gemini-2-5-flash-image-1425263m0

🌟 2025-08-22

🔹 deepseek-v3.1, deepseek-v3-1-250821

deepseek-v3.1, deepseek-v3-1-250821: DeepSeek-V3.1 is DeepSeek's all-new hybrid inference model.
🧠 Hybrid inference: Think & Non-Think, one model with two modes
⚡️ Faster thinking: DeepSeek-V3.1 reaches answers in less time vs. DeepSeek-R1-0528
Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025-08-20

🎉 CometAPI Update: Suno Adds Instrumental & Vocals, Plus Major Upgrades for Kling's Effects, Quality, and Models! 🎉

🔹 Suno

🎵 Introducing Two Major Music Creation Features: Easily add accompaniment to vocals and generate lyrics & vocals for instrumental tracks.
Add Instrumental: Upload an a cappella vocal track, and Suno will intelligently generate and add a matching accompaniment.
Add Vocals: Upload an instrumental track, and Suno will generate lyrics and a vocal performance to match.
Documentation: Suno Scenario Application Guide

🔹 Kling

✨ Massive Video Effects Library Expansion: Added 63 new video effects (62 single-subject effects and 1 two-person interactive effect), bringing the total to 80 available effects for more creative choices.
🔊 Video-to-Audio Optimization: The video-to-audio generation feature now supports full-resolution video uploads for more precise sound effect matching.
📈 Multi-Image to Video Performance Skyrockets: Experience a 102% improvement over the previous version! See significant enhancements in subject consistency, dynamic quality, and interaction naturalness. This is a seamless upgrade with no code changes required.
🎬 Text-to-Video Quality Upgrade: Version 1.6 now supports the generation of higher-quality videos.
Parameter Example: "mode": "pro"
Documentation: Kling Video Generation
🎨 Image Generation Model Update: The new kling-v2-new model is now live, supporting nearly 300 image styles to maximize your creativity!
Documentation: Kling Image Generation

🌟 2025-08-18

🚀 New and Updated Models: Runway, VEO3, hunyuan-3D, Midjourney Fully Updated!

🔹 Runway

Runway model adds multiple core functions, expanding video and image generation capabilities:
Video to Video: Video to video generation.
Text to Image: Text to image generation.
Video Upscale: Video super-resolution enhancement.
Control a Character: Character control function.
Click the link to experience it now: https://apidoc.cometapi.com/generate-a-video-from-a-video-20308134e0

🔹 VEO3

VEO3 now supports asynchronous interface for task processing, optimizing the calling efficiency of long-duration tasks and enhancing the overall experience.
Click the link to experience it now: https://apidoc.cometapi.com/submit-video-generation-task-18941528e0

🔹 Hunyuan3D

Supports Hunyuan3D-2, providing powerful 3D content creation capabilities to assist in efficiently generating high-quality 3D models.
Click the link to experience it now: https://apidoc.cometapi.com/hunyuan3d-20073774e0

🌟 2025-08-08

🔹 GPT-5 Series
gpt-5, gpt-5-2025-08-07: OpenAI's flagship model, widely recognized as the industry's most powerful for coding, reasoning, and agentic tasks. It is designed to handle the most complex cross-domain challenges and excels in code generation, advanced reasoning, and autonomous agents, making it the premier choice for users demanding peak performance.
gpt-5-chat-latest: The continuously updated version of GPT-5. It always incorporates the latest features and optimizations, recommended for applications that need to stay current with the latest model capabilities.
🔹 GPT-5 Mini Series
gpt-5-mini, gpt-5-mini-2025-08-07: The cost-effective version of GPT-5, specifically optimized for speed and cost. It strikes an excellent balance between performance and affordability, making it the ideal choice for everyday tasks like general chat, content creation, and routine Q&A.
🔹 GPT-5 Nano Series
gpt-5-nano, gpt-5-nano-2025-08-07: The fastest and most cost-effective lightweight version in the GPT-5 family. It is perfect for scenarios requiring high throughput and instant responses, such as text classification, sentiment analysis, summary extraction, and data formatting.
API Call Instructions: gpt-5-chat-latest should be called using the standard /v1/chat/completions format. For other models (gpt-5, gpt-5-mini, gpt-5-nano, and their dated versions), using the /v1/responses format is recommended. For details, please refer to: https://apidoc.cometapi.com/api-13851472

Note

Important: top_p is not supported by this series of models.
Temperature Settings
gpt-5-chat-latest: Supports custom temperature values between 0 and 1 (inclusive).
All other GPT-5 models: The temperature is fixed at 1. You may set it to 1 or omit it (defaults to 1).
When calling the GPT-5 series models (excluding gpt-5-chat-latest), the max_tokens field should be changed to max_completion_tokens.
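The parameter rules in this note can be sketched as a small normalizer applied to a request body before sending it. The rules come from the note; the function name `normalize_gpt5_params` and the choice to drop a fixed temperature (relying on the default of 1) rather than overwrite it are assumptions made for illustration.

```python
def normalize_gpt5_params(model: str, params: dict) -> dict:
    """Adjust request parameters to the GPT-5 rules listed in the note above."""
    out = dict(params)
    # top_p is not supported by any model in this series.
    out.pop("top_p", None)

    if model == "gpt-5-chat-latest":
        # Custom temperature is allowed, but only between 0 and 1 inclusive.
        temperature = out.get("temperature")
        if temperature is not None and not 0 <= temperature <= 1:
            raise ValueError("temperature must be between 0 and 1")
        return out

    # All other GPT-5 models: temperature is fixed at 1, so omit it
    # and let the default apply.
    out.pop("temperature", None)
    # max_tokens must be sent as max_completion_tokens.
    if "max_tokens" in out:
        out["max_completion_tokens"] = out.pop("max_tokens")
    return out


print(normalize_gpt5_params("gpt-5", {"top_p": 0.9, "temperature": 0.2, "max_tokens": 256}))
print(normalize_gpt5_params("gpt-5-chat-latest", {"temperature": 0.5, "max_tokens": 64}))
```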

🌟 2025-08-06

🔹 claude-opus-4-1-20250805
claude-opus-4-1-20250805: Anthropic's flagship Claude Opus 4.1 model, achieving major breakthroughs in programming, reasoning, and agentic tasks, with SWE-bench Verified reaching 74.5%.
Significantly enhanced multi-file code refactoring, debugging precision, and detail-oriented reasoning capabilities. This model is suitable for demanding programming and reasoning scenarios.
We have also added cometapi-opus-4-1-20250805 specifically for Cursor integration.
šŸ”¹ claude-opus-4-1-20250805-thinking
claude-opus-4-1-20250805-thinking: Claude Opus 4.1 version with extended thinking capabilities, providing up to 64K tokens of deep reasoning capacity.
Optimized for research, data analysis, and tool-assisted reasoning tasks, with powerful detail-oriented reasoning abilities.
We have also added cometapi-opus-4-1-20250805-thinking specifically for Cursor integration.
🔹 gpt-oss-120b
gpt-oss-120b: OpenAI's 117B-parameter open-source Mixture of Experts (MoE) model, designed for high-level reasoning, agentic, and general production use cases.
🔹 gpt-oss-20b
gpt-oss-20b: A 21B-parameter open-source MoE model with 3.6B active parameters, optimized for low-latency inference and deployment on consumer-grade hardware.
All above models follow the OpenAI chat standard format for API calls. For details, please refer to: https://apidoc.cometapi.com/api-13851472
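As a rough illustration of the OpenAI chat standard format referenced throughout these notes, the sketch below builds (but does not send) a chat request with Python's standard library. The helper name `build_chat_request` is hypothetical, the key `sk-xxxxxx` is a placeholder, and the minimal `model` plus `messages` body is an assumption based on the standard OpenAI chat format; consult the linked documentation for the full parameter list.

```python
import json
import urllib.request

# Chat endpoint as given elsewhere in these release notes.
CHAT_URL = "https://api.cometapi.com/v1/chat/completions"


def build_chat_request(api_key: str, model: str, user_text: str) -> urllib.request.Request:
    """Build a POST request in the OpenAI chat format (hypothetical helper)."""
    body = {
        "model": model,
        "messages": [{"role": "user", "content": user_text}],
    }
    return urllib.request.Request(
        CHAT_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Placeholder key; replace with a real CometAPI key before sending.
req = build_chat_request("sk-xxxxxx", "gpt-oss-120b", "Say hello.")
# To send the request:
#   with urllib.request.urlopen(req) as resp:
#       print(json.load(resp))
```

Switching to another model that follows the chat format should only require changing the `model` argument.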

🌟 2025.08.05#

🚀 Feature Updates: gemini-2.5-flash-lite, o3 & o4-mini Deep Research, Volcano Engine Generation Models
gemini-2.5-flash-lite - Google's most cost-effective model, built for large-scale tasks!
⚡️ High Efficiency: Designed for large-scale, low-latency applications.
🔧 Standard Format: Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation
o3 & o4-mini Deep Research Agents - Get in-depth analysis reports with web-connected research agents!
🧠 Advanced Analysis: Supports multi-step reasoning and provides reports with citations.
🤖 Available Models: o3-deep-research, o3-deep-research-2025-06-26, o4-mini-deep-research, o4-mini-deep-research-2025-06-26
📚 How to Call: The four deep research models above must be called using the following format:
Volcano Engine Video & Image Models - Experience powerful new video and image models!
🎬 Video Generation: Create videos from images (bytedance-seedance-1-0-pro, bytedance-seedance-1-0-lite-i2v-250428) or text (bytedance-seedance-1-0-lite-t2v-250428).
🎨 Image Generation & Editing: Generate images with bytedance-seedream-3.0-t2i or edit them using prompts with bytedance-seedEdit-3.0-i2i.
📚 How to Call: https://apidoc.cometapi.com/api-19771367

🌟 2025.07.31#

🚀 Feature Updates: MJ Video Generation, Flux-Kontext Multi-Image Reference, Kling-v1-6 Multi-Image Reference
MJ Video Generation - Transform static images into dynamic video effects!
🎬 New capability: The original MJ image generation endpoint /mj/submit/imagine now supports video generation
🎨 Bring creativity to life: Perfect for creating animated effects, creative video generation, and various other applications
📚 Learn more: View Documentation
Flux-Kontext Series Multi-Image Reference - Enhanced AI creation with multiple references!
🖼️ Expanded support: Now supports uploading up to 4 reference images (previously only a single image was supported)
🎯 Precision boost: Multi-image reference makes AI creation more precise with richer inspiration
🔧 Compatible models: Only supported by the black-forest-labs/flux-kontext-max and black-forest-labs/flux-kontext-pro models
📚 Learn more: View Documentation
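The Flux-Kontext limits above (at most 4 reference images, multi-image only on the two black-forest-labs models) could be checked on the client side before a request is submitted. The following is a minimal sketch; the function name and the idea of passing reference images as a list of URLs are assumptions, not part of the documented API.

```python
# Models that accept multiple reference images, per the release note above.
MULTI_IMAGE_MODELS = {
    "black-forest-labs/flux-kontext-max",
    "black-forest-labs/flux-kontext-pro",
}
MAX_REFERENCE_IMAGES = 4


def check_reference_images(model: str, image_urls: list[str]) -> None:
    """Raise ValueError if the reference images break the stated limits (hypothetical helper)."""
    count = len(image_urls)
    if count > MAX_REFERENCE_IMAGES:
        raise ValueError(f"at most {MAX_REFERENCE_IMAGES} reference images are supported, got {count}")
    if count > 1 and model not in MULTI_IMAGE_MODELS:
        raise ValueError(f"{model} does not support multiple reference images")
```

Running the check first avoids spending a request on input that the note says is unsupported.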
Kling-v1-6 Multi-Image Reference - Elevate your video generation quality!
📸 Multi-image input: Supports up to 4 images as reference input
⚡ Quality enhancement: Significantly improves video generation quality
🎯 Model specific: Only available for `kling- View Documentation

🌟 2025.07.29#

🔹 Latest Support: glm-4.5, glm-4.5-air, glm-4.5-x, glm-4.5-airx, glm-4.5-flash
glm-4.5: Flagship model with 355B total parameters and 32B active parameters, designed for agentic applications, supporting hybrid reasoning modes and excelling in complex reasoning, tool calling, and web browsing.
glm-4.5-air: Cost-effective model with 106B total parameters and 12B active parameters, maintaining strong performance while significantly reducing costs, ideal for resource-sensitive applications.
glm-4.5-x: High-performance model optimized for ultra-fast inference and powerful reasoning capabilities, delivering millisecond-level response times for scenarios requiring speed and logic.
glm-4.5-airx: Lightweight yet powerful model combining Air's cost advantages with X's speed benefits, offering the perfect balance between performance and efficiency.
glm-4.5-flash: Efficient multi-purpose model with high generation speed, specifically optimized for coding and reasoning tasks, suitable for developers getting started and rapid prototyping.
- Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025.07.25#

🔹 New Model: gemini-2.5-pro-all, gemini-2.5-flash-all, gemini-2.5-pro-deepsearch, gemini-2.5-flash-deepsearch, deepseek-r1t2-chimera
gemini-2.5-pro-all: A multimodal version of the Gemini model, supporting analysis of files, videos, and images, as well as image generation and real-time web access.
gemini-2.5-flash-all: A multimodal version of the Gemini model, supporting analysis of files, videos, and images, as well as image generation and real-time web access.
gemini-2.5-pro-deepsearch: A deep search model with enhanced deep search and information retrieval capabilities, ideal for complex knowledge integration and analysis.
gemini-2.5-flash-deepsearch: A deep search model combining the rapid performance of the Flash model with advanced deep search capabilities for fast, in-depth information discovery.
deepseek-r1t2-chimera: A 671B parameter Mixture-of-Experts (MoE) text generation model merged from DeepSeek-AI's R1-0528, R1, and V3-0324, supporting a context of up to 60k tokens.
- Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025.07.24#

🔹 qwen3-coder-plus
qwen3-coder-plus: Focused on code generation, understanding, and optimization; excels at complex programming tasks.
🔹 qwen3-coder-plus-2025-07-22
qwen3-coder-plus-2025-07-22: Optimized version from 2025-07-22; stable and reliable, suitable for production.
🔹 qwen3-coder-480b-a35b-instruct
qwen3-coder-480b-a35b-instruct: Flagship model with 480 billion total parameters (35 billion active) and an MoE architecture, capable of handling extremely complex programming tasks.
These models follow the OpenAI chat standard format. For details, see:
https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.07.18#

🔹 Suno v4.5+
Suno v4.5+: v4.5+ offers richer sounds, new creation methods, and a maximum track length of 8 minutes. CometAPI now supports Suno v4.5+; to use it, set the request parameter mv to chirp-bluejay.
The above model follows the Suno format; please refer to: https://apidoc.cometapi.com/api-13851480
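Selecting a Suno version comes down to the value of the `mv` request parameter. The sketch below maps version labels to the `mv` values named in these release notes (chirp-bluejay for v4.5+, chirp-auk for v4.5). The helper name and the `prompt` field in the usage line are illustrative assumptions; only `mv` and its values come from the notes.

```python
# mv values taken from the Suno v4.5+ and Suno v4.5 release notes.
SUNO_MV = {
    "v4.5": "chirp-auk",
    "v4.5+": "chirp-bluejay",
}


def set_suno_version(payload: dict, version: str) -> dict:
    """Return a copy of `payload` with `mv` set for the given Suno version (hypothetical helper)."""
    if version not in SUNO_MV:
        raise ValueError(f"unknown Suno version: {version}")
    return {**payload, "mv": SUNO_MV[version]}


# "prompt" is a placeholder field used only for illustration.
request_body = set_suno_version({"prompt": "an upbeat synthwave track"}, "v4.5+")
```

The remaining request fields should follow the Suno format described in the linked documentation.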

🌟 2025.07.17#

CometAPI now supports uploading masked images to Midjourney for local modifications.
Refer to: https://apidoc.cometapi.com/api-18989894

🌟 2025.07.16#

🔹 kimi-k2-0711-preview
kimi-k2-0711-preview: Kimi K2 is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, with 1 trillion total parameters and 32 billion active parameters per forward pass. It is optimized for agentic capabilities, including advanced tool use, reasoning, and code synthesis.
Kimi K2 performs well on a variety of benchmarks, especially in coding (LiveCodeBench, SWE-bench), reasoning (ZebraLogic, GPQA), and tool use (Tau2, AceBench).
It supports long-context inference with up to 128K tokens and features a novel training stack that includes the MuonClip optimizer for stable large-scale MoE training.
The model follows the OpenAI chat standard format. For details, see:
https://apidoc.cometapi.com/chat-api-13851472
🌟 Google officially retired the versioned gemini-2.5 preview models on 07-15, so requests to the preview models are now forwarded to the official release; we recommend calling gemini-2.5-pro. Official notice: https://cloud.google.com/vertex-ai/generative-ai/docs/models/gemini/2-5-flash

🌟 2025.07.14#

CometAPI now supports passing the URL of a PDF file directly in OpenAI API calls, so PDFs can be processed without uploading the file.
For details on how to call it, see: https://apidoc.cometapi.com/api-18535147
🌟 OpenAI officially retired the gpt-4.5 series models on 07-14; we recommend calling the gpt-4.1 models instead. Official notice: https://platform.openai.com/docs/deprecations

🌟 2025.07.11#

🚀 CometAPI supports Claude Code!
• Add power to your development workflow. We're excited to announce that CometAPI now fully supports the powerful Claude Code.
• What does this mean for you?
• Top AI features: Easily generate, debug, and optimize code using models built specifically for developers.
• ⚙️ Flexible Model Selection: Our comprehensive range of models lets you develop more seamlessly.
• Seamless Integration: APIs are always available. Integrate Claude Code directly into your existing workflow in minutes.
• Ready to build faster? Click the link below to get started.
• https://apidoc.cometapi.com/doc-1266358

🌟 2025.07.10#

🔹 grok-4
🔹 grok-4-0709
grok-4, grok-4-0709: Currently supports text only, with vision, image generation, and other features coming soon. Key specification: a context window of up to 256,000 tokens, ahead of mainstream models.
The model follows the OpenAI chat standard format. For details, see:
https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.07.04#

Suno now supports stem separation, Persona creation, MP4 MV video generation, WAV file export, and Timing (lyrics and audio timeline).
Suno's full-track and single-track stem separation features are now supported; they can split a song into up to 12 clean tracks, including vocals, drums, and bass, for easy preview and download.
You can create a new Persona (singer style) and generate music in different formats.
Note: Full-track stem separation is priced at 5 times the music generation cost. Single-track billing is still being optimized: it currently uses the same 5x pricing as full-track and will be charged at 1x the base price in the future.
For specific usage, please refer to: https://apidoc.cometapi.com/api-18657316
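The stem separation pricing described above is a simple multiple of the music generation cost. As a quick worked sketch (the function name and mode labels are hypothetical, and the multipliers reflect only the current pricing stated in this note):

```python
# Current multipliers from the note above; single-track is expected to drop to 1x later.
STEM_MULTIPLIER = {
    "full_track": 5,
    "single_track": 5,
}


def stem_separation_cost(generation_cost: float, mode: str) -> float:
    """Estimate the stem separation cost from the base music generation cost."""
    if mode not in STEM_MULTIPLIER:
        raise ValueError(f"unknown stem separation mode: {mode}")
    return generation_cost * STEM_MULTIPLIER[mode]
```

For instance, with a base generation cost of 2.0 units, full-track separation would come to 10.0 units under the current pricing.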
🔹 veo3
🔹 veo3-pro
🔹 veo3-fast
🔹 veo3-frames
🔹 veo3-fast-frames
🔹 veo3-pro-frames
veo3, veo3-pro, veo3-fast: Google's latest official video generation model; it generates video with sound. veo3-frames, veo3-fast-frames, veo3-pro-frames: Support first-frame mode.
These models follow the OpenAI chat standard format for calls, refer to: https://apidoc.cometapi.com/api-18582532

🌟 2025.07.01#

🔹 mj_fast_video
Midjourney video generation is now supported.
It supports the same options as the official website: low motion, high motion, automatic generation, and manual generation.
For details, see: https://apidoc.cometapi.com/api-18581293

🌟 2025.07.01#

🔹 kling_image_expand
The Kling API for expanding images is now supported.
For details, see: https://apidoc.cometapi.com/api-18584170

🌟 2025.06.25#

🔹 black-forest-labs/flux-kontext-pro
🔹 black-forest-labs/flux-kontext-max
🔹 flux-kontext-pro
🔹 flux-kontext-max
black-forest-labs/flux-kontext-pro, black-forest-labs/flux-kontext-max:
The above two models follow the replicate call format; see details: https://apidoc.cometapi.com/api-16455857
flux-kontext-pro, flux-kontext-max:
The above two models follow the OpenAI chat standard call format; see details: https://apidoc.cometapi.com/chat-api-13851472
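Since the same Flux-Kontext models are exposed under two names with different call formats, a client may want to pick the format from the model name. The sketch below is one way to do that; the function name and the returned labels ("replicate", "openai-chat") are hypothetical.

```python
FLUX_KONTEXT_NAMES = {"flux-kontext-pro", "flux-kontext-max"}
REPLICATE_PREFIX = "black-forest-labs/"


def flux_kontext_call_format(model: str) -> str:
    """Return which call format a Flux-Kontext model name uses (hypothetical helper)."""
    # Prefixed names follow the replicate call format.
    if model.startswith(REPLICATE_PREFIX) and model[len(REPLICATE_PREFIX):] in FLUX_KONTEXT_NAMES:
        return "replicate"
    # Unprefixed names follow the OpenAI chat standard format.
    if model in FLUX_KONTEXT_NAMES:
        return "openai-chat"
    raise ValueError(f"not a Flux-Kontext model: {model}")
```

With this, `flux_kontext_call_format("black-forest-labs/flux-kontext-pro")` selects the replicate format and `flux_kontext_call_format("flux-kontext-pro")` selects the chat format.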

🌟 2025.06.19#

🔹 gemini-2.5-flash-lite-preview-06-17
gemini-2.5-flash-lite-preview-06-17: Built for large-scale processing at lower cost.
This model follows the OpenAI chat standard format for API calls, please refer to: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.06.11#

🔹 o3-pro
🔹 o3-pro-2025-06-10
o3-pro, o3-pro-2025-06-10: Supports web search, file analysis, visual input reasoning, Python programming, and personalized responses.
Compared to previous models, o3-pro shows significant improvements in clarity, completeness, instruction following, and accuracy.
This model follows the OpenAI v1/responses standard call format. Example request:
# Call o3-pro through the /v1/responses endpoint (replace sk-xxxxxx with your API key)
curl --location --request POST 'https://api.cometapi.com/v1/responses' \
--header 'Authorization: Bearer sk-xxxxxx' \
--header 'Content-Type: application/json' \
--data-raw '{
    "model": "o3-pro",
    "input": [{"role": "user", "content": "What is the difference between inductive and deductive reasoning?"}]
}'

🌟 2025.06.06#

🔹 gemini-2.5-pro-preview-06-05
gemini-2.5-pro-preview-06-05: With native multimodal processing and a very long context window of up to 1 million tokens, it provides strong support for complex, long-sequence tasks.
This model follows the OpenAI chat standard format for API calls, please refer to: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.05.30#

🌟 Gemini 1.5 series retirement notice:
Because Google no longer officially offers the Gemini 1.5 series, it will be phased out today.
Updated models: gemini-2.5-flash-preview-05-20, gemini-2.5-flash-preview-04-17, gemini-2.5-pro-preview-05-06, gemini-2.5-pro-preview-03-25, gemini-2.5-pro-exp-03-25.
Please call the 2.5 series instead. The above models follow the OpenAI chat standard format; refer to: https://apidoc.cometapi.com/api-276386060
🌟 Notes on using GPT to generate images:
The gpt-4o-image interface is implemented through a workaround rather than the official API, and it is not asynchronous, so it cannot be fully stable and occasional failures are expected.
If you need high stability, we recommend gpt-image-1, which goes through the official API and is more stable. It uses the official images/generations format; the endpoint is https://api.cometapi.com/v1/images/generations.
Both gpt-4o-image and gpt-image-1 can also be called in chat format through the same workaround; the endpoint is:
https://api.cometapi.com/v1/chat/completions

🌟 2025.05.29#

🔹 deepseek-r1-0528
deepseek-r1-0528: Advanced reasoning capabilities, large parameter scale, powerful performance, suitable for complex tasks.
This model follows the OpenAI chat standard format for API calls, please refer to: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.05.23#

🔹 claude-sonnet-4-20250514
claude-sonnet-4-20250514: An important model in the Claude 4 series developed by Anthropic, significantly improving coding and reasoning capabilities compared to its predecessor Claude Sonnet 3.7.
It can respond more precisely to user instructions and efficiently handle complex tasks. This model is suitable for applications requiring high performance and cost-effectiveness. We've also added cometapi-sonnet-4-20250514 specifically for use in Cursor.
🔹 claude-sonnet-4-20250514-thinking
claude-sonnet-4-20250514-thinking: An important model in the Claude 4 series developed by Anthropic, significantly improving coding and reasoning capabilities compared to its predecessor Claude Sonnet 3.7.
It can respond more precisely to user instructions and efficiently handle complex tasks. This model is suitable for applications requiring high performance and cost-effectiveness. We've also added cometapi-sonnet-4-20250514-thinking specifically for use in Cursor.
🔹 claude-opus-4-20250514
claude-opus-4-20250514: Opus 4 is Anthropic's most advanced model, acclaimed as the world's best coding model.
It excels in handling complex, long-running tasks and intelligent agent workflows, particularly suitable for applications requiring high autonomy and intelligence.
🔹 claude-opus-4-20250514-thinking
claude-opus-4-20250514-thinking: Opus 4 is Anthropic's most advanced model, acclaimed as the world's best coding model.
It excels in handling complex, long-running tasks and intelligent agent workflows, particularly suitable for applications requiring high autonomy and intelligence.
These models follow the OpenAI chat standard format for calls, refer to: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.05.07#

🔹 Suno v4.5
Suno v4.5: v4.5 offers more expressive music and richer vocals, designed to enhance users' expression and intuition in music creation. CometAPI now supports Suno v4.5; to use it, set the request parameter mv to chirp-auk.
The above model follows the Suno format; please refer to: https://apidoc.cometapi.com/api-13851480

🌟 2025.04.29#

🔹 qwen3-235b-a22b
qwen3-235b-a22b: The flagship model of the Qwen3 series, with 235 billion total parameters (22 billion active) and a Mixture of Experts (MoE) architecture.
It is particularly suitable for complex tasks requiring high-performance inference, such as coding, mathematics, and multimodal applications.
🔹 qwen3-30b-a3b
qwen3-30b-a3b: With 30 billion total parameters (about 3 billion active), it balances performance and resource requirements and is suitable for enterprise-level applications.
This MoE model fits scenarios that require efficient processing of complex tasks, such as intelligent customer service and content generation.
🔹 qwen3-8b
qwen3-8b: A lightweight model with 8 billion parameters, designed for resource-constrained environments (such as mobile devices or low-configuration servers).
Its efficiency and fast response capability make it suitable for simple queries, real-time interaction, and lightweight applications.
These models follow the OpenAI Chat standard format for calls. For specific details, please refer to:
https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.04.27#

🔹 gpt-image-1
gpt-image-1: Brings a natively multimodal model to the API. Built on GPT-4o's image generation capabilities, it gives developers a powerful and flexible tool for generating high-quality, diverse images.
Features: High-fidelity images; diverse visual styles; rich world knowledge; consistent text rendering; practical applications across multiple domains.
This model follows the OpenAI v1/images/generations format for calls; see details at: https://apidoc.cometapi.com/images-api-13851474. Here is an example of the input parameters:
{
    "model": "gpt-image-1",
    "prompt": "A cute baby sea otter",
    "n": 1,
    "size": "1024x1024"
}

🌟 2025.04.20#

🔹 gemini-2.5-flash-preview-04-17
gemini-2.5-flash-preview-04-17: Gemini 2.5 Flash is an AI model developed by Google, designed to give developers a fast and cost-effective option, especially for applications requiring enhanced reasoning capabilities.
According to the Gemini 2.5 Flash preview announcement, the preview version was released on April 17, 2025, supports multimodal input, and has a context window of up to 1 million tokens.
This model follows the OpenAI chat standard format for calls, refer to: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.04.17#

🔹 o4-mini
🔹 o4-mini-2025-04-16
o4-mini, o4-mini-2025-04-16: A smaller, faster, and more economical model that performs well in mathematics, coding, and visual tasks. It is designed to be efficient and responsive, making it suitable for developers. Released on April 16, 2025.
🔹 o3
🔹 o3-2025-04-16
o3, o3-2025-04-16: A reasoning-focused generative pre-trained transformer (GPT) model designed to handle problems requiring step-by-step logical reasoning.
It excels at mathematics, coding, and scientific tasks, and can also use tools such as web browsing and image generation. Released on April 16, 2025.
The above models follow the OpenAI chat standard format for calls, refer to: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.04.15#

🔹 gpt-4.1
gpt-4.1: Major advancements in coding and instruction following; GPT-4.1 has become the leading model for coding.
Long context: On Video-MME, a benchmark for multimodal long-context understanding, GPT-4.1 sets a new state-of-the-art result.
The GPT-4.1 model series delivers superior performance at lower cost.
🔹 gpt-4.1-mini
gpt-4.1-mini: Represents a significant leap in small model performance, even outperforming GPT-4o on many benchmarks.
It matches or exceeds GPT-4o in intelligence evaluations while reducing latency by nearly half and cost by 83%.
🔹 gpt-4.1-nano
gpt-4.1-nano: Features a larger context window, supporting up to 1 million tokens of context, and makes better use of that context through improved long-context understanding. It has an updated knowledge cutoff of June 2024.
These models follow the standard OpenAI chat format for API calls; for reference see: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.04.14#

🔹 grok-3-deepersearch
grok-3-deepersearch: Features high data timeliness, excellent interactive experience, and thorough search thinking process; comprehensive webpage aggregation.
This model follows the OpenAI chat standard format for API calls, refer to: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.04.13#

🔹 gemini-2.0-flash-exp-image-generation
This model supports conversation while enabling image generation and editing capabilities, outputting high-definition images.
This model follows the OpenAI chat standard format for API calls, refer to: https://apidoc.cometapi.com/api-15928299

🌟 2025.04.10#

🔹 grok-3-fast
🔹 grok-3-fast-latest
grok-3-fast, grok-3-fast-latest: grok-3 and grok-3-fast use exactly the same underlying model and provide the same response quality. However, grok-3-fast is served on faster infrastructure, delivering response times that are much quicker than the standard grok-3.
This model follows the OpenAI chat standard format for API calls, refer to: https://apidoc.cometapi.com/chat-api-13851472
🔹 grok-3-mini
🔹 grok-3-mini-latest
grok-3-mini, grok-3-mini-latest: A lightweight model that thinks before responding. Fast, intelligent, and ideal for logic-based tasks that don't require deep domain knowledge. The original thought traces are accessible.
This model follows the OpenAI chat standard format for API calls, refer to: https://apidoc.cometapi.com/chat-api-13851472
🔹 grok-3-mini-fast
🔹 grok-3-mini-fast-latest
grok-3-mini-fast, grok-3-mini-fast-latest: grok-3-mini and grok-3-mini-fast use exactly the same underlying model and provide the same response quality. However, grok-3-mini-fast is served on faster infrastructure, delivering response times that are much quicker than the standard grok-3-mini.
This model follows the OpenAI chat standard format for API calls, refer to: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.04.07#

🔹 llama-4-maverick
llama-4-maverick: A high-capacity multimodal language model from Meta that accepts multilingual text and image inputs and generates multilingual text and code output in 12 supported languages.
Maverick is optimized for vision-language tasks and is instruction-tuned for assistant-like behavior, image reasoning, and general multimodal interaction.
Maverick features native multimodal early fusion and a context window of 1 million tokens.
Maverick was released on April 5, 2025 under the Llama 4 Community License, for research and commercial applications requiring advanced multimodal understanding and high model throughput.
The model follows the OpenAI chat standard format for calls; see: https://apidoc.cometapi.com/chat-api-13851472
🔹 llama-4-scout
llama-4-scout: A Mixture-of-Experts (MoE) language model developed by Meta. It supports native multimodal input (text and images) and multilingual output (text and code) in 12 supported languages.
Designed for assistant-style interaction and visual reasoning, Scout uses 16 experts, a context length of 10 million tokens, and a training corpus of about 40 trillion tokens.
Designed for high efficiency and local or commercial deployment, llama-4-scout employs early fusion for seamless modality integration.
It is instruction-tuned for multilingual chat, captioning, and image understanding tasks.
It is released under the Llama 4 Community License, with a training data cutoff of August 2024 and a public release on April 5, 2025.
The model follows the OpenAI chat standard format for calls; see: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.03.29#

🔹 gpt-4o-all
gpt-4o-all: Supports ChatGPT's latest image generation mode.
The model follows the OpenAI chat standard format for calls; see: https://apidoc.cometapi.com/api-15928299
🔹 gpt-4o-image
gpt-4o-image: Dedicated to image generation and editing; it supports image style conversion, preserves the features of the original image with strong consistency, and outputs high-definition images.
The model follows the OpenAI chat standard format for calls; see: https://apidoc.cometapi.com/api-15928299

🌟 2025.03.27#

🔹 gemini-2.5-pro-exp-03-25
Features native multimodal processing capabilities and an extensive context window of up to 1 million tokens, providing strong support for complex, long-sequence tasks.
The model follows the OpenAI chat standard format for calls; see: https://apidoc.cometapi.com/chat-api-13851472
🔹 gemini-2.5-pro-preview-03-25
According to Google's data, Gemini 2.5 Pro performs particularly well on complex tasks.
The model follows the OpenAI chat standard format for calls; see: https://apidoc.cometapi.com/chat-api-13851472

🌟 2025.03.24#

🔹 gpt-4.5-preview-2025-02-27
Preview Version: Showcasing the latest features of GPT-4.5, providing enhanced understanding and generation capabilities, suitable for various tasks, improving user experience.
🔹 gpt-4.5-preview
Preview Version: Deeply optimized algorithms and performance, delivering ultra-fast responses and precise outputs, perfectly suited for efficient decision-making scenarios.
🔹 gpt-4.5
Professional Standard Version: Stable and reliable, combining rich expression and multi-task processing capabilities, suitable for wide applications including business, education, creative, and technical fields.

🌟 2025.02.20#

🔹 claude-3-7-sonnet-thinking
Advanced model designed for complex reasoning and creative thinking, unleashing unlimited possibilities, empowering breakthrough problem-solving and innovation.
🔹 claude-3-7-sonnet-20250219
High-end version integrating the latest technological breakthroughs, handling complex tasks with superior performance, providing intelligent innovative solutions for users.
🔹 cometapi-3-7-sonnet
Outstanding multi-domain processing expert, delivering precise and smooth output experience, easily tackling various professional challenges.
🔹 cometapi-3-7-sonnet-thinking
Equipped with revolutionary algorithm architecture, significantly enhancing deep analysis and complex task management capabilities, making thinking more thorough and comprehensive.

🔗 Usage Guide:#

✅ All models have been added to the default group, allowing you to flexibly call them according to different usage scenarios and requirements, easily integrate, and maximize their application value.

🛠 Quick Start:#

Simple integration into your system unlocks powerful capabilities. Fully utilize each model's unique advantages to meet professional needs across different domains.
🔥 Experience the revolutionary performance improvements these breakthrough models bring right now! 🔥

For professional support or detailed consultation, please contact our customer service team or visit our technical documentation center. We look forward to your valuable feedback!
Modified at 2025-11-27 09:17:29