API Doc-CometAPI
HomeDashBoardModel_Price
HomeDashBoardModel_Price
Discord_Support
  1. Audio
  • GET START
    • Model New Release Announcement
    • Help Center
    • Quick Start
    • About Pricing
    • About Grouping
    • Interface Stability
    • Privacy policy
    • Terms of service
    • Error code description
    • Code example
    • Must see for use
    • Common Misconceptions
    • Confusion about use
    • Best Practices
      • Retry Logic Documentation for CometAPI and OpenAI Official API
      • Midjourney Best Practices
      • Runway Best Practices
  • OpenAI Compatiable Endpoint
    • gpt-4o-image generates image
      POST
    • Chat
      POST
    • Recognizing Images
      POST
    • Models
      GET
    • Embeddings
      POST
    • Images
      POST
    • Realtime
      POST
    • Image Editing (gpt-image-1)
      POST
  • Audio
    • Create speech
      POST
    • Create transcription
      POST
    • Create translation
      POST
  • Anthropic Compatiable Endpoint
    • Anthropic Claude
      POST
  • Music Generation Endpoint
    • Suno
      • Setting suno Version
      • Generate lyrics
      • Generate music clip
      • Upload clip
      • Submit concatenation
      • Single task query
      • Batch query tasks
    • Udio(Temporarily unavailable)
      • Generate music
      • Task query
  • Image Generation Endpoint
    • Midjourney(images)
      • Task Fetching API
        • List by Condition
        • Fetch Single Task (most recommended)
      • Imagine
      • Change (UPSCALE; VARIATION; REROLL)
      • Action (UPSCALE; VARIATION; REROLL; ZOOM, etc.)
      • Blend (image -> image)
      • Describe (image -> text)
      • Modal (Area Redesign & Zoom)
    • Ideogram(images)
      • Official documentation (updated in real time)
      • ideogram Text Raw Image
      • ideogram Hybrid image
      • ideogram enlargement HD
      • ideogram describes the image
      • ideogram Edit image
    • Flux(images)
      • Generate image (replicate format)
      • flux fine-tune images(Temporarily unavailable)
      • flux generate image(Temporarily unavailable)
      • flux query
    • Replicate(image)
      • replicate Generate
      • replicate query
    • Recraft(images)
      • Appendix
      • Recraft Generate Image
      • Recraft Vectorize Image
      • Recraft Remove Background
      • Recraft Clarity Upscale
      • Recraft Create style
      • Recraft Generative Upscale
  • Video Generation Ednpoint
    • runway(video)
      • official format
        • runway images raw video
        • runway to get task details
      • Reverse Format
        • generate(text)
        • generate(Reference images)
        • Video to Video Style Redraw
        • Act-one Expression Migration
        • feed-get task
    • PIKA(video)
      • pika feed
      • PIKA Reference Video Generation
      • PIKA Reference Image Generation
      • PIKA reference text generation
    • luma (video)
      • Official api interface format
        • luma generate
        • luma search
    • kling (video)
      • callback_url
      • Generating images
      • Text Generation Video
      • Image Generation Video
      • Video Extension
      • virtual try-on
      • lip sync
      • Query tasks (images)
      • Individual queries (videos)
    • MiniMax Conch(video)
      • MiniMax Conch Official Documentation
      • MiniMax Conch Generation
      • MiniMax Conch Query
      • MiniMax Conch Download
    • sora
      • Official Format
        • Create Video
        • Query Video Task
      • Reverse Format
        • Create Video
  • Software Integration Guide
    • cometapi Site API Call Testing
    • OpenManus
    • Chatbox
    • CherryStudio
    • Cursor
    • ChatHub
    • cline
    • dify
    • gptme
    • Immersive Translation
    • Lobe-Chat
    • Zotero
    • LangChain
    • AnythingLLM
    • Eudic Translation
    • OpenAI Translator
    • ChatAll Translation
    • Pot Translation
    • GPT Academic Optimization (gpt_academic)
    • NEXT CHAT (ChatGPT Next Web)
    • Obsidian's Text Generator Plugin
    • Open WebUI
    • avante.nvim
    • librechat
    • Lazy Customer Service
    • utools-ChatGPT Friend
    • IntelliJ Translation Plugin
    • n8n
  1. Audio

Create translation

POST
https://api.cometapi.com/v1/audio/translations
Maintainer:Not configured

POST /v1/audio/translations#

This endpoint is used to request audio translations using the specified parameters.

Request Body#

file (text): The audio file object (not file name) to translate, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
model (text): ID of the model to use. Only whisper-1 (which is powered by our open source Whisper V2 model) is currently available.
prompt (text): An optional text to guide the model's style or continue a previous audio segment. The prompt should be in English.
response_format (text): The format of the output, in one of these options: json, text, srt, verbose_json, or vtt.
temperature (text): The sampling temperature, between 0 and 1. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. If set to 0, the model will use log probability to automatically increase the temperature until certain thresholds are hit.

Request

Header Params
Authorization
string 
required
Example:
Bearer {{api-key}}
Body Params multipart/form-data
file
file 
required
The audio file object (not file name) translate, in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, or webm.
model
string 
required
ID of the model to use. Only whisper-1 (which is powered by our open source Whisper V2 model) is currently available.
Example:
whisper-1
prompt
string 
required
An optional text to guide the model's style or continue a previous audio segment. The prompt should be in English.
response_format
string 
required
The format of the output, in one of these options: json, text, srt, verbose_json, or vtt.
Example:
json
temperature
string 
required
The sampling temperature, between 0 and 1. Higher values like 0.8 will make the output more random, while lower values like 0.2 will make it more focused and deterministic. If set to 0, the model will use log probability to automatically increase the temperature until certain thresholds are hit.
Example:
0

Request samples

Shell
JavaScript
Java
Swift
Go
PHP
Python
HTTP
C
C#
Objective-C
Ruby
OCaml
Dart
R
Request Request Example
Shell
JavaScript
Java
Swift
curl --location --request POST 'https://api.cometapi.com/v1/audio/translations' \
--header 'Authorization: Bearer {{api-key}}' \
--form 'file=@""' \
--form 'model="whisper-1"' \
--form 'prompt=""' \
--form 'response_format="json"' \
--form 'temperature="0"'

Responses

🟢200Create translation
text/plain
Body
text
string 
required
Example
{
  "text": "Hello, my name is Wolfgang and I come from Germany. Where are you heading today?"
}
Previous
Create transcription
Next
Anthropic Claude
Built with