Create translation
Use CometAPI POST /v1/audio/translations to translate spoken audio into English text with a selected translation model and response format.
First request
Send a supported audio file with model and file. Use a short sample first so you can verify upload handling, language detection, and output parsing.
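A first request could be assembled roughly as follows. This is a minimal sketch using only the Python standard library, not a definitive client. It assumes the endpoint accepts a multipart/form-data upload with fields named model and file, and that the base URL is https://api.cometapi.com; the helper name build_translation_request and the model ID passed to it are placeholders, so check them against your account and the Models page.

```python
import uuid
import urllib.request

# Assumption: placeholder base URL; confirm the real host for your account.
BASE_URL = "https://api.cometapi.com"


def build_translation_request(api_key, audio_bytes, filename, model):
    """Build (but do not send) a POST /v1/audio/translations request.

    Assumes a multipart/form-data body with `model` and `file` fields.
    """
    boundary = uuid.uuid4().hex
    # Text part carrying the model ID.
    model_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
    ).encode("utf-8")
    # Binary part carrying the audio file.
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    closing = f"--{boundary}--\r\n".encode("utf-8")
    body = model_part + file_header + audio_bytes + b"\r\n" + closing

    return urllib.request.Request(
        BASE_URL + "/v1/audio/translations",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

To send it, pass the returned object to urllib.request.urlopen and read the response body. Keeping construction separate from sending makes it easy to inspect the upload before spending quota on a real call.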
Read the response
The default response includes the translated English text. If you request a different response format, parse the response according to that format.
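Format-aware parsing might look like the sketch below. It assumes that the json and verbose_json formats return a JSON object with a text field, and that text, srt, and vtt return plain response bodies; extract_translation is a hypothetical helper name, not part of any SDK.

```python
import json


def extract_translation(body, response_format="json"):
    """Return the translated text (or subtitle document) from a raw body string."""
    if response_format in ("json", "verbose_json"):
        # Assumption: JSON responses carry the translation in a "text" field.
        return json.loads(body)["text"]
    if response_format == "text":
        return body.strip()
    if response_format in ("srt", "vtt"):
        # Subtitle documents are returned unchanged; cue parsing is up to the caller.
        return body
    raise ValueError(f"unsupported response format: {response_format!r}")
```

Raising on an unknown format, rather than guessing, keeps a mistyped format name from being silently treated as JSON.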
Next steps
- Use Create Transcription when you need text in the original language.
- Use Create Speech when you need to generate audio from text.
Authorizations
Bearer token authentication. Use your CometAPI key.
Body
The audio file to translate. Supported formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm.
The audio translation model to use. Choose a current speech model from the Models page.
Optional text to guide the model's style or continue a previous audio segment. The prompt should be in English.
The output format for the translation.
Available options: json, text, srt, verbose_json, vtt
Sampling temperature between 0 and 1. Higher values produce more random output; lower values are more focused. When set to 0, the model auto-adjusts temperature using log probability.
Required range: 0 <= x <= 1
Response
The translation result in English.
The translated text in English.
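The optional body fields can be checked on the client before a request is sent. The sketch below validates the output format against the options listed under Body and the sampling temperature against the 0 to 1 range. The field names response_format and temperature, and the helper name build_optional_fields, are assumptions for illustration.

```python
# Output formats listed in the Body section.
ALLOWED_FORMATS = {"json", "text", "srt", "verbose_json", "vtt"}


def build_optional_fields(response_format=None, temperature=None):
    """Validate optional parameters and return them as form-field strings.

    Fields left as None are omitted so the server applies its own defaults.
    """
    fields = {}
    if response_format is not None:
        if response_format not in ALLOWED_FORMATS:
            raise ValueError(f"unsupported response format: {response_format!r}")
        fields["response_format"] = response_format
    if temperature is not None:
        if not 0 <= temperature <= 1:
            raise ValueError("temperature must satisfy 0 <= x <= 1")
        fields["temperature"] = str(temperature)
    return fields
```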