音频模型
创建语音
使用 CometAPI POST /v1/audio/speech 通过 TTS 模型将文本转换为逼真的音频。可从 10 种声音中进行选择,调整速度,并输出为 MP3、OPUS、AAC、FLAC、WAV 或 PCM。
POST
Python (OpenAI SDK)
Documentation Index
Fetch the complete documentation index at: https://apidoc.cometapi.com/llms.txt
Use this file to discover all available pages before exploring further.
授权
Bearer token authentication. Use your CometAPI key.
请求体
application/json
The TTS model to use. Choose a current speech model from the Models page.
The text to generate audio for. Maximum length is 4096 characters.
Maximum string length:
4096The voice to use for speech synthesis.
可用选项:
alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer The audio output format.
可用选项:
mp3, opus, aac, flac, wav, pcm The speed of the generated audio. Select a value between 0.25 and 4.0.
必填范围:
0.25 <= x <= 4响应
200 - audio/mpeg
The audio file content.
The response is of type file.