Tokaify APITokaify API
API 参考
AI 模型接口音频doubao接口seed-tts-1.0

语音生成

seed-tts-1.0 语音生成接口

调用时 model 固定为 seed-tts-1.0

POST
/v1/audio/speech

Authorization

BearerAuth

AuthorizationBearer <token>

模型 relay 接口鉴权。请求头:Authorization: Bearer

In: header

Request Body

application/json

model*string

语音合成模型。

Default"seed-tts-1.0"
input*string

要朗读的文本内容。

voice*string

音色,例如 alloynova。 voice 字符串字段。范围:非空字符串或按业务配置校验。

Default"zh_female_cancan_mars_bigtts"
format?string

输出音频格式,例如 mp3wav

speed?number

语速。 speed 数值字段。范围:以接口说明或后台配置为准。

Response Body

audio/mpeg

curl -X POST "https://api.tokaify.com/v1/audio/speech" \  -H "Content-Type: application/json" \  -d '{    "model": "seed-tts-1.0",    "input": "需要处理的输入文本。",    "voice": "zh_female_cancan_mars_bigtts",    "format": "json",    "speed": 1  }'
"string"

请求参数

字段类型必选默认值描述
modelstring固定为 seed-tts-1.0
inputstring要合成的文本。参考火山引擎在线语音合成接口,普通音色单次文本建议不超过 1024 字节;长文本或复刻音色以渠道配置为准。
voicestringzh_female_cancan_mars_bigtts字节音色 key,例如 zh_female_cancan_mars_bigtts,实际可用音色以账号授权为准。
response_formatstringpcm输出音频格式,对应官方 audio.encoding,支持 mp3wavpcmogg_opus;其中 wav 通常不用于流式场景。
speednumber1语速,对应官方 audio.speed_ratio,范围 [0.2, 3]1 表示正常语速。

示例代码

curl https://api.tokaify.com/v1/audio/speech \
  -H "Authorization: Bearer $TOKAIFY_API_KEY" \
  -H "Content-Type: application/json" \
  --output speech.mp3 \
  -d '{
    "model": "seed-tts-1.0",
    "input": "你好,这是一段由 Seed TTS 1.0 生成的语音。",
    "voice": "zh_female_cancan_mars_bigtts",
    "response_format": "mp3",
    "speed": 1
  }'
import requests

response = requests.post(
    "https://api.tokaify.com/v1/audio/speech",
    headers={"Authorization": "Bearer YOUR_TOKAIFY_API_KEY"},
    json={
        "model": "seed-tts-1.0",
        "input": "你好,这是一段由 Seed TTS 1.0 生成的语音。",
        "voice": "zh_female_cancan_mars_bigtts",
        "response_format": "mp3",
        "speed": 1,
    },
)
response.raise_for_status()
with open("speech.mp3", "wb") as file:
    file.write(response.content)
import { writeFile } from "node:fs/promises";

const response = await fetch("https://api.tokaify.com/v1/audio/speech", {
  method: "POST",
  headers: {
    Authorization: `Bearer ${process.env.TOKAIFY_API_KEY}`,
    "Content-Type": "application/json",
  },
  body: JSON.stringify({
    model: "seed-tts-1.0",
    input: "你好,这是一段由 Seed TTS 1.0 生成的语音。",
    voice: "zh_female_cancan_mars_bigtts",
    response_format: "mp3",
    speed: 1,
  }),
});

const audio = await response.arrayBuffer();
await writeFile("speech.mp3", Buffer.from(audio));

业务提示

voice 对应字节音色 key;不传时使用默认音色 zh_female_cancan_mars_bigtts。如需控制音量、音高、情感或语言,官方字段分别为 volume_ratiopitch_ratioemotionlanguage,具体支持范围以渠道配置和音色能力为准。

这篇文档对您有帮助吗?

最后更新于