
{"type":"doc","content":[{"type":"paragraph","attrs":{"id":"6acbd668-0a85-46a2-9cb6-7905f83187af","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","text":"2026年,腾讯云语音合成(TTS)在实时对话、音色丰富度和声音克隆方面都有值得关注的更新。本文基于2026年5-6月的产品动态和技术文档,梳理核心能力、接入方式和参数调优思路。"}]},{"type":"paragraph","attrs":{"id":"9b1e7ee2-c5bb-40f2-8a51-f3154aa11b87","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"一、2026年主要更新"}]},{"type":"paragraph","attrs":{"id":"847d5763-709d-44fe-b4d7-d66273be2f83","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"对话式TTS上线"},{"type":"text","text":"。基于TRTC(实时音视频)打造的新一代低延迟语音合成能力,首包延迟低至300ms,支持声音克隆与多语种。推荐模型 flow_02_turbo 支持中文、英文、日语、粤语四种语言。"}]},{"type":"paragraph","attrs":{"id":"31f14668-4be6-4aab-a664-4682da0b8b8c","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"17个新音色上线"},{"type":"text","text":"。2026年5月,语音合成上线了17个新音色,包括6个男音色和11个女音色,新增聊天风格音色和四川话。2026年3月上线了“沉稳青叔”“邻家女孩”2个超自然大模型音色。"}]},{"type":"paragraph","attrs":{"id":"f9bad557-a127-4770-92c4-15010ac6998c","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"二、核心能力"}]},{"type":"paragraph","attrs":{"id":"2872d029-b948-4b12-b926-7c51e81d434c","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"音色与语种"},{"type":"text","text":":腾讯云语音合成目前支持男女共46种声音效果。超自然大模型音色包括智小虎(聊天童声)、智小悟(聊天男声)、智小解(解说男声)、智小满(营销女声)、智小敏(聊天女声)等。支持中、英、日、韩等40 语种。"}]},{"type":"paragraph","attrs":{"id":"67a43651-2812-4017-a81c-6c476b31296a","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"合成方式"},{"type":"text","text":":提供通用语音合成和长文本语音合成两类产品。通用语音合成包含基础语音合成、实时语音合成和流式文本语音合成三种方式。长文本语音合成支持10万字以内的文本异步合成。"}]},{"type":"paragraph","attrs":{"id":"2cef391f-c9bd-4119-b5d1-9c8724dc96fe","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"SSML支持"},{"type":"text","text":":支持SSML标记语言,可自定义音量、语速等参数,语速从0.6倍到1.5倍可选。"}]},{"type":"paragraph","attrs":{"id":"2027907f-1c0d-4a50-ba50-5e0aea910af8","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"声音克隆"},{"type":"text","text":":提交少量语音样本(16k单声道wav,6秒-180秒)即可创建专属克隆音色。生成的VoiceId与精品音色ID用法一致,可在任意语音合成接口中直接使用。该服务目前限时免费。"}]},{"type":"paragraph","attrs":{"id":"fe19d7a7-5141-4f2e-9b37-7a07aadd89fb","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"三、免费额度与定价"}]},{"type":"paragraph","attrs":{"id":"52aa87ad-f503-40df-804b-88a58245655f","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"免费额度"},{"type":"text","text":":语音合成提供三类免费资源包,需在语音合成控制台领取——基础/精品音色800万字符、大模型音色10万字符、超自然大模型音色2万字符。仅支持通用语音合成接口,暂不支持长文本语音合成接口。免费资源包自领取之日起三个月内有效,过期作废,一个账号只能领取一次。"}]},{"type":"paragraph","attrs":{"id":"23b59f54-57eb-4362-896c-231065ee51ab","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"后付费价格"},{"type":"text","text":":通用语音合成-精品音色后付费单价约0.3元/万字符;超自然大模型音色采用梯度计价,日用量越大单价越低。"}]},{"type":"paragraph","attrs":{"id":"0518a086-e5da-4d8c-a12e-1af6d05c0268","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"四、Python接入示例"}]},{"type":"paragraph","attrs":{"id":"fab5deac-bab3-4ec2-9df1-6f4d63fd57d4","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","text":"以下代码基于腾讯云TTS SDK,实现基础的文本转语音功能:"}]},{"type":"codeBlock","attrs":{"id":"223f769a-8991-415f-95ba-7a20a14bf827","language":"javascript","theme":"atom-one-dark","runtimes":0,"isHoverDragHandle":false,"key":"","languageByAi":"javascript"},"content":[{"type":"text","text":"from tencentcloud.common import credentialnfrom tencentcloud.tts.v20190823 import tts_client, modelsnn# 初始化认证ncred = credential.Credential("YOUR_SECRET_ID", "YOUR_SECRET_KEY")nclient = tts_client.TtsClient(cred, "ap-guangzhou")nn# 构建请求nreq = models.TextToVoiceRequest()nreq.Text = "需要合成的文本内容"nreq.VoiceType = 1002# 音色IDnreq.Speed = 0 # 语速,范围-2到2nreq.Volume = 5# 音量,范围0到10nn# 发送请求并保存音频nresp = client.TextToVoice(req)nwith open("output.mp3", "wb") as f:nf.write(resp.Audio)"}]},{"type":"paragraph","attrs":{"id":"97606391-7131-4116-8932-cb4e67952b5f","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"VoiceType参考"},{"type":"text","text":":1002(成熟男声)、1003(活力男声)、1004(温润女声)、1005(甜美女声)、1050(新闻女声)"}]},{"type":"paragraph","attrs":{"id":"4eddf151-b68c-452a-94e4-ab1c5f487ece","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"声音克隆接口"},{"type":"text","text":":接口域名为 "},{"type":"text","marks":[{"type":"link","attrs":{"href":"https://trtc.tencentcloudapi.com/","target":"_blank","rel":"noreferrer","class":null}}],"text":"trtc.tencentcloudapi.com"},{"type":"text","text":",接口名称为 VoiceClone。提交音频样本后返回 VoiceId,可在任意语音合成接口中使用。"}]},{"type":"paragraph","attrs":{"id":"0d5881e6-1390-4adf-a372-728a11475e43","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"五、参数调优思路"}]},{"type":"paragraph","attrs":{"id":"f584a87f-ddff-4c18-ab95-060e4e515011","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"音色选型"},{"type":"text","text":":在腾讯云控制台的声音试听功能中,依次试听不同音色风格,确定最适合项目场景的音色ID。"}]},{"type":"paragraph","attrs":{"id":"d113aae3-6746-45bf-82a8-62c276875264","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"语速调试"},{"type":"text","text":":根据内容的节奏需求,在SDK中逐步调整Speed参数(-2到2),先确定大致范围再精细化调整。"}]},{"type":"paragraph","attrs":{"id":"8c4c3f10-aa40-4a7d-89cd-618751823189","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"音量校准"},{"type":"text","text":":参考主流音频平台的响度标准,通过Volume参数(0-10)调整输出音量。"}]},{"type":"paragraph","attrs":{"id":"1739676e-4f1e-432c-85cf-e97a3ff08953","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"SSML控制"},{"type":"text","text":":对于需要精细停顿或强调的文本,使用SSML标签进行标注。"}]},{"type":"paragraph","attrs":{"id":"f850ac71-da32-429e-bcef-df18515a454f","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"六、适用场景参考"}]},{"type":"paragraph","attrs":{"id":"0ed8fc8b-a2da-4558-91ae-3fa7aec2ecf6","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"需要批量生产、API集成"},{"type":"text","text":" → 通用语音合成,800万字符免费额度,支持SSML标记语言和40 语种,提供多语言SDK。"}]},{"type":"paragraph","attrs":{"id":"47f92ecd-944b-46cc-881e-fdd43fd1c109","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"需要实时对话场景"},{"type":"text","text":" → 对话式TTS(flow_02_turbo),首包延迟低至300ms,支持声音克隆与多语种。"}]},{"type":"paragraph","attrs":{"id":"02138f56-7518-4166-91dc-4b84e15aabd2","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"需要声音克隆"},{"type":"text","text":" → 声音克隆服务,6-180秒录音即可克隆,VoiceId可直接用于合成,目前限时免费。"}]},{"type":"paragraph","attrs":{"id":"d87e0c59-ff4b-437b-9a09-492aed961cf2","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","marks":[{"type":"textStyle","attrs":{"color":"","background":""}},{"type":"bold"}],"text":"小结"}]},{"type":"paragraph","attrs":{"id":"7c838c1a-b2b5-4297-80e3-2a57175380dc","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","text":"2026年的腾讯云TTS在实时对话、音色丰富度和声音克隆方面都有更新。对话式TTS的首包延迟低至300ms,适合实时交互场景;17个新音色和四川话的加入扩展了音色选择范围;声音克隆服务目前限时免费,可以低成本体验。"}]},{"type":"paragraph","attrs":{"id":"cb8f7aae-5b4b-4ab9-8a99-63e8725d0f47","textAlign":"inherit","indent":0,"color":null,"background":null,"isHoverDragHandle":false},"content":[{"type":"text","text":"以上信息基于2026年5-6月产品动态和技术文档整理,具体以腾讯云官网实时展示为准。"}]}]}","createTime":1782792376,"ext":{"closeTextLink":0,"comment_ban":0,"description":"","focusRead":0},"favNum":0,"html":"","isOriginal":0,"likeNum":0,