Speak settings control how assistant text becomes audio. Use this section for speak.* runtime options and deployment settings related to text-to-speech, voices, pronunciation, ambient audio, and speech delivery.
Speak settings are used by voice-capable deployments: Phone Call, Web Widget with spoken responses, and Web App / SDK with spoken responses. Text-only channels do not use speak configuration.
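As a rough illustration of the rule above, the sketch below decides whether a deployment reads speak configuration. The `Channel` enum, its member names, and the `spoken_responses` flag are hypothetical names chosen for this example, not identifiers from the product.

```python
from enum import Enum


class Channel(Enum):
    # Hypothetical channel identifiers, used only for this illustration.
    PHONE_CALL = "phone_call"
    WEB_WIDGET = "web_widget"
    WEB_APP_SDK = "web_app_sdk"
    TEXT_ONLY = "text_only"


def uses_speak_settings(channel: Channel, spoken_responses: bool = False) -> bool:
    """Return True when a deployment would apply speak configuration."""
    # Phone calls always need voice output.
    if channel is Channel.PHONE_CALL:
        return True
    # Web Widget and Web App / SDK use speak settings only when
    # spoken responses are turned on for that deployment.
    if channel in (Channel.WEB_WIDGET, Channel.WEB_APP_SDK):
        return spoken_responses
    # Text-only channels never use speak configuration.
    return False
```

For example, `uses_speak_settings(Channel.WEB_WIDGET)` is false until spoken responses are enabled for that widget, while a Phone Call deployment always returns true.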

Configure it

Open your assistant, select Configure Assistant, then open Deployments. Speak settings appear in the Voice Output step for each deployment that supports spoken responses.
| Area | What it controls |
| --- | --- |
| Text-to-Speech | Provider, credential, model, voice, language, and speech synthesis behavior. |
| Pronunciation | How dates, times, numbers, addresses, URLs, acronyms, and domain terms are spoken. |
| Delivery | Pause behavior, conjunction boundaries, and optional ambient audio. |
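One way to picture how speak.* options map onto these three areas is the sketch below. Only the `speak.` prefix comes from this page. Every key name after the prefix, every value, and the `group_speak_options` helper are assumptions made up for illustration. Check the Text-to-Speech page for the real option names.

```python
# Hypothetical flat runtime options. Only the "speak." prefix is taken from
# the documentation; the remaining key segments and values are placeholders.
options = {
    "speak.tts.provider": "example-provider",
    "speak.tts.voice": "example-voice",
    "speak.tts.language": "en-US",
    "speak.pronunciation.dictionaries": ["numbers", "dates", "urls"],
    "speak.delivery.pause_ms": 250,
    "listen.stt.provider": "example-provider",  # not a speak option
}


def group_speak_options(flat: dict) -> dict:
    """Group 'speak.<area>.<name>' keys by area, ignoring everything else."""
    grouped: dict = {}
    for key, value in flat.items():
        parts = key.split(".")
        # Keep only keys shaped like speak.<area>.<name...>.
        if parts[0] != "speak" or len(parts) < 3:
            continue
        area = parts[1]
        name = ".".join(parts[2:])
        grouped.setdefault(area, {})[name] = value
    return grouped


grouped = group_speak_options(options)
```

Grouping options this way makes it easier to review one area at a time, for example checking all pronunciation settings together before testing a call.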

Configuration pages

Text-to-Speech

Choose the provider, voice, model, language, pronunciation, and speech delivery settings.

Custom TTS

Connect a custom WebSocket speech synthesis provider with DSL rules.
| Area | Start with |
| --- | --- |
| TTS | A low-latency streaming voice that supports the assistant’s primary language. |
| Pronunciation | Enable dictionaries for numbers, currency, dates, times, URLs, and domain terms that users will hear often. |
| Prompt | Short spoken responses, usually one or two sentences. |
Tune speak settings by listening to full conversations in the target channel. A voice that sounds good in a browser preview can still sound unclear over phone audio.

Troubleshooting map

| Symptom | First place to look |
| --- | --- |
| Assistant voice starts slowly | Text-to-Speech model latency and response length |
| Assistant mispronounces product names or numbers | Text-to-Speech pronunciation dictionaries |
| Assistant sounds rushed | Conjunction boundaries, pause duration, and prompt response length |
| Voice sounds poor on phone | Test through Phone Call, not only browser preview |

Experience

Configure greeting, idle timeout, error message, and session duration.

Listen

Configure speech-to-text, VAD, noise cancellation, and end-of-speech.

Phone Call Deployment

Configure required voice output for phone calls.

Web Widget Deployment

Configure optional spoken responses in the web widget.