listen.* runtime options and deployment settings related to speech-to-text, noise cancellation, voice activity detection, and end-of-speech.
Listen settings are used by voice-capable deployments: Phone Call, Web Widget with microphone input, and Web App / SDK with microphone input. Text-only channels do not use listen configuration.
Configure it
Open your assistant, select Configure Assistant, then open Deployments. Listen settings appear in the Voice Input step for each deployment that supports microphone or phone audio.| Area | What it controls |
|---|---|
| Speech-to-Text | Provider, credential, model, language, and transcription behavior. |
| Noise cancellation | Background noise removal before VAD and STT. |
| Voice Activity Detection | Speech start/stop detection and barge-in sensitivity. |
| End of Speech | Turn completion detection and silence timeout behavior. |
Configuration pages
Speech-to-Text
Choose the provider, credential, model, and language used to transcribe user speech.
Noise Cancellation
Clean background noise before VAD and STT process the user’s audio.
Voice Activity Detection
Tune speech detection, silence frames, and barge-in sensitivity.
End of Speech Detection
Decide when the user has finished a turn and the assistant should respond.
Recommended starting point
| Area | Start with |
|---|---|
| STT | A streaming provider and model that matches your channel audio. |
| Noise cancellation | RNNoise enabled for phone calls and noisy browser environments. |
| VAD | Silero VAD. |
| EOS | Pipecat Smart Turn for natural conversations, or Silence-Based for simple IVR-style flows. |
Troubleshooting map
| Symptom | First place to look |
|---|---|
| Assistant responds before the user is done | End of Speech Detection |
| Assistant interrupts on coughs or background noise | Voice Activity Detection and Noise Cancellation |
| Transcript is wrong or incomplete | Speech-to-Text |
| Phone calls behave differently from web sessions | Deployment-level Voice Input settings |
Related
Experience
Configure greeting, idle timeout, error message, and session duration.
Speak
Configure text-to-speech and spoken output.
Phone Call Deployment
Configure required voice input for phone calls.
Web App / SDK Deployment
Configure optional voice input for custom apps.