Skip to main content
Listen settings control how Rapida handles user audio before the assistant responds. Use this section for listen.* runtime options and deployment settings related to speech-to-text, noise cancellation, voice activity detection, and end-of-speech.
Listen settings are used by voice-capable deployments: Phone Call, Web Widget with microphone input, and Web App / SDK with microphone input. Text-only channels do not use listen configuration.

Configure it

Open your assistant, select Configure Assistant, then open Deployments. Listen settings appear in the Voice Input step for each deployment that supports microphone or phone audio.
AreaWhat it controls
Speech-to-TextProvider, credential, model, language, and transcription behavior.
Noise cancellationBackground noise removal before VAD and STT.
Voice Activity DetectionSpeech start/stop detection and barge-in sensitivity.
End of SpeechTurn completion detection and silence timeout behavior.

Configuration pages

Speech-to-Text

Choose the provider, credential, model, and language used to transcribe user speech.

Noise Cancellation

Clean background noise before VAD and STT process the user’s audio.

Voice Activity Detection

Tune speech detection, silence frames, and barge-in sensitivity.

End of Speech Detection

Decide when the user has finished a turn and the assistant should respond.
AreaStart with
STTA streaming provider and model that matches your channel audio.
Noise cancellationRNNoise enabled for phone calls and noisy browser environments.
VADSilero VAD.
EOSPipecat Smart Turn for natural conversations, or Silence-Based for simple IVR-style flows.
Tune listen settings from real conversation logs. If a caller gets cut off, start with EOS and VAD. If transcription is wrong, check language, audio quality, noise cancellation, and STT model.

Troubleshooting map

SymptomFirst place to look
Assistant responds before the user is doneEnd of Speech Detection
Assistant interrupts on coughs or background noiseVoice Activity Detection and Noise Cancellation
Transcript is wrong or incompleteSpeech-to-Text
Phone calls behave differently from web sessionsDeployment-level Voice Input settings

Experience

Configure greeting, idle timeout, error message, and session duration.

Speak

Configure text-to-speech and spoken output.

Phone Call Deployment

Configure required voice input for phone calls.

Web App / SDK Deployment

Configure optional voice input for custom apps.