Listen

Listen settings control how Rapida handles user audio before the assistant responds. Use this section for listen.* runtime options and deployment settings related to speech-to-text, noise cancellation, voice activity detection, and end-of-speech.

Listen settings are used by voice-capable deployments: Phone Call, Web Widget with microphone input, and Web App / SDK with microphone input. Text-only channels do not use listen configuration.

Configure it

Open your assistant, select Configure Assistant, then open Deployments. Listen settings appear in the Voice Input step for each deployment that supports microphone or phone audio.

Area	What it controls
Speech-to-Text	Provider, credential, model, language, and transcription behavior.
Noise cancellation	Background noise removal before VAD and STT.
Voice Activity Detection	Speech start/stop detection and barge-in sensitivity.
End of Speech	Turn completion detection and silence timeout behavior.

Configuration pages

Speech-to-Text

Choose the provider, credential, model, and language used to transcribe user speech.

Noise Cancellation

Clean background noise before VAD and STT process the user’s audio.

Voice Activity Detection

Tune speech detection, silence frames, and barge-in sensitivity.

End of Speech Detection

Decide when the user has finished a turn and the assistant should respond.

Recommended starting point

Area	Start with
STT	A streaming provider and model that matches your channel audio.
Noise cancellation	RNNoise enabled for phone calls and noisy browser environments.
VAD	Silero VAD.
EOS	Pipecat Smart Turn for natural conversations, or Silence-Based for simple IVR-style flows.

Tune listen settings from real conversation logs. If a caller gets cut off, start with EOS and VAD. If transcription is wrong, check language, audio quality, noise cancellation, and STT model.

Runtime overrides

You can override listen settings for a single outbound phone call by passing listen.* keys in CreatePhoneCallRequest.options. Runtime overrides do not update the saved deployment.

{
  "options": {
    "listen.language": "en",
    "listen.model": "nova-3",
    "listen.threshold": 0.75,
    "listen.audio.encoding": "mulaw",
    "listen.audio.sample_rate": 8000,
    "listen.smart_format": true,
    "listen.filler_words": false,
    "listen.endpointing": "300"
  }
}

Option key	Type	Description
`listen.language`	`string`	Primary transcription language or provider language code.
`listen.model`	`string`	Provider-specific STT model for this call.
`listen.threshold`	`number`	Speech detection threshold used by supported providers.
`listen.audio.encoding`	`string`	Input audio encoding, such as `mulaw` or `pcm_s16le`, when supported by the provider.
`listen.audio.sample_rate`	`number`	Input audio sample rate, such as `8000` for telephony audio.
`listen.region`	`string`	Provider region for services that support regional endpoints.
`listen.smart_format`	`boolean`	Enables provider smart formatting when supported.
`listen.filler_words`	`boolean`	Controls filler-word transcription when supported.
`listen.vad_events`	`boolean`	Enables provider VAD events when supported.
`listen.endpointing`	`string`	Provider endpointing or silence setting.
`listen.multichannel`	`boolean`	Enables multichannel transcription when supported.
`listen.keyword`	`string` / `string[]`	Provider keyword boosting terms.
`listen.operating_point`	`string`	Provider accuracy/latency tier, such as Speechmatics operating point.
`listen.query_params`	JSON object string	Custom STT provider query parameters.
`listen.request_rules`	JSON array string	Custom STT request DSL rules.
`listen.response_rules`	JSON array string	Custom STT response DSL rules.

Troubleshooting map

Symptom	First place to look
Assistant responds before the user is done	End of Speech Detection
Assistant interrupts on coughs or background noise	Voice Activity Detection and Noise Cancellation
Transcript is wrong or incomplete	Speech-to-Text
Phone calls behave differently from web sessions	Deployment-level Voice Input settings

Experience

Configure greeting, idle timeout, error message, and session duration.

Speak

Configure text-to-speech and spoken output.

Phone Call Deployment

Configure required voice input for phone calls.

Web App / SDK Deployment

Configure optional voice input for custom apps.

Assistants

Knowledge

LLM Endpoint

Activity & Logs

External Integrations

Credentials

Workspace

Governance

Deployment Options

Configure it

Configuration pages

Speech-to-Text

Noise Cancellation

Voice Activity Detection

End of Speech Detection

Recommended starting point

Runtime overrides

Troubleshooting map

Experience

Speak

Phone Call Deployment

Web App / SDK Deployment

​Configure it

​Configuration pages

Speech-to-Text

Noise Cancellation

Voice Activity Detection

End of Speech Detection

​Recommended starting point

​Runtime overrides

​Troubleshooting map

​Related

Experience

Speak

Phone Call Deployment

Web App / SDK Deployment

Configure it

Configuration pages

Recommended starting point

Runtime overrides

Troubleshooting map

Related