OpenAI TTS produces natural, expressive speech from text using neural voice synthesis. It supports multiple distinct voices and is available in standard and HD quality tiers.

Getting Started

Follow these steps to configure your provider:
1. Add OpenAI credentials to your vault

Navigate to Integration → Vault in the Rapida dashboard and add your OpenAI API key. This credential is shared across all OpenAI services (LLM, Whisper STT, and TTS).

2. Select OpenAI as your TTS provider

When configuring your assistant, open Audio Settings and choose OpenAI as your Text-to-Speech provider.

3. Choose a voice

Select from the available voices to match your assistant’s persona.

Available Voices

| Voice | Character |
| --- | --- |
| alloy | Neutral, balanced |
| echo | Male, clear |
| fable | Expressive, narrative |
| onyx | Deep, authoritative |
| nova | Warm, natural |
| shimmer | Clear, bright |

Supported Models

| Model | Description |
| --- | --- |
| tts-1 | Optimized for real-time, low-latency synthesis |
| tts-1-hd | Higher quality, best for offline or non-real-time use |

For real-time voice assistants, use tts-1 to minimize latency.

Configuration Options

| Option | Description | Default |
| --- | --- | --- |
| Model | tts-1 or tts-1-hd | tts-1 |
| Voice | One of the six voices above | alloy |
| Speed | Playback speed multiplier (0.25–4.0) | 1.0 |
| Format | Output format: mp3, opus, aac, flac, wav, pcm | pcm |
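
To make the options concrete, here is a minimal Python sketch of how they might map onto the JSON body of OpenAI's speech endpoint (`POST /v1/audio/speech`). The `TTSOptions` class and `build_speech_payload` function are hypothetical names used for illustration only; they are not part of Rapida or the OpenAI SDK, and Rapida may build its requests differently. The defaults and allowed values come from the tables on this page.

```python
from dataclasses import dataclass

# Allowed values, taken from the tables on this page.
MODELS = {"tts-1", "tts-1-hd"}
VOICES = {"alloy", "echo", "fable", "onyx", "nova", "shimmer"}
FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}


@dataclass(frozen=True)
class TTSOptions:
    """Hypothetical container mirroring the options table (not a Rapida type)."""

    model: str = "tts-1"
    voice: str = "alloy"
    speed: float = 1.0
    audio_format: str = "pcm"

    def __post_init__(self) -> None:
        # Reject values outside the documented choices and range.
        if self.model not in MODELS:
            raise ValueError(f"unsupported model: {self.model!r}")
        if self.voice not in VOICES:
            raise ValueError(f"unsupported voice: {self.voice!r}")
        if not 0.25 <= self.speed <= 4.0:
            raise ValueError("speed must be between 0.25 and 4.0")
        if self.audio_format not in FORMATS:
            raise ValueError(f"unsupported format: {self.audio_format!r}")


def build_speech_payload(text: str, options: TTSOptions = TTSOptions()) -> dict:
    """Build a JSON-serializable request body for the speech endpoint."""
    return {
        "model": options.model,
        "input": text,
        "voice": options.voice,
        "speed": options.speed,
        "response_format": options.audio_format,
    }
```

Calling `build_speech_payload("Hello")` with no options yields a body that uses the defaults listed above. Validating at construction time means a bad value, such as a speed of 5.0, fails before any request is sent.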

Notes

  • Use tts-1 for real-time conversations; tts-1-hd adds perceptible latency.
  • Audio is streamed in chunks, so playback begins before synthesis is fully complete.
  • Pricing is per 1,000 characters of input text. See the OpenAI TTS pricing page for current rates.
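
The last two notes lend themselves to quick arithmetic, sketched below in Python. Both function names are hypothetical. The price passed to `estimate_cost` is a placeholder you supply yourself, since no rate is given on this page. The PCM parameters (24 kHz, 16-bit, mono) are an assumption based on OpenAI's documented `pcm` output and may not match what your pipeline delivers after resampling.

```python
def estimate_cost(text: str, price_per_1k_chars: float) -> float:
    """Estimate synthesis cost, assuming billing per 1,000 input characters.

    len(text) counts Unicode code points; the provider's own character
    count may differ slightly for some inputs.
    """
    return len(text) / 1000 * price_per_1k_chars


def pcm_chunk_seconds(
    num_bytes: int,
    sample_rate_hz: int = 24_000,  # assumption: 24 kHz output
    bytes_per_sample: int = 2,     # assumption: 16-bit samples
    channels: int = 1,             # assumption: mono
) -> float:
    """Return how much audio, in seconds, a raw PCM chunk of this size holds."""
    return num_bytes / (sample_rate_hz * bytes_per_sample * channels)
```

Under these assumptions, a 4,800-byte PCM chunk holds a tenth of a second of audio, which gives a rough sense of how soon playback can start once the first chunk arrives.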