Getting Started
To integrate Deepgram Text-to-Speech with your Rapida application, follow these steps:Supported Models
Text-to-Speech Models
| Model Name | Quality | Use Case |
|---|---|---|
| aura-asteria-en | Premium | Ultra-realistic voices, high-quality output |
| aura-luna-en | Premium | Natural, expressive female voices |
| aura-stella-en | Premium | Professional male voices |
| aura-juniper-en | Standard | Cost-effective, quality voices |
Supported Languages
Deepgram TTS supports 20+ languages including:- English (US, UK, Australian, Indian variants)
- Spanish, French, German, Italian, Dutch
- Portuguese, Russian, German, Polish
- Japanese, Mandarin, Korean
- And more
Voice Characteristics
Each Deepgram voice model includes:- Multiple Voices: Different voice personalities
- Emotion Styles: Express different emotions and tones
- Pronunciation Control: Fine-tune how words are pronounced
- Speed Control: Adjust speaking rate
Prerequisites
- Have a Deepgram account (sign up at https://deepgram.com)
- Navigate to your API dashboard
- Generate an API key
- Copy the API key (make sure to save it securely)
Setting Up Provider Credentials
1
Access the Integrations Page

2
Select Deepgram
On the Integrations page, find the Deepgram provider card.Click the “Setup Credential” button for Deepgram.
3
Create Provider Credential
A modal window will appear titled “Create provider credential”. Follow these steps:
- Select “Deepgram” from the dropdown (if not already selected)
- Enter a Key Name: Assign a unique name to this provider key for easy identification
- Enter the API Key: Input your Deepgram API key
- Click “Configure” to save the credential
4
Verify Credential Setup
After setting up the credential, you can verify it’s been added:
- The Deepgram provider card should now show “Connected”
- If you click on the provider, you’ll see a “View provider credential” modal
- This modal displays the credential name, when it was last updated, and options to delete or close
Integration Features
- Ultra-Realistic Voices: Industry-leading voice quality
- 20+ Languages: Comprehensive global language support
- Multiple Voice Options: Choose from various voice personalities
- Emotion Control: Express different emotional tones
- Real-time Streaming: Stream audio in real-time
- Low Latency: Fast response times for interactive applications
- Pronunciation Control: Fine-tune specific word pronunciations