Getting Started
To integrate Deepgram Text-to-Speech with your Rapida application, follow these steps:Supported Models
Text-to-Speech Models
| Model Name | Quality | Use Case |
|---|---|---|
| aura-asteria-en | Premium | Ultra-realistic voices, high-quality output |
| aura-luna-en | Premium | Natural, expressive female voices |
| aura-stella-en | Premium | Professional male voices |
| aura-juniper-en | Standard | Cost-effective, quality voices |
Supported Languages
Deepgram TTS supports 20+ languages including:- English (US, UK, Australian, Indian variants)
- Spanish, French, German, Italian, Dutch
- Portuguese, Russian, German, Polish
- Japanese, Mandarin, Korean
- And more
Voice Characteristics
Each Deepgram voice model includes:- Multiple Voices: Different voice personalities
- Emotion Styles: Express different emotions and tones
- Pronunciation Control: Fine-tune how words are pronounced
- Speed Control: Adjust speaking rate
Prerequisites
- Have a Deepgram account (sign up at https://deepgram.com)
- Navigate to your API dashboard
- Generate an API key
- Copy the API key (make sure to save it securely)
Setting Up Provider Credentials
Select Deepgram
On the Integrations page, find the Deepgram provider card.Click the “Setup Credential” button for Deepgram.
Create Provider Credential
A modal window will appear titled “Create provider credential”. Follow these steps:
- Select “Deepgram” from the dropdown (if not already selected)
- Enter a Key Name: Assign a unique name to this provider key for easy identification
- Enter the API Key: Input your Deepgram API key
- Click “Configure” to save the credential
Verify Credential Setup
After setting up the credential, you can verify it’s been added:
- The Deepgram provider card should now show “Connected”
- If you click on the provider, you’ll see a “View provider credential” modal
- This modal displays the credential name, when it was last updated, and options to delete or close
Integration Features
- Ultra-Realistic Voices: Industry-leading voice quality
- 20+ Languages: Comprehensive global language support
- Multiple Voice Options: Choose from various voice personalities
- Emotion Control: Express different emotional tones
- Real-time Streaming: Stream audio in real-time
- Low Latency: Fast response times for interactive applications
- Pronunciation Control: Fine-tune specific word pronunciations
