Deepgram Text-to-Speech

Supported Models

Text-to-Speech Models

Model Name	Quality	Use Case
aura-asteria-en	Premium	Ultra-realistic voices, high-quality output
aura-luna-en	Premium	Natural, expressive female voices
aura-stella-en	Premium	Professional female voice
aura-juniper-en	Standard	Cost-effective, quality voices
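
The model names in the table are the identifiers passed when requesting speech. The following is a minimal sketch of such a request, assuming Deepgram's public REST endpoint at https://api.deepgram.com/v1/speak, with the model sent as a query parameter and the key sent in a Token authorization header. The function names and the output path are illustrative and are not part of this platform.

```python
import json
import urllib.parse
import urllib.request

# Assumption: Deepgram's public text-to-speech REST endpoint.
DEEPGRAM_SPEAK_URL = "https://api.deepgram.com/v1/speak"


def build_speak_request(text: str, model: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request asking for `text` spoken by `model`."""
    query = urllib.parse.urlencode({"model": model})
    body = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        url=f"{DEEPGRAM_SPEAK_URL}?{query}",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
    )


def synthesize_to_file(text: str, model: str, api_key: str, out_path: str) -> None:
    """Send the request and write the returned audio bytes to `out_path` (hypothetical helper)."""
    request = build_speak_request(text, model, api_key)
    with urllib.request.urlopen(request, timeout=30) as response, open(out_path, "wb") as out:
        out.write(response.read())
```

Calling synthesize_to_file("Hello", "aura-asteria-en", api_key, "hello.mp3") would then select the first model in the table; swapping the model string is the only change needed to try another voice.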

Supported Languages

Deepgram TTS supports 20+ languages including:

English (US, UK, Australian, Indian variants)

Spanish, French, German, Italian, Dutch

Portuguese, Russian, Polish

Japanese, Mandarin, Korean

And more

Voice Characteristics

Each Deepgram voice model includes:

Multiple Voices: Different voice personalities

Emotion Styles: Express different emotions and tones

Pronunciation Control: Fine-tune how words are pronounced

Speed Control: Adjust speaking rate

Setting Up Provider Credentials

Access the Integrations Page

Navigate to the “Integration > Models” page to access TTS providers.

Select Deepgram

On the Integrations page, find the Deepgram provider card. Click the “Setup Credential” button for Deepgram.

Create Provider Credential

A modal window will appear titled “Create provider credential”. Follow these steps:

Select “Deepgram” from the dropdown (if not already selected)
Enter a Key Name: Assign a unique name to this provider key for easy identification
Enter the API Key: Input your Deepgram API key
Click “Configure” to save the credential

Verify Credential Setup

After setting up the credential, you can verify it’s been added:

The Deepgram provider card should now show “Connected”
If you click on the provider, you’ll see a “View provider credential” modal
This modal displays the credential name, when it was last updated, and options to delete or close

Your Deepgram Text-to-Speech provider credential is now set up.
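
If the card does not show “Connected”, it can help to confirm that Deepgram itself accepts the key, independently of this platform. The sketch below assumes that Deepgram's project listing endpoint (GET https://api.deepgram.com/v1/projects) returns 200 for a valid key and 401 or 403 for a rejected one, and that the key has permission to list projects. The helper names are hypothetical.

```python
import urllib.error
import urllib.request

# Assumption: listing projects is a cheap authenticated call for checking a key.
PROJECTS_URL = "https://api.deepgram.com/v1/projects"


def build_key_check_request(api_key: str) -> urllib.request.Request:
    """Build a GET request authenticated with the key (surrounding whitespace stripped)."""
    return urllib.request.Request(
        PROJECTS_URL,
        method="GET",
        headers={"Authorization": f"Token {api_key.strip()}"},
    )


def key_is_accepted(api_key: str, opener=urllib.request.urlopen) -> bool:
    """Return True on HTTP 200, False on 401/403; re-raise any other HTTP error."""
    try:
        with opener(build_key_check_request(api_key), timeout=10) as response:
            return response.status == 200
    except urllib.error.HTTPError as err:
        if err.code in (401, 403):
            return False
        raise
```

The opener parameter exists only so the check can be exercised without network access; in normal use it is left at its default.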

Integration Features

Ultra-Realistic Voices: Industry-leading voice quality

20+ Languages: Comprehensive global language support

Multiple Voice Options: Choose from various voice personalities

Emotion Control: Express different emotional tones

Real-time Streaming: Stream audio in real-time

Low Latency: Fast response times for interactive applications

Pronunciation Control: Fine-tune specific word pronunciations
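
For the streaming and latency features listed above, one way to consume audio as it arrives is to read the HTTP response in fixed-size chunks instead of waiting for the full body. This is a sketch under assumptions: it uses Deepgram's public REST endpoint directly, not this platform's own streaming path, and the function names and chunk size are illustrative.

```python
import json
import urllib.parse
import urllib.request
from typing import Iterator


def iter_audio_chunks(stream, chunk_size: int = 4096) -> Iterator[bytes]:
    """Yield successive chunks from any object with a read(n) method until it is exhausted."""
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        yield chunk


def stream_speech(text: str, model: str, api_key: str) -> Iterator[bytes]:
    """Request speech and yield audio bytes as they are read (hypothetical helper)."""
    query = urllib.parse.urlencode({"model": model})
    request = urllib.request.Request(
        f"https://api.deepgram.com/v1/speak?{query}",  # assumed public endpoint
        data=json.dumps({"text": text}).encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        yield from iter_audio_chunks(response)
```

Each yielded chunk can be handed to an audio player or forwarded to a client immediately, which is what keeps perceived latency low in interactive applications.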
