Skip to main content

Getting Started

To integrate Cartesia Text-to-Speech with your Rapida application, follow these steps:

Supported Models

Text-to-Speech Models

Model NameDescriptionUse Case
Sonic EnglishHigh-quality English voicesProfessional TTS applications
Bark ModelExpressive speech synthesisCreative and dynamic voice outputs
Tortoise TTSUltra-realistic voicesHigh-fidelity voice synthesis

Supported Languages

Cartesia supports 20+ languages including:
  • English (multiple accents)
  • Spanish, French, German, Italian
  • Portuguese, Russian, Polish, Dutch
  • Japanese, Mandarin, Korean
  • Hindi and other Indian languages
  • And more

Voice Features

  • Multiple Voice Personalities: Choose from various voice options
  • Emotion Expression: Control emotional tone and expressiveness
  • Pronunciation Control: Define how specific words are pronounced
  • Speed Control: Adjust speaking rate from slow to fast
  • Streaming Support: Stream audio in real-time

Prerequisites

  • Have a Cartesia account (sign up at https://cartesia.ai)
  • Navigate to your API dashboard
  • Generate an API key
  • Copy the API key (make sure to save it securely)

Setting Up Provider Credentials

1

Access the Integrations Page

Integrations PageNavigate to the “Integration > Models” page to access TTS providers.
2

Select Cartesia

On the Integrations page, find the Cartesia provider card.Click the “Setup Credential” button for Cartesia.
3

Create Provider Credential

A modal window will appear titled “Create provider credential”. Follow these steps:
  1. Select “Cartesia” from the dropdown (if not already selected)
  2. Enter a Key Name: Assign a unique name to this provider key for easy identification
  3. Enter the API Key: Input your Cartesia API key
  4. Click “Configure” to save the credential
4

Verify Credential Setup

After setting up the credential, you can verify it’s been added:
  1. The Cartesia provider card should now show “Connected”
  2. If you click on the provider, you’ll see a “View provider credential” modal
  3. This modal displays the credential name, when it was last updated, and options to delete or close
Your Cartesia Text-to-Speech provider credential is now set up.

Integration Features

  • Ultra-Realistic Voices: Advanced voice synthesis quality
  • 20+ Languages: Comprehensive language coverage
  • Multiple Voice Models: Choose different TTS models
  • Expression Control: Fine-tune emotional expression
  • Real-time Processing: Low-latency audio generation
  • Streaming TTS: Stream audio as it’s being generated
  • Professional Quality: Enterprise-grade voice synthesis
For more information, visit the Cartesia documentation.