Skip to main content

Getting Started

To integrate Cartesia with your Rapida application for speech-to-text (STT) and text-to-speech (TTS) capabilities, follow these steps:

Supported Models

Speech-to-Text Models

Model NameLanguageDescription
Sonic EnglishEnglishHigh-accuracy English speech recognition
Sonic MultilingualMultilingualSupport for multiple languages

Text-to-Speech Models

Model NameFeaturesBest For
BarkExpressive speech synthesisNatural-sounding voices with emotion
TortoiseHigh-quality TTSProfessional voice applications
Sonic XLUltra-realistic voicesHigh-fidelity voice synthesis

Supported Languages for STT

  • English (US, UK, Australian variants)
  • Spanish, French, German
  • Mandarin, Japanese, Korean
  • And more

Supported Languages for TTS

Cartesia supports 20+ languages for text-to-speech synthesis with multiple voice options.

Prerequisites

  • Have a Cartesia account (sign up at https://cartesia.ai)
  • Navigate to your API dashboard
  • Generate an API key
  • Copy the API key (make sure to save it securely)

Setting Up Provider Credentials

1

Access the Integrations Page

Integrations PageNavigate to the “Integration > Models” page to access voice service providers.
2

Select Cartesia

On the Integrations page, find the Cartesia provider card.Click the “Setup Credential” button for Cartesia.
3

Create Provider Credential

A modal window will appear titled “Create provider credential”. Follow these steps:
  1. Select “Cartesia” from the dropdown (if not already selected)
  2. Enter a Key Name: Assign a unique name to this provider key for easy identification
  3. Enter the API Key: Input your Cartesia API key
  4. Click “Configure” to save the credential
4

Verify Credential Setup

After setting up the credential, you can verify it’s been added:
  1. The Cartesia provider card should now show “Connected”
  2. If you click on the provider, you’ll see a “View provider credential” modal
  3. This modal displays the credential name, when it was last updated, and options to delete or close
Your Cartesia provider credential is now set up for voice AI integration.

Integration Features

  • Unified Platform: Both STT and TTS in one platform
  • High-Quality Audio: Professional-grade voice synthesis
  • Real-time Processing: Low-latency speech processing
  • Multiple Languages: Comprehensive language support
  • Voice Customization: Create custom voices and speaking styles
  • Streaming Support: Real-time streaming for both STT and TTS
For more information, visit the Cartesia documentation.