Skip to main content

Overview

NVIDIA delivers speech-to-text through the NVIDIA Cloud Functions (NVCF) API, offering enterprise-grade ASR powered by NVIDIA’s high-throughput inference infrastructure. It is suitable for enterprise voice deployments requiring scalable, accurate transcription.

Authentication

KeyDescription
keyAPI key from build.nvidia.com
function_idNVCF Function ID for the STT model endpoint

Configuration

OptionDescription
languageLanguage code (e.g. en-US, en-GB, es-ES, fr-FR, de-DE, ja-JP)

Getting Started

1

Add NVIDIA credentials to your vault

Navigate to Integration → Vault in the Rapida dashboard. Add your NVIDIA API key and the NVCF Function ID for the desired ASR model from build.nvidia.com.
2

Select NVIDIA as your STT provider

When configuring your assistant, open Audio Settings and choose NVIDIA as your Speech-to-Text provider.
3

Configure language

Set the language code matching your expected input audio. Browse supported languages in the NVIDIA NIM catalog.