Skip to main content

Spitch Documentation

The voice layer for your applications. One API for text-to-speech, speech-to-text, translation and diacritics across many languages, including Yorùbá, Igbo, Hausa, Swahili and more.

What you can build

Spitch turns text into natural speech and speech back into accurate text, tuned for the tones, diacritics and accents of the many languages your users speak. Pick a capability to dive in.

Text to Speech

Convert any text into lifelike audio across 10+ voices and locales.

Speech to Text

Transcribe audio and streams with punctuation and diacritics.

Translation

Translate between English and other supported languages.

Diacritics

Restore tone marks and diacritics on raw, unmarked text.

Start here

New to Spitch? These three pages take you from zero to your first generated clip in a few minutes.

Quickstart

Your first request.

Authentication

Create and use API keys.

SDKs

Official client libraries.