Request
The `transcribe()` function can be used to transcribe audio. Pass the audio as `content`: file bytes, an unauthenticated audio URL, or a Spitch file UUID.
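Because all three input kinds go through the same `content` argument, it can be useful to check which kind a value is before sending it. The sketch below uses a hypothetical helper, `classify_content`, which is not part of the Spitch SDK. It assumes URLs use the `http` or `https` scheme and that file UUIDs are written in standard UUID text form.

```python
import uuid
from urllib.parse import urlparse


def classify_content(content):
    """Return 'bytes', 'url', or 'file_id' for a value meant for `content`.

    Hypothetical helper for illustration; not part of the Spitch SDK.
    """
    if isinstance(content, (bytes, bytearray)):
        return "bytes"
    if isinstance(content, str):
        parsed = urlparse(content)
        if parsed.scheme in ("http", "https") and parsed.netloc:
            return "url"
        try:
            # uuid.UUID raises ValueError for anything that is not a UUID string.
            uuid.UUID(content)
            return "file_id"
        except ValueError:
            pass
    raise ValueError("content must be audio bytes, an http(s) URL, or a file UUID")
```

A check like this only inspects the shape of the value. It cannot tell whether a URL is reachable without authentication or whether a UUID refers to an existing file.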
Parameters
- `content` (required) - Audio file content, an audio file URL, or a Spitch file UUID
- `language` (optional) - ISO 639 language code for the audio, such as `en`, `yo`, `ha`, `ig`, or `am`
- `model` (deprecated) - Previous STT model selector. Accepted values are `mansa_v1` and `legacy`, but new integrations should omit it.
- `special_words` (optional) - Custom words to help with recognition accuracy
- `timestamp` (optional) - Timestamp granularity: `sentence` or `word`
Use `content` for all input types. URLs must be directly accessible without authentication.
STT Models
The `model` parameter is deprecated in the current API. Omit it unless you are maintaining an older integration that still sends `mansa_v1` or `legacy`.
Timestamp Options
The `timestamp` parameter controls the level of timing information returned:
- `sentence` - Timestamps for each sentence
- `word` - Timestamps for each individual word
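As a rough illustration of how the parameters fit together, the sketch below assembles request arguments and rejects an unknown `timestamp` value before anything is sent. `build_transcribe_params` is a hypothetical helper, not an SDK function. It passes `special_words` through unchanged because this page does not specify its type, and it leaves out the deprecated `model` parameter on purpose.

```python
ALLOWED_TIMESTAMPS = {"sentence", "word"}


def build_transcribe_params(content, language=None, special_words=None, timestamp=None):
    """Assemble keyword arguments for a transcription request.

    Hypothetical helper for illustration. The deprecated `model`
    parameter is intentionally never included.
    """
    if timestamp is not None and timestamp not in ALLOWED_TIMESTAMPS:
        raise ValueError(f"timestamp must be one of {sorted(ALLOWED_TIMESTAMPS)}")
    params = {"content": content}
    optional = {
        "language": language,
        "special_words": special_words,
        "timestamp": timestamp,
    }
    # Only send optional parameters that were actually provided.
    params.update({k: v for k, v in optional.items() if v is not None})
    return params
```

Leaving unset options out of the request, rather than sending them as empty values, lets the service apply its own defaults.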
Best Practices for Use
- Send audio through the `content` field as file bytes, a file UUID, or a public URL.
- The maximum file size is 25MB. We will support larger sizes in the future.
- We only support `mp3`, `wav`, `m4a`, and `ogg` file formats.
- If you provide a URL in `content`, ensure that access to the file is not blocked by authentication.
- The `language` field is optional. If you set it, use the language code (e.g. `en`, `yo`, `ig`) rather than the full language name.
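The size and format limits above can be checked locally before uploading. The following is a minimal sketch using a hypothetical helper, `check_audio`. It assumes the format can be judged from the file extension and that "25MB" means 25 × 1024 × 1024 bytes, which this page does not state explicitly.

```python
import os

# Assumption: 25MB is read as 25 MiB; the exact byte limit is not given here.
MAX_BYTES = 25 * 1024 * 1024
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".m4a", ".ogg"}


def check_audio(filename, size_bytes):
    """Return a list of problems with an audio file (empty if none found)."""
    problems = []
    extension = os.path.splitext(filename)[1].lower()
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {extension or '(no extension)'}")
    if size_bytes > MAX_BYTES:
        problems.append("file is larger than 25MB")
    return problems
```

For a file on disk, this could be called as `check_audio(path, os.path.getsize(path))`. An empty list means neither limit was hit; it does not guarantee that the service will accept the file.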
Response
The response for transcription is JSON.
- The Content-Type is `application/json`.
- A `request_id` is returned for issue resolution with our support team.
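Since `request_id` is what support will ask for, it is worth logging it for every call. The sketch below pulls it out of a raw JSON body. `extract_request_id` is a hypothetical helper, and the sample payload is made up for illustration: only the `request_id` field is taken from this page.

```python
import json


def extract_request_id(body):
    """Parse a JSON response body and return its request_id, or None if absent."""
    payload = json.loads(body)
    if not isinstance(payload, dict):
        return None
    return payload.get("request_id")


# Hypothetical payload; a real response carries more fields than this.
sample_body = '{"request_id": "req-123"}'
print(extract_request_id(sample_body))
```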
Examples - file
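A minimal sketch of the file case: read the audio from disk and pass the raw bytes as `content`. The wrapper `transcribe_file` is hypothetical, and the call path `client.speech.transcribe` is an assumption about how the client object is laid out, so check it against the SDK version you have installed.

```python
def transcribe_file(client, path, language=None):
    """Read a local audio file and send its bytes as `content`.

    Assumes `client.speech.transcribe` is the transcription call;
    this wrapper itself is hypothetical.
    """
    with open(path, "rb") as audio_file:
        audio_bytes = audio_file.read()
    kwargs = {"content": audio_bytes}
    if language is not None:
        # Use the language code (for example "yo"), not the full name.
        kwargs["language"] = language
    return client.speech.transcribe(**kwargs)
```

Passing the client in as an argument keeps the example independent of how the client is constructed and authenticated.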
Examples - URL
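A minimal sketch of the URL case: the URL string itself is passed as `content`. As with the file case, `transcribe_url` is a hypothetical wrapper and `client.speech.transcribe` is an assumed call path. The URL in the usage note is a placeholder.

```python
def transcribe_url(client, url, timestamp=None):
    """Send a publicly accessible audio URL as `content`.

    Assumes `client.speech.transcribe` is the transcription call;
    this wrapper itself is hypothetical.
    """
    if not url.startswith(("http://", "https://")):
        raise ValueError("expected an http(s) URL")
    kwargs = {"content": url}
    if timestamp is not None:
        # Accepted values on this page are "sentence" and "word".
        kwargs["timestamp"] = timestamp
    return client.speech.transcribe(**kwargs)
```

A call might look like `transcribe_url(client, "https://example.com/audio.mp3", timestamp="word")`. The URL must be reachable without authentication, which this wrapper cannot verify.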
For error codes and retry guidance, see Troubleshooting.