file_id across many requests — instead of re-sending the bytes or hosting the file at a public URL every time. It is especially handy for transcription, where the same recording is often processed more than once.
Reach for file storage when you:
- Transcribe the same audio repeatedly, for example re-running with different
special_wordsortimestampsettings. - Want to avoid exposing audio at a public URL.
- Process large files and prefer to upload them a single time.
Upload a file
Uploading returns a file object whosefile_id you reference in later calls.
Use a file in transcription
Pass thefile_id as content to reuse an uploaded recording without sending the bytes again.
content accepts file bytes, a public audio URL, or a Spitch file_id. See Transcription for the full request shape.List files
Listing returns your files with anext_cursor you can use to page through results.
Maximum number of files to return per page (up to 99).
Pagination cursor taken from a previous response’s
next_cursor.Retrieve file details
Fetch the metadata for a single file by itsfile_id.
Download a file
Generate a temporary signed URL to download the stored audio. The link expires afterttl seconds.
How long the signed URL stays valid, in seconds. Minimum 60, maximum 3600.
Delete a file
Remove a file from storage. This cannot be undone.Check storage usage
See how much storage you are using and how many files you have.File object
Upload, retrieve, and list return file metadata with these fields:| Field | Type | Description |
|---|---|---|
file_id | string (uuid) | Unique identifier. Pass this as content to reuse the file. |
status | string | Processing status of the file. |
original_name | string | null | The filename supplied at upload, when available. |
size_bytes | integer | null | File size in bytes. |
uploaded_by | string | null | Identifier of the uploader. |
created_at | string (date-time) | When the file was uploaded. |
| Field | Type | Description |
|---|---|---|
num_files | integer | Number of files in your storage. |
used_bytes | integer | Storage used, in bytes. |
total_bytes | integer | Total storage available, in bytes. |
used | string | Storage used, human-readable (for example 12.4 MB). |
total | string | Total storage available, human-readable. |
Limits
- Supported audio formats are
mp3,wav,m4a, andogg. - The maximum file size is 25MB, matching the transcription limit.
- Download links are signed and expire after their
ttl(60–3600 seconds).