The File Upload API allows you to submit audio files for transcription. Files are processed asynchronously, and you can check the status using the Get File Status endpoint.

Endpoint

POST https://infer.voice.intron.io/file/v1/upload

Authentication

All requests require a Bearer token in the Authorization header:
Authorization: Bearer YOUR_API_KEY

Request

Supported File Formats

FormatExtensions
WAV.wav
MP3.mp3
MP4.mp4
M4A.m4a
OGG.ogg
WebM.webm
FLAC.flac

use category options

  • file_category_general
  • file_category_telehealth
  • file_category_procedure
  • file_category_call_center
  • file_category_legal
  • file_category_meeting_notes

Headers

HeaderValueRequired
AuthorizationBearer YOUR_API_KEYYes
Content-Typemultipart/form-dataYes

Request Body form-data

FieldTypeDescriptionRequiredOptionsDefault
audio_file_nameStringnon-unique file nameyes
audio_file_blobStringurl to a readable fileyes
Post processing options
use_diarizationStringget the transcript text as a diarized responsenoTRUE | FALSE
use_template_idStringuse a custom prompt id for the transcript post-processingno
use_categoryStringset the category of post-processing to use on the filenofile_category_telehealth

Post processing options for the file categories

FieldTypeDescriptionRequiredOptionsDefault
get_summaryStringget a summary of the transcriptnoTRUE | FALSE

Basic request and response sample

curl --location 'https://infer.voice.intron.io/file/v1/upload' \
  --header 'Authorization: Bearer api-key' \
  --form 'audio_file_name="my_file_2"' \
  --form 'audio_file_blob=@"/C:/Users/aaaa/file.wav"'

Request and response with custom template id

curl --location 'https://infer.voice.intron.io/file/v1/upload' \
  --header 'Authorization: Bearer api-key' \
  --form 'audio_file_name="my_file_2"' \
  --form 'use_template_id="a3f32e46-afa4-42cb-9155-522dcf9xyxyxd"' \
  --form 'audio_file_blob=@"/C:/Users/aaaa/file.wav"'

Next Steps

After uploading a file, use the Get File Status endpoint to check the transcription progress and retrieve results.
For real-time transcription, consider using the Widget Integration instead.