model | String | The ID of the model to use. | Required | transcriptions, translations |
file | File | Audio file in FLAC, MP3, MP4, MPEG, MPGA, M4A, Ogg, WAV, or WebM format. File size limit: 25MB. | Required | transcriptions, translations |
prompt | String | Prompt to influence transcription style or vocabulary. Example: “Please transcribe carefully, including pauses and hesitations.” | Optional | transcriptions, translations |
response_format | String | Output format: either json or text. | json | transcriptions, translations |
language | String | The language of the input audio. Using ISO-639-1 format (e.g., en) improves accuracy and latency. | Optional | transcriptions, translations |
stream | Boolean | Enables streaming responses. | false | transcriptions, translations |
stream_options | Object | Additional streaming configuration (e.g., {"include_usage": true}). | Optional | transcriptions, translations |