Learn how to transcribe a dialog using the record method.
To activate speech-to-text transcription, you just have to set the transcribe parameter to true and the language parameter – to one of the supported languages. Here is how it should look like:
Unlike audio and video recording, transcription results are available only after a call ends, so it should be retrieved via the GetCallHistory method of the HTTP API. You have to call this method with the
with_records=true parameter specified.
There will be records in the response JSON with the transcription_url field. This field value returns the transcription as a plain text:
By default, each line in transcription file is prefixed "Left" for an audio stream from a call endpoint to the Voximplant cloud, and "Right" for an audio stream from the Voximplant cloud to a call endpoint (same logic as with left and right audio channel for stereo recording). "Left" and "Right" names can be changed via the labels parameter. The dict parameter allows you to specify an array of words that the transcriber will try to match in case of recognition problems. Specifying domain-specific words can improve transcription results a lot.