Custom Speech-to-Text is the process of training the algorithm on specific words in an audio file. This can be used when there is an important word in a conversation, for example. The machine will make sure to always write that particular word in the right way.
There are many Speech Diarization engines on the market and their performances vary between each provider depending on your audio file. Each of them also has different costs and processing times: it is important to be able to test some of them before choosing the right one.
By aggregating several Custom Speech-to-Text engines on a single API, Eden AI allows you to use a number of these engines at the same time depending on the audio you wish to transcribe.