We are pleased to announce that Speechmatics' speech-to-text solution has been integrated into the Eden AI API.
Speechmatics provides automatic speech recognition (ASR) technology. Its software uses machine learning algorithms to transcribe speech from audio or video files into text in real time, with high accuracy.
The company's technology can transcribe speech in a variety of languages and accents, and it can be used for a wide range of applications, such as transcribing phone calls, meetings, lectures, and podcasts, as well as providing closed captions for videos. Speechmatics also offers a range of other speech-related services, such as voice biometrics and speaker identification.
Eden AI offers Speechmatics speech technology on its platform amongst several other technologies. We want our users to have access to multiple AI engines and manage them in one place so they can reach high performance, optimize cost and cover all their needs.
There are many reasons for using multiple AI APIs:
You can set up a backup AI API that is called only if the main AI API does not perform well (or is down). You can use the returned confidence score, or other methods, to check provider accuracy.
After the testing phase, you will be able to build a mapping of AI vendors' performance based on the criteria you chose. Each piece of data you need to process is then sent to the best-performing API.
This method allows you to choose the cheapest provider that performs well for your data. Imagine that you choose the Google Cloud API for customer "A" because all providers perform well on their data and Google is the cheapest. You then choose Microsoft Azure for customer "B": it is a more expensive API, but Google's performance is not satisfactory for customer "B". (This is an arbitrary example.)
This approach is required if you need extremely high accuracy. Combining providers leads to higher costs, but it keeps your AI service safe and accurate because the AI APIs validate or invalidate each other's results for each piece of data.
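The fallback and cost-based approaches described above can be sketched in a few lines. This is only an illustration under stated assumptions: `Transcript`, `transcribe_with_fallback`, `cheapest_adequate` and the 0.8 threshold are hypothetical names and values, not part of the Eden AI or Speechmatics APIs, and a provider is modeled as any function that returns a text and a confidence score.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class Transcript:
    provider: str
    text: str
    confidence: float  # assumed to be in the range 0.0 to 1.0


# Hypothetical provider interface: takes an audio reference, returns a Transcript.
Provider = Callable[[str], Transcript]


def transcribe_with_fallback(
    audio: str,
    primary: Provider,
    backup: Provider,
    min_confidence: float = 0.8,  # illustrative threshold, tune on your own data
) -> Transcript:
    """Call the backup only when the primary is down or not confident enough."""
    try:
        result = primary(audio)
    except Exception:
        # The primary provider failed: use the backup instead.
        return backup(audio)
    if result.confidence >= min_confidence:
        return result
    # The primary answered, but below the confidence bar.
    return backup(audio)


def cheapest_adequate(
    scores: Dict[str, float],
    prices: Dict[str, float],
    min_score: float,
) -> Optional[str]:
    """Return the cheapest provider whose measured score meets the bar, if any."""
    adequate = [name for name, score in scores.items() if score >= min_score]
    if not adequate:
        return None
    return min(adequate, key=lambda name: prices[name])
```

In practice, `scores` would come from your own testing phase on representative data, which is why the same function can return a different provider for each customer.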
We had the chance to talk to Georgina Robertson, Speechmatics' Head of Communications, who agreed to answer some of our questions:
Speechmatics exists to understand every voice. It offers its speech API engine to solution and service providers, who integrate it into their stack irrespective of their industry or use case. Businesses around the world use Speechmatics to accurately understand and transcribe human speech into text regardless of demographic, age, gender, accent, dialect or location.
Founded in 2006, Speechmatics began as a research company. It wasn’t until 2014 that the company became a commercial entity.
Speechmatics offers customers a single speech API that includes both speech-to-text and translation. It operates both in real time and on pre-recorded files (batch), and is available as on-premises and SaaS deployments. It covers 48 languages for speech-to-text and 69 language pairs for translation to and from English.
Real-time Translation and Ursa models using GPUs.
BT Sport (using RedBee Media), Deloitte UK, Ubisoft (Brawlhalla), Veritone, Vail Systems, and Udemy.
Media Captioning, Media Monitoring, Media Broadcast, Educational Technology, and UCaaS & CCaaS.
Eden AI offers a platform that combines all the leading providers within various technologies, one of them being ASR. Since we offer best-in-class accuracy, we think Eden AI customers who use ASR can massively benefit from using our service as part of the overall Eden API package.
To use Speechmatics' ASR technology on Eden AI, start with the Eden AI documentation, then call the API.
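As a rough sketch of what such a call might look like, the snippet below builds (but does not send) an HTTP request that asks Eden AI to transcribe a file with Speechmatics. The endpoint path, the JSON field names (`providers`, `language`, `file_url`) and the bearer-token header are assumptions made for illustration; check them against the Eden AI documentation before use. `build_transcription_request` is a hypothetical helper name.

```python
import json
import urllib.request

# Assumed endpoint: verify the exact path in the Eden AI documentation.
EDEN_AI_URL = "https://api.edenai.run/v2/audio/speech_to_text_async"


def build_transcription_request(
    api_key: str,
    file_url: str,
    language: str = "en",
) -> urllib.request.Request:
    """Build a POST request selecting Speechmatics as the provider (not sent here)."""
    payload = {
        "providers": "speechmatics",  # assumed field name and provider identifier
        "language": language,         # assumed field name
        "file_url": file_url,         # assumed field name
    }
    return urllib.request.Request(
        EDEN_AI_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would submit the job. Handling of the response is left out here because its format depends on the API version you use.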
Eden AI is the future of AI usage in companies. Our platform not only allows you to call multiple AI APIs but also gives you:
You can see the Eden AI documentation here.