We are pleased to announce that Gladia’s Speech-to-Text API technology has been integrated into Eden AI API.
Jean-Louis Quéguiner founded Gladia in Paris, France in 2022 to democratize cutting-edge AI tools for developers. The company expanded its focus to address the underutilization of up to 90% of entéerprise audio data.
Gladia's platform provides various AI tools, including speech to text transcription (asynchronous and live) and audio intelligence like translation, speaker separation etc. These capabilities empower software developers to find suitable AI API models for their specific requirements.
Currently collaborating with call centers, virtual meeting services, and video publishers like Claap, Livestorm, and Selectra, Gladia raised $4 million in seed funding led by New Wave. Their future goals include summarisation and chapterisation.
Eden AI offers Gladia Speech-to-Text solutions on its platform amongst several other Automatic Speech Recognition technologies. We want our users to have access to multiple AI engines and manage them in one place so they can reach high performance, optimize cost and cover all their needs.
There are many reasons for using multiple AI APIs :
You need to set up an AI API that is requested if and only if the main AI API does not perform well (or is down). You can use the confidence score returned or other methods to check provider accuracy.
After the testing phase, you will be able to build a mapping of AI vendors' performance that depends on the criteria that you chose. Each data that you need to process will be then sent to the best API.
This method allows you to choose the cheapest provider that performs well for your data. Let's imagine that you choose Google Cloud API for customer "A" because they all perform well and this is the cheapest. You will then choose Microsoft Azure for customer "B", a more expensive API but Google performances are not satisfying for customer "B". (this is a random example)
This approach is required if you look for extremely high accuracy. The combination leads to higher costs but allows your AI service to be safe and accurate because AI APIs will validate and invalidate each other for each piece of data.
We had the chance to talk to Anna Jelezovskaia, Gladia’s head of Marketing, who agreed to answer some of our questions:
Founded in 2022 by Jean-Louis Quéguiner (ex-VP of AI at OVHCloud) and Jonathan Soto (ex-VP of Engineering at Sigfox), Gladia’s mission was to make the most advanced AI available to any developer. Since then, our API has evolved into an enterprise-grade Language AI product, enabling speech-to-text transcription (async and real-time) and audio intelligence (translation, speaker separation, etc).
We’re currently working on a series of audio intelligence features, including summarization and sentiment analysis.
At Gladia, we provide a single plug-and-play API, unlocking a range of audio and video transcription and analytics functionalities to help companies fully leverage the untapped potential of their unstructured data.
Here are some factors that set us apart:
1. Top-level performance: We provide near error-free transcription accuracy without compromising speed (1h audio <60s) while remaining one of the most affordable API providers on the market.
2. Multilingual capabilities: Unlike any other provider on the market, the API already supports 99 languages for transcription, speaker diarization, and translation. All of our features are designed to be fully multilingual.
3. Security: We take our client’s privacy seriously. Our data hosting is fully compliant with GDPR.
4. Scalability: Flexible pay-as-you-go pricing that allows scaling transcription volumes easily.
Our API is adapted to the most sought-after use cases like virtual meetings translation and analytics, podcast transcription, video subtitling, as well as call center assistance. Our clients come from the corresponding media, SaaS, and call center industries and include Livestorm, Selectra, Musixmatch, and Claap.
Currently, up to 90% of audio data in companies is either lost or unused due to the lack of high-quality and affordable transcription and audio intelligence tools. Our API helps our clients derive more insight from their audio and video data, and power new features in their applications to boost user experience and retention. It can also help with content creation and SEO, among others.
Eden AI is a key partner for Gladia, being the #1 resource for developers to access the most advanced AI models. The Gladia team quickly identified Eden AI as the right place to be integrated. Partnering up with the team was very simple, with a light-speed integration.
All of the Eden AI users can now leverage Gladia’s transcription, and we are very proud to be now part of the landscape. Gladia wants to be part of the Eden AI mission, accelerating the adoption of AI within companies.
We have a lot of super exciting releases coming up this fall. We’re going to expand our market-leading support for transcription in 99 languages to translation, allowing our users to transcribe audio from any language to any language.
We’ll follow up with a package of audio intelligence add-ons like summarization and chapterization – all fully multilingual as well. Our goal is to deliver a truly global API adapted to a variety of real-life use cases and complex environments.There’s a real need for versatile AI solutions right now. As the volume of unstructured data is increasing exponentially, so does the need for optimized tools to extract actionable information from it. Our initial focus is on audio – due to the tremendous ‘dark data’ problem mentioned before – but in the long run, we’re planning to expand to other sources of data.
You'll need some documentation to use Gladia's Speech technologies on Eden AI. Then, call the API:
Eden AI is the future of AI usage in companies. Our platform not only allows you to call multiple AI APIs but also gives you :
You can see Eden AI documentation here.