How to Convert Speech to Text (STT / ASR) with JavaScript?

In this tutorial, you will learn how to convert Speech to Text with JavaScript in 5 minutes. Eden AI provides an easy and developer-friendly API that allows you to transcribe your audio files.

What is a Speech-to-Text API?

A Speech-to-Text (STT) API is a software service that uses speech recognition technology to transcribe spoken words into written text. It can be integrated into applications to automate transcription.

Speech-to-Text API result on Eden AI

An STT API can also provide additional features such as automatic punctuation, speaker diarization, and the ability to transcribe audio files in real time or in batches.

Get Started with Speech-to-Text API using JavaScript

The first step is to set up Axios, a promise-based HTTP client for the browser and Node.js, which will allow you to call the Eden AI API.

Next, you'll need to initialize the File System module in order to access local files on your computer.
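As a minimal sketch of this file-access step, the helper below uses Node's built-in fs and path modules to check that an audio file exists and to open it as a readable stream. The helper name openAudioFile and the shape of the object it returns are assumptions made for illustration; they are not part of the Eden AI API.

```javascript
const fs = require('fs');
const path = require('path');

// Hypothetical helper: resolves a local path, fails early if the file is
// missing, and returns a readable stream that can later be attached to a form.
function openAudioFile(filePath) {
  const resolved = path.resolve(filePath);
  if (!fs.existsSync(resolved)) {
    throw new Error(`Audio file not found: ${resolved}`);
  }
  return {
    name: path.basename(resolved),
    stream: fs.createReadStream(resolved),
  };
}
```

Failing early on a missing file avoids sending an empty upload and getting a less readable error back from the API.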

Finally, you'll need to create the multipart/form-data form that will hold your request parameters.

How to Transcribe an Audio File with JavaScript

You are now ready to send your audio file to the Eden AI Speech-to-Text API. The list of supported languages is available in our documentation.

1. Get a Speech-to-Text API Key on Eden AI

To perform Speech-to-Text, you'll first need to create a free account on Eden AI. You can then get your API key directly from the homepage, along with the free credits offered by Eden AI.
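One common way to keep the key out of your source code is to read it from an environment variable. The sketch below assumes a variable named EDENAI_API_KEY; that name is chosen for illustration and is not prescribed by Eden AI.

```javascript
// Reads the API key from the environment and fails with a clear message
// when it has not been set. The variable name EDENAI_API_KEY is an assumption.
function getApiKey(env = process.env) {
  const key = env.EDENAI_API_KEY;
  if (!key || key.trim() === '') {
    throw new Error('EDENAI_API_KEY is not set');
  }
  return key.trim();
}
```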

2. Let’s Transcribe your Audio File

Now that you have initialized the required JavaScript modules and obtained your API key, you can transcribe your audio file. Eden AI lets you choose among a wide range of Speech-to-Text engines. The list of Speech-to-Text providers available on Eden AI is in our documentation.

Start by adding the parameter values for your request to the form data.
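A minimal sketch of this step, under stated assumptions: the field names providers, language, and file are what we would expect for this kind of request and should be checked against the Eden AI documentation, and the function name addTranscriptionParams is hypothetical. The form argument only needs an append(name, value) method, which the form-data npm package provides.

```javascript
// Adds the request parameters to an existing multipart form.
// Field names (providers, language, file) are assumptions to verify
// against the Eden AI documentation.
function addTranscriptionParams(form, { providers, language, file }) {
  if (!Array.isArray(providers) || providers.length === 0) {
    throw new Error('At least one provider is required');
  }
  // Several engines can be requested at once as a comma-separated list.
  form.append('providers', providers.join(','));
  form.append('language', language);
  // `file` is typically a readable stream such as fs.createReadStream(path).
  form.append('file', file);
  return form;
}
```

With the form-data package installed, this could be called as addTranscriptionParams(new FormData(), { providers: [...], language: 'en', file: fs.createReadStream('./audio.mp3') }), where the provider names, language code, and file path are placeholders.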

In this example, we call two different Speech-to-Text engines. Once the parameter values are set, you can configure your request.
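The configuration can be sketched as an Axios-style request object. The endpoint URL and the Bearer authorization scheme below are assumptions to confirm in the Eden AI documentation, and buildRequestOptions is a hypothetical helper name. The getHeaders() call is the method the form-data package exposes to produce the multipart content-type header with its boundary.

```javascript
// Assumed endpoint for asynchronous Speech-to-Text jobs; confirm it in the
// Eden AI documentation before relying on it.
const STT_ASYNC_URL = 'https://api.edenai.run/v2/audio/speech_to_text_async';

// Builds an Axios-style request config (method, url, headers, data).
function buildRequestOptions(apiKey, form, url = STT_ASYNC_URL) {
  // form.getHeaders() exists on the form-data package; fall back to no extra
  // headers for form implementations that do not provide it.
  const formHeaders = typeof form.getHeaders === 'function' ? form.getHeaders() : {};
  return {
    method: 'POST',
    url,
    headers: { ...formHeaders, authorization: `Bearer ${apiKey}` },
    data: form,
  };
}
```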

Then, create a launchJob() function that executes the POST request.
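A minimal sketch of such a function follows. To keep it testable without network access, it takes the HTTP client as an argument; passing the axios module itself works because axios exposes a request(config) method. The public_id field used as the job ID is an assumption about the response body, and this signature may differ from the one in the original tutorial.

```javascript
// Sends the POST request and returns the job ID.
// `client` is any object with an Axios-compatible request(config) method.
// The `public_id` field name is an assumption about the response body.
async function launchJob(client, options) {
  const response = await client.request(options);
  const jobId = response.data && response.data.public_id;
  if (!jobId) {
    throw new Error('No job ID found in the response');
  }
  return jobId;
}
```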

Finally, create a getJob() function that executes a GET request using the job ID returned by your POST request.
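The sketch below shows one way to write it, again with the HTTP client passed in (axios provides a compatible get(url, config) method). The URL pattern, which appends the job ID to the assumed Speech-to-Text endpoint, and the Bearer header are assumptions to check against the Eden AI documentation.

```javascript
// Fetches the current state of a job by its ID and returns the response body.
// `client` is any object with an Axios-compatible get(url, config) method.
// The default base URL and the "<base>/<jobId>" pattern are assumptions.
async function getJob(
  client,
  jobId,
  apiKey,
  baseUrl = 'https://api.edenai.run/v2/audio/speech_to_text_async'
) {
  const url = `${baseUrl}/${encodeURIComponent(jobId)}`;
  const response = await client.get(url, {
    headers: { authorization: `Bearer ${apiKey}` },
  });
  return response.data;
}
```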

The first response typically arrives before the transcription is complete, so the job has to be queried again until it is done.
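Because the job runs asynchronously, a small polling loop is a common way to wait for it. The sketch below is one possible approach, not Eden AI's prescribed method: waitForJob is a hypothetical helper, the interval and attempt limits are arbitrary defaults, and it only recognises the finished status mentioned in this tutorial, so any error state the API may report is not handled.

```javascript
// Polls a job until its status is "finished" or the attempts run out.
// `fetchJob` is any async function that returns the job body, for example
// a wrapper around a getJob() call for a specific job ID.
async function waitForJob(fetchJob, { intervalMs = 2000, maxAttempts = 30 } = {}) {
  for (let attempt = 1; attempt <= maxAttempts; attempt += 1) {
    const job = await fetchJob();
    if (job && job.status === 'finished') {
      return job;
    }
    // Wait before the next attempt, except after the last one.
    if (attempt < maxAttempts) {
      await new Promise((resolve) => setTimeout(resolve, intervalMs));
    }
  }
  throw new Error(`Job not finished after ${maxAttempts} attempts`);
}
```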

Once the job is done (status: finished), you can retrieve the result of the STT task.
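As an illustration of reading that result, the sketch below collects one transcript per provider from a finished job. The response shape it expects, a results object keyed by provider name with a text field in each entry, is an assumption; check the actual response format in the Eden AI documentation. The name extractTranscripts is hypothetical.

```javascript
// Collects the transcript returned by each provider from a finished job.
// Assumed shape: { status: 'finished', results: { <provider>: { text: '...' } } }
function extractTranscripts(job) {
  if (!job || job.status !== 'finished') {
    throw new Error('Job is not finished yet');
  }
  const transcripts = {};
  for (const [provider, result] of Object.entries(job.results || {})) {
    // Skip providers that returned no usable text (for example, on failure).
    if (result && typeof result.text === 'string') {
      transcripts[provider] = result.text;
    }
  }
  return transcripts;
}
```

Returning a plain provider-to-text map makes it easy to compare the output of the engines you called side by side.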

Benefits of using Speech-to-Text API with Eden AI

Using Speech-to-Text with the Eden AI API is quick and easy.

Multiple AI engines in one API

Save time and cost

We offer a unified API for all providers: simple and standard to use, with quick switching between providers and access to the specific features of each provider.

Easy to integrate

The JSON output format is the same for all providers thanks to Eden AI's standardisation work. The response elements are also standardised thanks to Eden AI's matching algorithms.


With Eden AI, you can also integrate a third-party platform: we can quickly develop connectors. To go further and customize your Speech-to-Text request with specific parameters, check out our documentation.
