OCR Table or Table Parser refers to the process of using Optical Character Recognition (OCR) technology to extract data from tables within documents, such as scanned PDFs, images, and other types of files.
Since tables may contain a lot of structured data, extracting data from tables using general OCR can be challenging because tables often have a complex layout with rows and columns, and the data can be in different formats, such as numbers, text, or dates. Therefore, Table OCR lets you extract tabular data from PDFs and images in one shot by using advanced Image Processing and Machine Learning algorithms to automatically identify and extract the tabular data from the documents.
Other document types like receipts, invoices, resumes, IDs, etc., also follow the same layout and benefit from Table OCR's capabilities.
The first step to getting started is to set Axios, a promise-based HTTP client for the browser and Node.js, that will allow you to call Eden AI API.
Next, you'll need to initialize the File System module in order to access local files on your computer.
Finally, you'll need to create your multipart/formdata parameters form:
You are now ready to process your file into Eden AI OCR Table API. You can process files in .pdf, .jpg, .png or .jpeg and documents in many languages.
To perform OCR Table, you'll need to create an account on Eden AI for free. Then, you will be able to get your API key directly from the homepage with free credits offered by Eden AI.
For example, we called two different OCR Table engines. Once the parameters values are passed, you can configure your request:
Then, you need to create launchJob() function that will execute POST request:
Finally, you have to create the getJob() function that will execute GET request with the Job ID of your POST request:
You will first get this response:
Once the request is done (status : finished), you will be able to get the result for OCR Table task:
Using Table Extraction with Eden AI API is quick and easy.
We offer a unified API for all providers: simple and standard to use, with a quick switch between providers and an access to the specific features of each provider.
The JSON output format is the same for all suppliers thanks to Eden AI's standardisation work. The response elements are also standardised thanks to Eden AI's powerful matching algorithms.
With Eden AI you have the possibility to integrate a third party platform: we can quickly develop connectors. To go further and customize your OCR Table request with specific parameters, check out our documentation.
You can directly start building now. If you have any questions, don't hesitate to schedule a call with us!Get started