In this tutorial, we'll show you how to integrate Eden AI's Invoice parser API into your data processing workflow using Dataiku to help streamline your financial operations and free up time for more important tasks.
Eden AI is used by AI experts to quickly test, choose and integrate ready-to-use AI APIs. Managing multiple accounts for each app can be a tough job, but with Eden AI, you can connect and manage all your APIs on a single account.
Since some AI providers can be complex to implement, we wanted to simplify the integration to make AI APIs accessible as fast as possible.
Eden AI allows you to solve multiple AI tasks on Dataiku:
Another advantage of using Eden AI on Dataiku is the flexibility it provides in terms of selecting the best AI features and providers for a particular task, or even combining multiple providers to create a solution more suited for their use case.
Just like Receipt and Resume Parsing, Invoice Parsing is a tool powered by OCR to extract and digitalize meaningful data, Computer Vision to identify structure of the document, and NLP techniques to pin down the fields. Invoice parser technology extracts key information from an invoice (.pdf, .png or .jpg format) such as the invoice ID, total amount due, invoice date, customer name, etc.
Invoice Processing implies the necessity of software and technology to automate the processing and management of invoices. It includes tasks such as capturing invoice data, validating it in comparison to purchase orders, and routing it for approval, payment and archiving. The goal of AI in invoice processing is to improve efficiency, accuracy, and speed in handling invoices without any human intervention.
If you're looking for an easier and faster way to execute invoice parsing API in Dataiku, skip the tutorial and watch the video below:
The steps to extract information from invoices using Eden AI invoice parser in Dataiku are as follows:
To use the Eden AI API with Dataiku, you’ll need the following requirements:
To begin with, you’ll need to create a new Dataiku project or open an existing one:
Once your project is open, click on "New Dataset" located on the right-hand side panel, then select the "Folder" option to create a folder dataset:
Next, you’ll need to upload your invoices in the folder as follows:
Once your invoices are imported into the folder, you’ll need to create a new recipe by clicking on the action button. Then, select the new code recipe:
You can choose the type of recipe you want to create, such as Python or Shell. You will also need to create a dataset output for the recipe and give it a name:
After creating the recipe, you can start coding the connection to Eden AI invoice parser. You’ll need to define the invoice parser endpoint that you want to connect to and call the API with your key:
Once you have retrieved the data from the API, you’ll need to put the response in a Pandas dataframe. In this example, we chose to extract some basic information from the invoice, such as total, subtotal, customer name, and customer address:
Once you have coded your Eden AI invoice call and returned the data in a structured format (Pandas dataframe), you’ll need to import the invoices from the folder dataset.
Finally, you’ll need to call the function defined early on and apply it to the invoices with the providers that you want.
Last but not least, don’t forget to write the dataframe response into the output dataset!
By following these steps, you’ll be able to get the extracted information from the invoices in a structured format as follow :
Congrats 🥳 You're all set and ready to automate your invoice processing with Dataiku!
You can access to the full code sample for the recipe here :
If you're interested in more low-code tools, have a look at our step-by-step tutorials on how to bring AI to your application with Power Apps, Google App Script, Retool, Make, IFTTT, n8n, Bubble, and Zapier.