Top 10 Object Detection APIs

Updated: Dec 24, 2021

This article is brought to you by the Eden AI team. We allow you to test and use in production a large number of AI engines from different providers directly through our API and platform. You are a solution provider and want to integrate Eden AI, contact us at :


In this article, we are going to see how we can easily integrate an Object Detection engine in your project and how to choose and access the right engine according to your data.


Object detection is a computer vision technique that works to identify and locate objects within an image or video. Specifically, object detection draws bounding boxes around these detected objects, which allow you to locate where objects are in a given scene. Object detection is different from image recognition which labels an entire image:


In the beginning of the 2000s, the first object detection engines were handmaded due to the lack of effective image representation at that time.

Originally proposed in 2005 by N. Dalal and B. Triggs, the Hog Detector is an improvement of the scale invariant feature transform and shape contexts of its time.HOG works with something called blocks, a dense pixel grid in which gradients are constituted from the magnitude and direction of change in the intensities of pixels within the block. HOGs are widely known for their use in pedestrian detection. To detect objects of different sizes, the HOG detector rescales the input image for multiple times while keeping the size of a detection window unchanged.

Between 2005 and 2015, multiple object detection evolutions were created: Deformable Part-based Model (DPM) then deep learning approaches (AlexNet, RCNN? SSPnet, FastRCNN, FPN, etc.).

Top 10 Object Detection:

Microsoft Azure - Available on Eden AI

Object detection is similar to tagging, but the API returns the bounding box coordinates (in pixels) for each object found in the image. For example, if an image contains a dog, cat and person, the Detect operation will list those objects with their coordinates in the image. You can use this functionality to process the relationships between the objects in an image. It also lets you determine whether there are multiple instances of the same object in an image.

Available on Eden AI

Chooch AI

Chooch AI is a visual detection platform that quickly replicates human visual tasks and processes with AI on the edge & in the cloud. With high accuracy and fast response time, Chooch AI is the leader both in computer vision training and deployment for true Visual AI on the edge and in the cloud. Chooch provides complete artificial intelligence solutions in healthcare, geospatial, media, security, retail and industrial applications.


Imagga is a computer vision artificial intelligence company. Imagga Image Recognition API features auto-tagging, auto-categorization, face recognition, visual search, content moderation, auto-cropping, color extraction, custom training and ready-to-use models. Available in the Cloud and on On-Premise. It is currently deployed in leading digital asset management solutions and personal cloud platforms and consumer facing apps.

Google Cloud - Available on Eden AI

The Vision API can detect and extract multiple objects in an image with Object Localization.

Object localization identifies multiple objects in an image and provides a LocalizedObjectAnnotation for each object in the image. Each LocalizedObjectAnnotation identifies information about the object, the position of the object, and rectangular bounds for the region of the image that contains the object.

Available on Eden AI


Cloudmersive brings its customers a complete portfolio of APIs across document conversion and processing, deep learning OCR, image recognition, NLP, etc. Cloudmersive powerful object detection API automatically identify the location and type of objects, and people in an image.

AWS - Available on Eden AI

Amazon Rekognition Image can return the bounding box for common object labels such as cars, furniture, apparel or pets. Bounding box information isn't returned for less common object labels. It provides bounding boxes to find the exact locations of objects in an image, count instances of detected objects, or to measure an object's size using bounding box dimensions.