ailia Tech BLOG

M2Det : Highly Accurate Object Detection Model

This is an introduction to「M2Det」, a machine learning model that can be used with ailia SDK. You can easily use this model to create AI applications using ailia SDK as well as many other ready-to-use ailia MODELS.


Overview

M2Det is a highly accurate object detection model proposed in November 2018. It can detect bounding boxes of objects from the 80 categories in COCO.

M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid NetworkFeature pyramids are widely exploited by both the state-of-the-art one-stage object detectors (e.g., DSSD, RetinaNet…arxiv.org

Conventional object detection uses an image classification backbone (Mobilenet, VGG, ResNet, etc.) to compute bounding boxes. For example, in SSD, after extracting features with the image classification backbone, the bounding box is calculated by adding a similar backbone in the later stage.

The same was true for ReinaDet, which uses Feature Pyramids, and although it uses more hierarchical features than SSD, the backbone of object detection was based on image classification.

In M2Det, after extracting features with the image classification backbone, a specialized backbone is used for object detection to achieve higher accuracy.

(Source:https://arxiv.org/abs/1811.04533

This backbone specialized for object detection is called MLFPN (Multi-Level Feature Pyramid Network), which uses TUM (Thinned U-shape Module) in a hierarchical manner.

(Source:https://arxiv.org/abs/1811.04533

TUM has the following structure.

(Source:https://arxiv.org/abs/1811.04533

Performance of M2Det

M2Det performs better than YOLOv3 and RetinaDet.

(Source:https://arxiv.org/abs/1811.04533

VGG-16 and ResNet-101 are used as backbone for M2Det. VGG-16 is faster and ResNet-101 is more accurate.

(Source:https://arxiv.org/abs/1811.04533

Usage

The ailia SDK sample below allows you to use M2Det, with a VGG-16 based M2Det that takes 3x512x512 as input.

ailia-ai/ailia-modelsShape : (1, 3, 448, 448) Range : [0.0, 1.0] category : [0,80] (coco dataset classes, 0 is reserved for backgrounds)…github.com

You can use the following command to perform object detection using M2Det on a web camera.

$ python3 m2det.py -v 0

Related topics

YOLOv3 : A machine learning model to detect the position and type of an objectThis is an introduction to「YOLOv3」, a machine learning model that can be used with ailia SDK. You can easily use this…medium.com

YOLOv4 : A Machine Learning Model to Detect the Position and Type of an ObjectThis is an introduction to「YOLOv4」, a machine learning model that can be used with ailia SDK. You can easily use this…medium.com

YOLOv5 : The Latest Model for Object DetectionThis is an introduction to「YOLOv5」, a machine learning model that can be used with ailia SDK. You can easily use this…medium.com

MobilenetSSD : A Machine Learning Model for Fast Object Detectionmedium.com


ailia Inc. has developed ailia SDK, which enables cross-platform, GPU-based rapid inference.

ailia Inc. provides a wide range of services from consulting and model creation, to the development of AI-based applications and SDKs. Feel free to contact us for any inquiry.