Title: Object detection based on multiple trick feature pyramid networks and dynamic balanced L1 loss

Authors: Yiming Yin; Guangjian Zhang

Addresses: School of Artificial Intelligence, Chongqing University of Technology, Banan District, Chongqing, China ' School of Artificial Intelligence, Chongqing University of Technology, Banan District, Chongqing, China

Abstract: Although the performance of the object detection has been significantly optimised in recent years, there is still a lot of room for designing multi-scale feature fusion methods and designing loss functions. Specifically, we propose Multiple Trick Feature Pyramid Networks (MT-FPN), by using various techniques such as feedback information, global module, attention mechanism, and fusion of refined information, to solve the problem of insufficient multi-scale feature fusion. We also propose Dynamic Balanced L1 Loss (DBLL), by utilising dynamic strategies and solving the derivative discontinuity problem, in order to help relieve the inconsistent problem between the dynamic training process and the fixed parameters. Moreover, by replacing FPN with MT-FPN, our Average Precision (AP) on Microsoft Common Objects in Context (MSCOCO) is 5.1 points and 3.8 points higher than FPN Faster R-CNN and Libra R-CNN, respectively. Without any bells and whistles, our experiments also show that the combined application of MT-FPN and DBLL achieves competitive performance compared with most advanced detectors on MS COCO benchmark.

Keywords: object detection; feature pyramid network; dynamic training; loss function.

DOI: 10.1504/IJWMC.2022.122489

International Journal of Wireless and Mobile Computing, 2022 Vol.22 No.1, pp.93 - 103

Received: 17 Nov 2021
Accepted: 19 Feb 2022

Published online: 27 Apr 2022 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article