Title: Distilling object detectors with mask-guided feature and relation-based knowledge

Authors: Liang Zeng; Liyan Ma; Xiangfeng Luo; Yinsai Guo; Xue Chen

Addresses: School of Computer Engineering and Science, Shanghai University, Shanghai, 200444, China ' School of Computer Engineering and Science, Shanghai University, Shanghai, 200444, China ' School of Computer Engineering and Science, Shanghai University, Shanghai, 200444, China ' School of Computer Engineering and Science, Shanghai University, Shanghai, 200444, China ' School of Computer Engineering and Science, Shanghai University, Shanghai, 200444, China; State Key Lab of Mathematical Engineering and Advanced Computing, Wuxi, 214083, China

Abstract: Knowledge distillation (KD) is an effective technique for network compression and model accuracy enhancement in image classification, semantic segmentation, pre-trained language model, and so on. However, existing KD methods are specialised for image classification and cannot be used effectively for object detection tasks, with the following two limitations: the imbalance of foreground and background instances and the neglect distillation of relation-based knowledge. In this paper, we present a general mask-guided feature and relation-based knowledge distillation framework (MAR) consisting of two components, mask-guided distillation, and relation-based distillation, to address the above problems. The mask-guided distillation is designed to emphasise students' learning of close-to-object features via multi-value masks, while relation-based distillation is proposed to mimic the relational information between different feature pixels on the classification head. Extensive experiments show that our methods achieve excellent AP improvements on both one-stage and two-stage detectors. Specifically, faster R-CNN with ResNet50 backbone achieves 40.6% in mAP under 1 × schedule on the COCO dataset, which is 3.2% higher than the baseline and even surpasses the teacher detector.

Keywords: knowledge distillation; multi-value mask; object detection.

DOI: 10.1504/IJCSE.2024.137291

International Journal of Computational Science and Engineering, 2024 Vol.27 No.2, pp.195 - 203

Received: 24 Oct 2022
Accepted: 28 Dec 2022

Published online: 11 Mar 2024 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article