You can view the full text of this article for free using the link below.

Title: Research on multi-scale face detection based on graph embedded Swin-Transformer

Authors: Fang Wang; Huang Zhong

Addresses: School of Intelligent Manufacturing, Wuxi Vocational College of Science and Technology, Wuxi, Jiangsu, China ' Suzhou Triwin EST Co., Ltd., Suzhou, Jiangsu, China

Abstract: The existing Swin-Transformer has achieved great success in the field of object detection due to its lightweight, hierarchical and efficient long-distance dependency modelling advantages. However, in complex scenes, the impact of scale changes seriously restricts the performance of face detection. Therefore, this article proposes a multi-scale face detection network based on graph embedding Swin-Transformer. This network fully utilises multi-scale features and contextual information, effectively improving the performance of multi-scale face detection. Specifically, we designed a multi-feature fusion module based on Swin Transformer, which effectively integrates deep features of advanced information with shallow texture features. In addition, to effectively model the contextual information of the target, we use embedding graph convolution operations. We conducted extensive comparative experiments on publicly available data sets, and the experimental results showed that the proposed model achieved higher recognition performance.

Keywords: multi-scale face detection; Swin-Transformer; graph convolution embedded; multi-scale feature fusion.

DOI: 10.1504/IJCAT.2025.148169

International Journal of Computer Applications in Technology, 2025 Vol.76 No.1/2, pp.94 - 105

Received: 27 Aug 2023
Accepted: 14 Jun 2024

Published online: 27 Aug 2025 *

Full-text access for editors Full-text access for subscribers Free access Comment on this article