Title: A new fast DBSCAN using dual-space analysis and colour integral volume for document image segmentation

Authors: Zakia Kezzoula; Djamel Gaceb

Addresses: Department of Computer Science, LIMOSE Laboratory, University M'Hamed Bougara of Boumerdes, Algeria ' Department of Computer Science, LIMOSE Laboratory, University M'Hamed Bougara of Boumerdes, Algeria

Abstract: The segmentation of the colour document images is an essential step allowing facilitating and improving the stages of characterisation and interpretation of the information contained in these documents. Recent systems of automatic processing and recognition of document images, which use separation of colorimetric layers, are more efficient compared to conventional systems, only based on binary or grey levels images. This task requires non-supervised pixel segmentation or clustering techniques to separate the document image to a variable and unknown number of colour layers. The methods based on density are widely used in this context at pixel level, such as the DBSCAN method and its different variants, very robust to the noise and more adapted to the degradations present on document images, but who suffer from a great complexity. In this context, we propose a new faster DBSCAN variant using the volume integral in colorimetric space for the first time to significantly reduce calculation time. The combination of the two spaces, Cartesian and colorimetric has also accelerated the method and improved its performance on document images with different challenges. The results obtained show the effectiveness of the proposed approach, which was marked by significant improvement in the quality of segmentation and reduction in computed time.

Keywords: clustering; DBSCAN; region growing; document image segmentation; fast I2SDBSCAN; 3D colour histogram; integral volume.

DOI: 10.1504/IJCVR.2025.146298

International Journal of Computational Vision and Robotics, 2025 Vol.15 No.3, pp.395 - 416

Accepted: 07 May 2024
Published online: 19 May 2025 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article