Authors: Amjad Rehman
Addresses: College of Computer and Information Sciences, Prince Sultan University, Riyadh 11586, Saudi Arabia
Abstract: Precise character segmentation is the only solution towards higher optical character recognition (OCR) accuracy. In cursive script, overlapped characters are serious issue in the process of character segmentations as characters are deprived from their discriminative parts using conventional linear segmentation strategy. Hence, nonlinear segmentation is an utmost need to avoid loss of characters parts and to enhance character/script recognition accuracy. This paper presents an improved approach for nonlinear segmentation of the overlapped characters in handwritten roman script. The proposed approach presents a set of rules that are derived heuristically to search character boundaries of the cursive script that is validated by using ensemble of neural confidence. Accordingly, correct boundaries are retained and incorrect are removed based on ensemble neural networks vote. Finally, based on verified valid segmentation points, characters are segmented nonlinearly. For fair comparison CEDAR benchmark database is experimented. The experimental results are much better than conventional linear character segmentation techniques reported in the state of art. Ensemble neural network play vital role to enhance character segmentation accuracy as compared to individual neural networks.
Keywords: nonlinear character segmentation; ensemble neural networks; analytical approach; CEDAR database.
International Journal of Computational Vision and Robotics, 2020 Vol.10 No.4, pp.275 - 288
Received: 30 Oct 2018
Accepted: 13 May 2019
Published online: 01 May 2020 *