Title: Pseudo Zernike moments-based approach for text detection and localisation from lecture videos

Authors: Belkacem Soundes; Guezouli Larbi; Zidat Samir

Addresses: Department of Computer Science, University of Batna 2, Algeria ' LaSTIC Laboratory, Department of Computer Science, University of Batna 2, Algeria ' LaSTIC Laboratory, Department of Computer Science, University of Batna 2, Algeria

Abstract: Scene text presents challenging characteristics mainly related to acquisition circumstances and environmental changes resulting in low quality videos. In this paper, we present a scene text detection algorithm based on pseudo Zernike moments (PZMs) and stroke features from low resolution lecture videos. Algorithm mainly consists of three steps: slide detection, text detection and segmentation and non-text filtering. In lecture videos, slide region is a key object carrying almost all important information; hence slide region has to be extracted and segmented from other scene objects considered as background for later processing. Slide region detection and segmentation is done by applying pseudo Zernike moment's based on RGB frames. Text detection and extraction is performed using PZMs segmentation over V channel of HSV colour space, and then stroke feature is used to filter out non-text region and to remove false positives. The algorithm is robust to illumination, low resolution and uneven luminance from compressed videos. Effectiveness of PZM description leads to very few false positives comparing to other approached. Moreover resulting images can be used directly by OCR engines and no more processing is needed.

Keywords: text localisation; text detection; pseudo Zernike moments; PZMs; slide region detection.

DOI: 10.1504/IJCSE.2019.100231

International Journal of Computational Science and Engineering, 2019 Vol.19 No.2, pp.274 - 283

Available online: 17 Jun 2019 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article