Title: Real time noisy dataset implementation of optical character identification using CNN

Authors: R. Anand; T. Shanthi; R.S. Sabeenian; S. Veni

Addresses: Department of Electronics and Communication Engineering, Sona SIPRO, Sona College of Technology, Salem – 636005, Salem, Tamil Nadu 636005, India ' Department of Electronics and Communication Engineering, Sona SIPRO, Sona College of Technology, Salem – 636005, Salem, Tamil Nadu 636005, India ' Department of Electronics and Communication Engineering, Sona SIPRO, Sona College of Technology, Salem – 636005, Salem, Tamil Nadu 636005, India ' Department of Electronics and Communication Engineering, Amrita School of Engineering, Amrita University, Coimbatore, Amrita Vishwa Vidyapeetham 641112, India

Abstract: Optical character recognition (OCR) is one of the major research problem in real time applications and it is used to recognise all the characters in an image. As English is a universal language, character recognition in English is a challenging task. Deep learning approach is one of the solution for the recognition of optical characters. Aim of this research work is to perform character recognition using convolutional neural network with LeNET architecture. Dataset used in this work is scanned passport dataset for generating all the characters and digits using tesseract. The dataset has training set of 60,795 and testing set of 7,767. Total samples used are 68,562 which is separated by 62 labels. Till now there is no research on predicting all 52 characters and ten digits. The algorithm used in this work is based on deep learning with appropriate some layer which shows significant improvement in accuracy and reduced the error rate. The developed model was experimented with test dataset for prediction and can produce 93.4% accuracy on training, and 86.5% accuracy on the test dataset.

Keywords: convolutional neural networks; CNN; scanned passport; deep learning; classification; optical character recognition; OCR; discrete wavelet transform; DWT.

DOI: 10.1504/IJIE.2020.104646

International Journal of Intelligent Enterprise, 2020 Vol.7 No.1/2/3, pp.67 - 80

Received: 27 Jun 2018
Accepted: 25 Sep 2018

Published online: 24 Jan 2020 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article