Title: Text and image area classification in mobile scanned digitised documents

Authors: Anne-Sophie Ettl; Arjan Kuijper

Addresses: GRIS, Department of Computer Science, TU Darmstadt, Fraunhoferstrasse 5, D-64283, Germany ' Fraunhofer IGD and GRIS, Department of Computer Science, TU Darmstadt, Fraunhoferstrasse 5, D-64283, Germany

Abstract: Post processing and automatic interpretation of images plays an increasingly important role in the mobile area. Both for the efficient compression and for the automatic evaluation of text, it is useful to store text content as textual information rather than as graphics information. For this purpose pictures from magazines are recorded with the camera of a smartphone and classified according to text and image areas. In this work established desktop procedures are presented and analysed in terms of their applications on mobile devices. Based on these methods, an approach for image segmentation and classification on mobile devices is developed, taking into account the limited resources of these mobile devices.

Keywords: text classification; image classification; mobile documents; scanned documents; digitised documents; digitising; mobile devices; smartphones; smartphone camera; image segmentation.

DOI: 10.1504/IJAPR.2014.063749

International Journal of Applied Pattern Recognition, 2014 Vol.1 No.2, pp.173 - 198

Received: 27 Nov 2013
Accepted: 24 Dec 2013

Published online: 20 Jul 2014 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article