Information retrieval by mining text and image Online publication date: Mon, 07-Nov-2016
by R. Seethalakshmi; K.S. Ravichandran; P. Swaminathan; A.N. Alagappan
International Journal of Advanced Intelligence Paradigms (IJAIP), Vol. 8, No. 4, 2016
Abstract: We have wonderful scripts which are lying to be digitised in Tamil. Tamil is a language which is enriched with several ancient scripts. Optical character recognition is done in Tamil in order to digitise the scripts. The optical character recognition consists of scanning phase, preprocessing phase, segmentation phase and recognition phase. The retrieved text is stored as an archive in the database. The archive also encompasses the original images. The front end GUI contains the search engine wherein which the keyword is put. The crawler crawls in the database and retrieves the searched page and the image based on context. The retrieved pages will be displayed in the order of relevant context and the appropriate page is clicked and fetched as desired.
Online publication date: Mon, 07-Nov-2016
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Advanced Intelligence Paradigms (IJAIP):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email firstname.lastname@example.org