Title: Context-based information retrieval from large heterogeneous data sources using semantics and polarity-based ranking
Authors: S. Subitha; S. Sujatha
Addresses: Department of Science and Humanities, Anna University, Chennai, India ' Department of Computer Applications, Anna University of Technology, BIT Campus, Trichy, India
Abstract: Information retrieval, in its naïve form is not suitable for the current information age, where all possible documents are available in digital formats. This paper presents an effective semantic and polarity-based information retrieval strategy for heterogeneous data sets. Context of the input query is identified and all the documents that satisfy the polarity and context of the input query are retrieved from the data source. Another major advantage of this method is that it can work on any data format, utilising the tags associated with the multimedia documents (image, audio and video) to identify the context. Experiments show effective retrieval and filtering rates, hence making this method a reliable and generic architecture for information retrieval.
Keywords: information retrieval; semantics; context; polarity; Big Data; data heterogeneity; text processing; tokenisation; sentiment analysis; Hadoop.
DOI: 10.1504/IJCAET.2017.086957
International Journal of Computer Aided Engineering and Technology, 2017 Vol.9 No.4, pp.497 - 507
Received: 22 Apr 2016
Accepted: 27 Aug 2016
Published online: 03 Oct 2017 *