Title: Integrated framework for semantic text mining and ontology construction using inference engine

Authors: Purnachand Kollapudi; G. Narsimha

Addresses: Department of CSE, JNTU College of Engineering, Kakinada, Andhra Pradesh, 533003, India; Department of CSE, JNTUH College of Engineering, Jagitial, Telangana State, 505501, India

Abstract: Traditional clustering algorithms are generally keyword based or index based rather than semantic based. They have difficulty identifying synonymy and polysemy because of the high dimensionality of text data. Ontologies have been identified as a way to overcome these difficulties. In this paper, we propose a framework that automates the extraction of concepts or terms with the support of: a) our proposed metric, the term rank identifier (TRI), which measures frequent terms; b) the semantically enriched terms (SETs) clustering algorithm, which calculates the semantic relation between terms using WordNet; c) automatic ontology building, using inference engines, for the concepts extracted by SET clustering. The experimental results show that our proposed TRI metric and SET clustering algorithm perform significantly well.

Keywords: clustering; TRI; term rank identifier; SETs; semantically enriched terms; ontology; inference engine; semantic relation; text classification; semantic based; knowledge terms.

DOI: 10.1504/IJDS.2017.084766

International Journal of Data Science, 2017 Vol.2 No.2, pp.138 - 154

Received: 02 Apr 2015
Accepted: 02 Feb 2016

Published online: 26 Jun 2017
