A new text categorisation strategy: prototype design and experimental analysis
by N. Venkata Sailaja; L. Padma Sree; N. Mangathayaru
International Journal of Knowledge and Learning (IJKL), Vol. 13, No. 2, 2020

Abstract: Since a decade, ample amount of text data is being generated through various web sources in online or offline scenarios. This huge amount of data is mainly inconsistent and non-structured format, so hard to process through computing machines available. With the advent of computers and the information age, statistical and analytical problems have also grown both in the size and complexity. Text classification using various machine learning mechanisms encounters the difficulty of the high dimensionality of attributes vector. Therefore, a feature selection technique is very much required to discard irrelevant as well as noisy attributes from the feature set vector so that the ML algorithms can work efficiently. In this paper, a hybrid method is proposed for text documents classification. Further, proposed method's performance is evaluated on standard datasets, i.e., Reuters-21578 and 20 newsgroups. We opted 'bydate' version of the dataset containing 18,941 documents. Through our experiments, we attempted to explore the various performance measures.

Online publication date: Thu, 16-Apr-2020

The full text of this article is only available to individual subscribers or to users at subscribing institutions.

 
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.

Pay per view:
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.

Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Knowledge and Learning (IJKL):
Login with your Inderscience username and password:

    Username:        Password:         

Forgotten your password?


Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.

If you still need assistance, please email subs@inderscience.com