Title: Using shallow semantic analysis and graph modelling for document classification

Authors: Przemysław Maciołek; Grzegorz Dobrowolski

Addresses: AGH University of Science and Technology, Al. Adama Mickiewicza 30, 30-059 Kraków, Poland; AGH University of Science and Technology, Al. Adama Mickiewicza 30, 30-059 Kraków, Poland

Abstract: Modelling text content with a graph-based approach driven by shallow semantic analysis allows additional information about the meaning of the text to be extracted. This paper discusses two novel algorithms based on this idea. They are compared against the 'legacy' bag-of-words and Schenker et al. approaches in a nearest-neighbour (NN) document classification task.

Keywords: document classification; graph modelling; shallow semantic analysis; semantic networks; text content; text meaning; nearest neighbour.

DOI: 10.1504/IJDMMM.2013.053692

International Journal of Data Mining, Modelling and Management, 2013 Vol.5 No.2, pp.123 - 137

Published online: 29 Jul 2014
