Authors: Neda AleEbrahim; Mohammad Fathian
Addresses: Industrial Engineering Department, Iran University of Science and Technology, Narmak, Tehran, 16846-13114, Iran ' Industrial Engineering Department, Iran University of Science and Technology, Narmak, Tehran, 16846-13114, Iran
Abstract: In recent years, with the expansion of electronic commerce, the number of customer online reviews available on the internet is growing rapidly. Lots of online merchant's websites ask the customers to leave a review about their experiences with the products. The reviews gathered from these websites are rich source of information for product development and marketing. The large volume of reviews that a product receives, make it hard for a potential customer or a manufacturer to read them and know about the customers' preferences, needs and experiences. So, this large volume of text data needs to be summarised using text mining approaches. The approach used in this paper to overcome this problem, is to develop a text summarisation system which extracts and groups the representative sentences of customer reviews. The proposed system, first extracts key topics discussed frequently in the customer review texts in the form of sequences of words. Then, the proposed system, groups the sentences assigned to the key topics, based on their semantic and syntactic similarity, using a genetic clustering algorithm. The evaluation result of the proposed system shows that the technique is effective and outperforms an existing text summarisation method.
Keywords: customer online reviews; text summarisation; text mining; frequent words; word sequences; syntactic similarity; semantic similarity; genetic algorithms; clustering algorithms; electronic commerce; e-commerce; product reviews; product development; marketing; key topics.
International Journal of Business Information Systems, 2013 Vol.13 No.3, pp.343 - 358
Published online: 27 Sep 2013 *Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article