Title: Multi-document summarisation using genetic algorithm-based sentence extraction

Authors: A. Kogilavani, P. Balasubramanie

Addresses: Department of Computer Science and Engineering, Kongu Engineering College, Perundurai, Erode 638052, Tamil Nadu, India. ' Department of Computer Science and Engineering, Kongu Engineering College, Perundurai, Erode 638052, Tamil Nadu, India

Abstract: Automatic document summarisation is the process of generating a summary of the original documents with the aim of shorter reading time. Sentence extraction is a widely adopted document summarisation technique by which relevant sentences are extracted from documents. The proposed system generates optimal summary by Genetic Algorithm-based sentence extraction strategy. Based on individual word weight and other sentence-specific features sentence score is calculated. To produce optimal summary fitness function is used. Machine-generated summaries are compared against human summaries using different measures. The experiment results show that the proposed approach is efficient and outperforms the existing approach.

Keywords: linguistic analysis; word weight; sentence specific features; sentence extraction; sentence score; document summarisation; optimal summary; genetic algorithms; machine-generated summaries.

DOI: 10.1504/IJCAT.2011.041653

International Journal of Computer Applications in Technology, 2011 Vol.40 No.4, pp.246 - 253

Published online: 28 Jul 2011 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article