Title: A robust pipelined granular approach for enhanced efficiency in document text summarisation using hierarchical categorisation and feature extraction techniques
Authors: Krishna Dheeravath; S. Jessica Saritha
Addresses: Department of Computer Science and Engineering, Jawaharlal Nehru Technological University, Ananthapur, Andhra Pradesh – 515002, India ' Department of Computer Science and Engineering, Jawaharlal Nehru Technological University, Ananthapur, Andhra Pradesh – 515002, India
Abstract: The exponential growth of digital textual data presents challenges for traditional text summarisation methods, especially due to natural language complexity and high dimensionality. This work proposes a pipelined granular approach to enhance summarisation efficiency by addressing semantic ambiguity, high dimensionality, and scalability. The methodology comprises two phases: optimised feature selection and advanced rule-based categorisation. In the first phase, a granular approach using a discernibility matrix and iterative optimisation reduces dimensionality while retaining key features for accuracy. The second phase employs hierarchical categorisation to generate precise rules, ensuring adaptability and scalability across diverse domains. Experimental benchmarking and complexity analysis validate the approach, demonstrating notable improvements in summarisation accuracy and computational efficiency. The integration of granular feature extraction and hierarchical categorisation effectively tackles key challenges, making the approach suitable for large-scale document analytics across various applications.
Keywords: feature extraction; text summarisation; hierarchical categorisation; natural language processing; NLP; large-scale document analytics.
DOI: 10.1504/IJAHUC.2025.147752
International Journal of Ad Hoc and Ubiquitous Computing, 2025 Vol.49 No.4, pp.225 - 232
Received: 21 Aug 2024
Accepted: 26 Nov 2024
Published online: 30 Jul 2025 *