Title: Keyword extraction method based on complex network

Authors: Zhen Yang; Huajian Gong; Weiyun Ma; Chunlin Yin; Jie Li; Siyang Liu; Hua Zhu; Na Zhao

Addresses: Electric Power Research Institute of Yunnan Power Grid, Kunming, 650217, China ' School of Computer Science and Technology, Xidian University, Xian, 710000, China ' Key Laboratory in Software Engineering of Yunnan Province, School of Software, Yunnan University, Kunming, 650091, China ' Electric Power Research Institute of Yunnan Power Grid, Kunming, 650217, China ' Electric Power Research Institute of Yunnan Power Grid, Kunming, 650217, China ' Electric Power Research Institute of Yunnan Power Grid, Kunming, 650217, China ' Electric Power Research Institute of Yunnan Power Grid, Kunming, 650217, China ' Electric Power Research Institute of Yunnan Power Grid, Kunming, 650217, China; Key Laboratory in Software Engineering of Yunnan Province, School of Software, Yunnan University, Kunming, 650091, China

Abstract: Keyword extraction has a wide range of applications in the field of natural language processing, and many research results on keyword extraction exist at present. Among them, keyword extraction methods based on complex networks do not require extensive training data in advance and are simple to implement. This paper analyses and realises the application of complex network theory in the Chinese news keyword extraction task. We collected and analysed 2,061 news texts from the Sina website as experimental data, and found that the number of keywords extracted affects the performance of the keyword extraction method. We also compared the performance of our algorithm with other keyword extraction algorithms. The experiments verify the accuracy and effectiveness of complex networks in keyword extraction from Chinese news texts. Based on PyQt and TextRank, a news text keyword analysis platform is constructed to realise the visualisation of the text network and the extraction of Chinese news keywords.
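
The general idea of network-based keyword extraction mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the text has already been segmented into tokens (for Chinese news, by a word segmenter of your choice), builds an undirected word co-occurrence network, and ranks nodes with a TextRank-style iteration. The function names, the window size of 2, the damping factor of 0.85 and the sample tokens are all illustrative assumptions, not values taken from the paper.

```python
from collections import defaultdict


def build_cooccurrence_graph(tokens, window=2):
    """Link words that occur within `window` positions of each other.

    Assumption: `tokens` is an already segmented, stop-word-filtered list.
    """
    graph = defaultdict(set)
    for i, word in enumerate(tokens):
        for other in tokens[i + 1 : i + 1 + window]:
            if other != word:
                graph[word].add(other)
                graph[other].add(word)
    return graph


def textrank(graph, damping=0.85, iterations=50):
    """Score nodes with an unweighted PageRank-style iteration."""
    scores = {node: 1.0 for node in graph}
    for _ in range(iterations):
        new_scores = {}
        for node in graph:
            # Each neighbour shares its score equally among its own links.
            rank = sum(scores[nb] / len(graph[nb]) for nb in graph[node])
            new_scores[node] = (1 - damping) + damping * rank
        scores = new_scores
    return scores


def extract_keywords(tokens, top_k=3, window=2):
    """Return the `top_k` highest-scoring words (ties broken alphabetically)."""
    graph = build_cooccurrence_graph(tokens, window)
    scores = textrank(graph)
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_k]


if __name__ == "__main__":
    # Hypothetical, pre-segmented sample text.
    sample = ["power", "grid", "fault", "power", "grid",
              "repair", "power", "outage"]
    print(extract_keywords(sample, top_k=3))
```

The `top_k` parameter corresponds to the number of keywords extracted, which the abstract reports as a factor that affects extraction performance, so in practice it would be tuned against labelled data rather than fixed.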

Keywords: complex network; keyword extraction; text network; TextRank.

DOI: 10.1504/IJICT.2024.142110

International Journal of Information and Communication Technology, 2024 Vol.25 No.4, pp.323 - 335

Received: 23 Mar 2022
Accepted: 25 Jul 2022

Published online: 07 Oct 2024
