Title: Community discovery algorithm under big data: taking microblog as an example

Authors: Hongwei Qi; Haiyan Bai

Addresses: Computer School, Jining Normal University, Ulanqab, Inner Mongolia 012000, China ' Computer School, Jining Normal University, Ulanqab, Inner Mongolia 012000, China

Abstract: Microblog has become a popular social media because of its short text and timely release, and its impact on society has gradually increased. In order to study the behaviour of microblog users, this paper introduced two algorithms for microblog network community division, which were community discovery algorithm based on density peak clustering and community discovery algorithm based on similarity. Then the two algorithms were simulated using MATLAB software. The data used in the experiment included the artificial network generated by LFR tool and the following information data of different users collected by taking the API interface of the microblog of a student from Computer School of Jining Normal University as the starting point by crawlers. The results demonstrated that the normalised mutual information (NMI) and the density of the community structure obtained by the two algorithms decreased, and the conductivity increased with the expansion of the scale of microblog network, and the community structure obtained by the similarity-based algorithm had higher NMI and density and lower conductivity under the same scale of micrblog network. In conclusion, the similarity-based algorithm can divide microblog network better.

Keywords: microblog; density peak; similarity; community division.

DOI: 10.1504/IJWBC.2021.114461

International Journal of Web Based Communities, 2021 Vol.17 No.2, pp.88 - 98

Received: 17 Mar 2020
Accepted: 06 Apr 2020

Published online: 22 Apr 2021 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article