A novel method for clustering tweets in Twitter
Online publication date: Sun, 05-Apr-2015
by Shanmugam Poomagal; Palanisamy Visalakshi; Thiagarajan Hamsapriya
International Journal of Web Based Communities (IJWBC), Vol. 11, No. 2, 2015
Abstract: Twitter is a popular social networking service for posting short messages that may be useful to other users. Researchers have analysed these messages in different ways. This paper proposes a technique for clustering tweets on Twitter. The aim of the clustering is to identify groups of similar tweets, which in turn helps identify user communities. These communities can be recommended to advertisers on Twitter by matching each community's topic of interest with the advertiser's field. Suffix Tree Clustering (STC) is a core web document clustering algorithm that groups similar documents into clusters by constructing a suffix tree. We use STC together with semantic similarity among the posted tweets to identify topics of interest. The proposed method is compared with the STC and Lingo algorithms using intra-cluster distance and inter-cluster distance. Results show that the proposed method outperforms the existing methods, with a 10.59% reduction in intra-cluster distance and a 44.99% increase in inter-cluster distance.
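The two evaluation measures named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact definitions, so everything here is an assumption: tweets are represented as bag-of-words counts, distance is one minus cosine similarity, intra-cluster distance is the mean pairwise distance within clusters, and inter-cluster distance is the mean pairwise distance across clusters. The function names and sample tweets are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations


def vectorize(text):
    """Bag-of-words token counts (assumed representation, not the paper's)."""
    return Counter(text.lower().split())


def cosine_distance(a, b):
    """1 - cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return 1.0 - dot / norm if norm else 1.0


def intra_cluster_distance(clusters):
    """Mean pairwise distance between tweets that share a cluster."""
    dists = [cosine_distance(x, y)
             for c in clusters
             for x, y in combinations(c, 2)]
    return sum(dists) / len(dists) if dists else 0.0


def inter_cluster_distance(clusters):
    """Mean pairwise distance between tweets in different clusters."""
    dists = [cosine_distance(x, y)
             for c1, c2 in combinations(clusters, 2)
             for x in c1
             for y in c2]
    return sum(dists) / len(dists) if dists else 0.0


# Hypothetical clustering of four made-up tweets into two topics.
clusters = [
    [vectorize("new phone camera review"), vectorize("phone camera test")],
    [vectorize("football match tonight"), vectorize("football match score")],
]
intra = intra_cluster_distance(clusters)
inter = inter_cluster_distance(clusters)
print(f"intra={intra:.3f} inter={inter:.3f}")
```

Under these assumed definitions, a better clustering has a lower intra-cluster distance (tweets in a cluster are close together) and a higher inter-cluster distance (clusters are well separated), which matches the direction of the improvements reported in the abstract.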