A paragraph-inserted word salad filtering algorithm Online publication date: Wed, 31-Dec-2014
by Ok-Ran Jeong; Won Kim
International Journal of Web and Grid Services (IJWGS), Vol. 8, No. 1, 2012
Abstract: Social spam is one type of spam which includes spamming the members of social websites by sending or posting unwanted ads or baiting them to visit particular websites. Word salad in turn is one type of social spam which aims at baiting people to visit particular websites, such as blogs, personal profiles, third-party applications built on social networking websites, etc. A word salad is created by inserting either words or paragraphs within a normal document, where the inserted words or paragraphs have no relevance to the document. The purpose of a word salad is to fool the search engines into assigning high ranks to the document. In this paper, we discuss an algorithm that filters (detects) paragraph-inserted word salads. The algorithm is based on the Singular Value Decomposition (SVD) method and, based on experiments, shows up to 81.3% accuracy.
Existing subscribers:
Go to Inderscience Online Journals to access the Full Text of this article.
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Web and Grid Services (IJWGS):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email subs@inderscience.com