Most recent issue published online in the International Journal of Knowledge and Web Intelligence.

An enterprise perspective of web content analysis research: a strategic road-map

Ramesh S. Wadawadagi — 2019-11-15T23:20:50-05:00

An enterprise perspective of web content analysis research: a strategic road-map
Ramesh S. Wadawadagi; Veerappa B. Pagi
International Journal of Knowledge and Web Intelligence, Vol. 6, No. 2 (2019) pp. 51 - 88
Participating in social networks to create and share opinion content has become a ubiquitous part of our everyday life. Understanding social media content is at the top of the agenda for many firms today. Business analysts and quants are trying harder to discover ways in which enterprises can be benefited by comprehending the content generated through social media such as Facebook, Wikipedia, Blogs, Youtube and Twitter. This pioneering work may aid business analysts and data scientists with insights into ways to adapt the stable content analysis (CA) techniques to analyse web page contents containing user-generated data. In this paper, we develop an integrated enterprise framework that defines web content analysis (WCA) as a comprehensive and functional layered architecture, and consequently, this framework can be used in various levels of the decision-making process. Further, a four dimensional view of comparative analysis of various WCA systems is presented. Based on the critical analysis of the literature survey, the study explores many open and challenging issues for further research in this domain.

Determining the semantic orientation of opinion words using typed dependencies for opinion word senses and SentiWordNet scores from online product reviews

K.C. Ravi Kumar — 2019-11-15T23:20:50-05:00

Determining the semantic orientation of opinion words using typed dependencies for opinion word senses and SentiWordNet scores from online product reviews
K.C. Ravi Kumar; D. Teja Santosh; B. Vishnu Vardhan
International Journal of Knowledge and Web Intelligence, Vol. 6, No. 2 (2019) pp. 89 - 105
Opinion words express the information regarding the like and dislike of a user on the target entities such as products and product aspects present in the online reviews. The polarised information collected from the reviews is analysed by calculating the orientation of the adjectives. The synonymy relation graph is a way to determine the orientation of the adjectives present in the product reviews dataset. It considers the minimum path length between the adjectives under analysis using WordNet synsets. The synonymy relation graph cannot determine the orientations of all the opinion words present in the dataset. In order to evaluate opinion orientation of all the adjectives from the dataset, the synonymy relation graph of WordNet is to be replaced with the SentiWordNet scores of the opinion words. These scores are provided to the opinion words by finding the contextual clues surrounding the opinion words to disambiguate their sense. The contextual clues are finalised based on the typed dependencies grammatical relations. The distance between the opinion word and the context insensitive seed term (good/bad) is computed by calculating the difference between these scores. This paper addresses advantages of using SentiWordNet scores. This improves the accuracy of the determined opinion word orientations.

MOSSA: a morpho-semantic knowledge extraction system for Arabic information retrieval

Nadia Soudani — 2019-11-15T23:20:50-05:00

MOSSA: a morpho-semantic knowledge extraction system for Arabic information retrieval
Nadia Soudani; Ibrahim Bounhas; Yahya Slimani
International Journal of Knowledge and Web Intelligence, Vol. 6, No. 2 (2019) pp. 106 - 141
In this paper, we propose to exploit different morpho-semantic resources to enhance Arabic information retrieval (IR). We use standardised LMF Arabic dictionaries and Arabic corpora. Our goal by this communication is to take advantage of the different existing resources to extract useful knowledge for Arabic IR. We equally study the impact of the Arabic morphology on IR effectiveness. Several query expansion strategies are carried based on morphological, semantic and morpho-semantic relations. In addition, combining such knowledge is also studied and evaluated. We experiment the effect of short diacritics and part of speech (POS) disambiguation and tagging in the indexing step. A graph-based representation is used to formalise knowledge resources graph-based representation. This latter represents a powerful formalism to express semantics of texts and to support NLP tools and applications as IR. Several experimental comparisons are handled between the different used knowledge resources and the different carried IR approaches.