Authors: Panagiotis Antonellis, Christos Makris
Addresses: Computer Engineering and Informatics Department, Patras University, Rio 26500, Greece. ' Computer Engineering and Informatics Department, Patras University, Rio 26500, Greece
Abstract: Information-filtering systems constitute a critical component of modern information-seeking applications. As the number of users grows and the amount of information available becomes even bigger, it is imperative to employ scalable and efficient representation and filtering techniques. Typically, the use of eXtensible Markup Language (XML) representation entails profile representation with the use of the XPath query language and the employment of efficient heuristic techniques for constraining the complexity of the filtering mechanism. In this paper, we propose an efficient technique for matching user profiles that is based on the use of holistic twig-matching algorithms and is more effective, in terms of time and space complexities, in comparison with previous techniques. The proposed algorithm is able to handle order matching of user profiles, while its main positive aspect is the envisaging of a representation based on Prufer sequences that permits the effective investigation of node relationships. Experimental results showed that the proposed algorithm outperforms the previous algorithms in XML filtering both in space and time aspects.
Keywords: eXtensible Markup Language; XML filtering; information filtering; string representation; twig matching; order matching; user profiles.
International Journal of Web Engineering and Technology, 2008 Vol.4 No.1, pp.70 - 94
Published online: 05 Dec 2007 *Full-text access for editors Access for subscribers Purchase this article Comment on this article