Authors: Stein L. Tomassen, Darijus Strasunskas
Addresses: Department of Computer and Information Science, Norwegian University of Science and Technology, No-7491 Trondheim, Norway; Department of Industrial Economics and Technology Management, Norwegian University of Science and Technology, No-7491 Trondheim, Norway
Abstract: Search is probably the most frequent activity on the web. Yet it is not effortless, mainly because information resources are heterogeneous. Semantic search is a means to tackle the problem of ambiguity. In this paper, we analyse the process of constructing the semantic-linguistic Feature Vectors (FVs) used in our semantic search approach. These FVs are built from domain semantics encoded in an ontology and enhanced with relevant terminology from web documents. Since FVs are the central building blocks of the approach, we investigate their quality. We take a closer look at the process of FV construction and the impact of the chosen techniques on the quality of FVs. We report on a set of laboratory experiments and analyse the aspects affecting FV quality and FV construction error rates.
Keywords: semantic search; FVC; feature vector construction; evaluation; ontology; intrinsic quality; domain semantics.
International Journal of Metadata, Semantics and Ontologies, 2010 Vol.5 No.2, pp.120 - 133
Received: 09 Jun 2009
Accepted: 29 Dec 2009
Published online: 16 May 2010