Title: Bringing taxonomic structure to large digital libraries

Authors: David Sanchez, Antonio Moreno

Addresses: Department of Computer Science and Mathematics, University Rovira i Virgili, Avda. Paisos Catalans, 26, 43007, Tarragona, Spain. ' Department of Computer Science and Mathematics, University Rovira i Virgili, Avda. Paisos Catalans, 26, 43007, Tarragona, Spain

Abstract: Digital libraries are invaluable repositories of information. However, in many situations, their size makes it difficult to access the desired resource. In this paper, we present an automatic, unsupervised, domain-independent and scalable approach for structuring the resources available in a certain electronic repository for a particular domain. The system automatically detects and extracts the main topics related to the desired domain, offering a taxonomical structure. This result is complemented by the library|s search engine, offering an integrated tool for accessing resources as an automatically composed directory service. The system has been tested for several digital libraries and domains of knowledge, providing good quality results in all cases.

Keywords: taxonomy learning; digital libraries; web mining; web search engines; resource indexing; knowledge acquisition; ontologies; semantic web; electronic repositories; directory services.

DOI: 10.1504/IJMSO.2007.016805

International Journal of Metadata, Semantics and Ontologies, 2007 Vol.2 No.2, pp.112 - 122

Published online: 23 Jan 2008 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article