Title: Design and collection of ontological metadata for enhancing interoperability of language resources

Authors: Shunsuke Kozawa; Hitomi Tohyama; Kiyotaka Uchimoto; Shigeki Matsubara; Hitoshi Isahara

Addresses: Graduate School of Information Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, 464-8601, Japan. ' Graduate School of Information Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, 464-8601, Japan. ' National Institute of Information and Communications Technology, 3-5 Hikari-dai, Seika-cho, Soraku-gun, Kyoto, 619-0289, Japan. ' Graduate School of Information Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, 464-8601, Japan. ' Information and Media Center, Toyohashi University of Technology, 1-1 Hibarigaoka, Tempaku, Toyohashi, Aichi, 441-8580, Japan

Abstract: This paper describes the design and implementation of a large scale ontological database named SHACHI, storing detailed metadata on language resources (LRs) in Asian and Western countries. SHACHI has been constructed to enhance the interoperability of LRs, that is, to effectively combine LRs, to systematically store LR metadata, to provide a common infrastructure for web services, to investigate languages, tag sets, and formats compiled in LRs, and to ultimately utilise all these factors for more efficient development of LRs. This ontological metadata database, containing more than 2,000 compiled LRs such as corpora, dictionaries, thesauruses and lexicons, has an aspect of an archive of a large scale metadata of LRs, and its website is now open to the public and accessible to all internet users. SHACHI metadata set is an extended version of OLAC metadata set which conforms to Dublin Core metadata element set. This paper first presents the methodologies to systematically store LR metadata and efficiently LR catalogues, and then explains the structure of the ontological metadata database, as well as the realisation of the LR catalogue search tool. The usefulness of the ontology search function has been investigated.

Keywords: language resources; interoperability; web services; ontological metadata; ontology; corpus; dictionary; thesaurus; lexicon; catalogue search.

DOI: 10.1504/IJKWI.2012.050852

International Journal of Knowledge and Web Intelligence, 2012 Vol.3 No.3, pp.206 - 223

Available online: 06 Dec 2012 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article