Title: Persons, GLAM institutes and collections: an analysis of entity linking based on the COURAGE registry

Authors: Ghazal Faraj; András Micsik

Addresses: Eötvös Loránd University, Pázmány Péter stny. 1/C., 1117, Budapest, Hungary ' Institute for Computer Science and Control (SZTAKI), Eötvös Loránd Research Network (ELKH), Lágymányosi u. 11., Budapest, Hungary

Abstract: It is an important task to connect encyclopaedic knowledge graphs by finding and linking the same entity nodes. Various available automated linking solutions cannot be applied in situations where data is sparse, private or a high degree of correctness is expected. Wikidata has grown into a leading linking hub collecting entity identifiers from various registries and repositories. To get a picture of connectability, we analysed the linking methods and results between the COURAGE registry and Wikidata, VIAF, ISNI and ULAN. This paper describes our investigations and solutions while mapping and enriching entities in Wikidata. Each possible mapped pair of entities received a numeric score of reliability. Using this score-based matching method, we tried to minimise the need for human decisions, hence we introduced the term human decision window for the mappings where neither acceptance nor refusal can be made automatically and safely. Furthermore, Wikidata has been enriched with related COURAGE entities and bi-directional links between mapped persons, organisations, collections, and collection items. We also describe the findings on coverage and quality of mapping among the above mentioned authority databases.

Keywords: linked data; cultural heritage; link discovery; entity linking; authority data; metadata quality; Wikidata; VIAF; ISNI; ULAN.

DOI: 10.1504/IJMSO.2021.10040247

International Journal of Metadata, Semantics and Ontologies, 2021 Vol.15 No.1, pp.39 - 49

Received: 10 Jun 2020
Accepted: 08 Jan 2021

Published online: 10 Aug 2021 *

Full-text access for editors Access for subscribers Purchase this article Comment on this article