Title: Repairing broken RDF links in the web of data

Authors: Mohammad Pourzaferani; Mohammad Ali Nematbakhsh

Addresses: Department of Computer Engineering, University of Isfahan, P.O. Box 8174673441, Isfahan, Iran ' Department of Computer Engineering, University of Isfahan, P.O. Box 8174673441, Isfahan, Iran

Abstract: In the web of data, linked datasets are changed over time. These changes include updating on features and address of entities. The address change in RDF entities causes their corresponding links to be broken. Broken link is one of the major obstacles that the web of data is facing. Most approaches to solve this problem attempt to fix broken links at the destination point. These approaches have two major problems: a single point of failure; and reliance on the destination data source. In this paper, we introduce a method for fixing broken links which is based on the source point of links, and discover the new address of the detached entity. To this end, we introduce two datasets, which we call 'superior' and 'inferior'. Through these datasets, our method creates an exclusive graph structure for each entity that needs to be observed over time. This graph is used to identify and discover the new address of the detached entity. Afterward, the most similar entity, which is candidate for the detached entity, is deduced and suggested by the algorithm. The proposed model is evaluated with DBpedia dataset within the domain of 'person' entities. The result shows that most of the broken links, which had referred to a 'person' entity in DBpedia, had been fixed correctly.

Keywords: broken links; link integrity; linked data; web of data; resource description framework; RDF; DBpedia; link repair.

DOI: 10.1504/IJWET.2013.059106

International Journal of Web Engineering and Technology, 2013 Vol.8 No.4, pp.395 - 411

Published online: 31 Mar 2014 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article