Authors: Vassilios S. Verykios, Alexandros Karakasidis, Vassilios K. Mitrogiannis
Addresses: Department of Computer and Communication Engineering, School of Engineering, University of Thessaly, Glavani 37 & 28th Octovriou Str., GR 38221, Volos, Greece; Department of Computer and Communication Engineering, School of Engineering, University of Thessaly, Glavani 37 & 28th Octovriou Str., GR 38221, Volos, Greece; SingularLogic S.A., Al. Panagouli & Siniosoglou Str., N. Ionia, GR 14234, Athens, Greece
Abstract: Privacy-preserving record linkage is an important task because of the sensitive nature of personal data. Its main goal is to match records across the data sets or databases of different organisations without revealing competitive or personal information to non-owners. Several methods and protocols have been proposed to accomplish this. In this work, we propose a methodology for preserving privacy in various record linkage approaches, and we implement, examine and compare four pairs of privacy-preserving record linkage methods and protocols. Two of these protocols use n-gram based similarity comparison techniques, the third uses the well-known edit distance and the fourth implements the Jaro-Winkler distance metric. All of the protocols are enhanced by private key cryptography and hash encoding. This paper also presents a blocking scheme as an extension to the privacy-preserving record linkage methodology. Our comparison is backed by an extensive experimental evaluation that demonstrates the performance achieved by each of the proposed protocols.
Keywords: data integration; privacy preserving record linkage; cryptography; personal data; privacy protection; private key cryptography; hash encoding; blocking schemes.
International Journal of Data Mining, Modelling and Management, 2009 Vol.1 No.2, pp.206 - 221
Published online: 26 May 2009
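
Illustration: the abstract mentions n-gram based similarity comparison combined with private key cryptography and hash encoding. The following is a minimal sketch of that general idea, not the protocols evaluated in the paper. It assumes two parties share a secret key, encode each character bigram with HMAC-SHA256, and compare the encoded sets with the Dice coefficient. The function names `bigrams`, `encode` and `dice`, the padding character and the choice of HMAC-SHA256 are assumptions made for this example.

```python
import hashlib
import hmac


def bigrams(value: str) -> set[str]:
    """Return the set of character bigrams of a normalised, padded string."""
    # Normalise case and surrounding whitespace, then pad both ends with "_"
    # so that the first and last characters also contribute a bigram.
    padded = f"_{value.strip().lower()}_"
    return {padded[i:i + 2] for i in range(len(padded) - 1)}


def encode(value: str, key: bytes) -> set[str]:
    """Replace each bigram with its keyed hash under the shared secret key."""
    return {
        hmac.new(key, gram.encode("utf-8"), hashlib.sha256).hexdigest()
        for gram in bigrams(value)
    }


def dice(a: set[str], b: set[str]) -> float:
    """Dice coefficient of two sets: 2 * |a & b| / (|a| + |b|)."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


if __name__ == "__main__":
    shared_key = b"example-shared-secret"  # hypothetical key known to both data owners
    left = encode("Smith", shared_key)
    right = encode("Smyth", shared_key)
    print(f"similarity on encoded bigrams: {dice(left, right):.3f}")
```

Because both sides apply the same keyed hash, equal bigrams map to equal digests, so the similarity computed on the encoded sets equals the similarity on the plain bigram sets. Hashing individual bigrams in this way is a simplification and is not hardened against frequency analysis.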