Title: A cluster-based approach for distributed anonymisation of vertically partitioned data
Authors: Antonios Xenakis; Zhiyuan Chen; George Karabatis
Addresses: Department of Information Systems, University of Maryland, Baltimore County (UMBC), Baltimore, MD, USA ' Department of Information Systems, University of Maryland, Baltimore County (UMBC), Baltimore, MD, USA ' Department of Information Systems, University of Maryland, Baltimore County (UMBC), Baltimore, MD, USA
Abstract: In modern organisations, data is often spread across different sites, posing challenges for effective analysis. Transferring data to a centralised server may jeopardise privacy and leak sensitive/proprietary information. Therefore, organisations hesitate adopting this solution despite its potential to fully utilise, and analyse the data, for better decision making. Current approaches concentrate on distributed privacy-preserving techniques for data analysis, where data does not leave each site, but incurs substantial computational and communication overhead. This paper focuses on distributed data that is anonymised on site, then merged and sent to a centralised server for analysis. Two new approaches on cluster-based distributed anonymisation are introduced for vertically partitioned data, one based on distributed coordinated anonymisation, and the other based on top-down distributed anonymisation, resulting in low initial onsite anonymisation overhead. Experiments show these approaches preserve data privacy with very minor loss of utility of anonymised data and impose minimal computational overhead.
Keywords: privacy; distributed anonymisation; differential privacy; K-anonymity; cluster-based anonymisation.
DOI: 10.1504/IJWET.2024.143360
International Journal of Web Engineering and Technology, 2024 Vol.19 No.4, pp.397 - 420
Received: 23 Oct 2023
Accepted: 03 Jan 2024
Published online: 16 Dec 2024 *