Data transformation techniques for preserving privacy in distance-based mining algorithms
Online publication date: Thu, 23-Oct-2014
by Mohammad Ali Kadampur; D.V.L.N. Somayajulu
International Journal of Data Mining, Modelling and Management (IJDMMM), Vol. 6, No. 3, 2014
Abstract: Dissimilarity calculation between two objects is one of the important knowledge-gathering methods in cognition science. Many data mining algorithms use dissimilarity computation to cluster data in order to discover intra-relations, inter-relations, and outliers in the data. The majority of these algorithms use Euclidean distance as the dissimilarity criterion. In this paper, signal transformation functions, with their orthogonality property and energy compaction features, are explored for transforming the data. The data transformation scheme treats the entire data set as a single entity. The proposed scheme is designed so that it can also be used in non-Euclidean spaces by means of a distance mapping algorithm. Existing randomisation approaches to data transformation maintain only the distributions and do not maintain the Euclidean distance between records. The proposed methods are superior to the existing methods in terms of run-time complexity, O(n), and preservation of the distance between individual data points.
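The distance-preservation property that the abstract relies on can be illustrated with a small sketch. This is not the authors' scheme, only a generic demonstration under stated assumptions: the orthonormal DCT-II is chosen here as one example of an orthogonal signal transform, it is applied along the attribute axis of each record, and the helper names (dct_matrix, pairwise_distances) and the random test data are hypothetical. Because an orthogonal matrix C satisfies C Cᵀ = I, the Euclidean distance between any two transformed records equals the distance between the originals.

```python
import numpy as np

def dct_matrix(d):
    """Return the d x d orthonormal DCT-II matrix (illustrative choice of transform)."""
    k = np.arange(d).reshape(-1, 1)   # frequency index (rows)
    n = np.arange(d).reshape(1, -1)   # sample index (columns)
    C = np.sqrt(2.0 / d) * np.cos(np.pi * (2 * n + 1) * k / (2 * d))
    C[0, :] /= np.sqrt(2.0)           # rescale the first row so that C @ C.T = I
    return C

def pairwise_distances(X):
    """Euclidean distance between every pair of rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

# Hypothetical data: 6 records with 4 numeric attributes each.
rng = np.random.default_rng(0)
X = rng.normal(size=(6, 4))

C = dct_matrix(X.shape[1])
X_t = X @ C.T                         # transform each record (row) with the DCT

# Both quantities should be at the level of floating-point round-off.
orthogonality_error = np.abs(C @ C.T - np.eye(X.shape[1])).max()
distance_error = np.abs(pairwise_distances(X) - pairwise_distances(X_t)).max()
print(orthogonality_error, distance_error)
```

Any distance-based algorithm run on X_t therefore sees the same pairwise Euclidean distances as it would on X, while the attribute values themselves are changed. How much privacy such a transform provides on its own is a separate question that this sketch does not address.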