MapReduce fuzzy C-means ensemble clustering with gentle AdaBoost for big data analytics Online publication date: Tue, 17-Aug-2021
by K.M. Padmapriya; B. Anandhi; M. Vijayakumar
International Journal of Business Intelligence and Data Mining (IJBIDM), Vol. 19, No. 2, 2021
Abstract: Big data clustering is one of the significant processes employed in numerous application domains. Existing clustering algorithms do not cope with large-scale data, resulting in higher false positive rate. In order to cluster such large datasets with higher accuracy, MapReduce gradient descent gentle AdaBoost clustering (MGDGAC) technique is proposed. The MGDGAC technique designs MapReduce fuzzy C-means (MFCM) clustering where the large dataset is initially subdivided into a number of chunks which are executed in parallel on different nodes to effectively perform clustering processes with minimal time. The data with larger membership value are grouped in the cluster with help of mappers. Then, reducer in MFCM clustering re-estimates the centroid value and iteratively fed to the mapper again until it attains a particular iteration and groups the similar data together. Finally, MGDGAC technique applies gentle AdaBoost with intention of reducing the training error of large data clustering.
Online publication date: Tue, 17-Aug-2021
If you are not a subscriber and you just want to read the full contents of this article, buy online access here.Complimentary Subscribers, Editors or Members of the Editorial Board of the International Journal of Business Intelligence and Data Mining (IJBIDM):
Login with your Inderscience username and password:
Want to subscribe?
A subscription gives you complete access to all articles in the current issue, as well as to all articles in the previous three years (where applicable). See our Orders page to subscribe.
If you still need assistance, please email email@example.com