Title: Machine learning algorithm for lung cancer classification using ADASYN with standard random forest

Authors: J. Viji Gripsy; T. Divya

Addresses: PSGR Krishnammal College for Women, Peelamedu, Coimbatore, Tamil Nadu, India ' PSGR Krishnammal College for Women, Peelamedu, Coimbatore, Tamil Nadu, India

Abstract: Lung cancer is one type of cancer that develops in the lungs. Early identification of lung cancer symptoms may lead to a successful treatment. The dataset indicates the presence of duplicate characteristics, as well as an imbalanced classification, making lung cancer classification a challenging task. This study presents a novel approach that combines the ADASYN with the standard random forest (ASRF) model to enhance the efficacy of lung cancer dataset identification. The ASRF, as described, offers interpretable outcomes by using feature significance, hence providing significant insights into the aspects that contribute to judgments on the classification of lung cancer. The classification algorithm is used to ascertain the existence or absence of lung cancer in a certain patient. When comparing the proposed ASRF with the current SVM, MLP, RF and GB, compared to other existing methods, the ASRF technique achieved 93.5% precision, 94.7% recall, 94.1% F-measure, and 94% accuracy.

Keywords: lung cancer; LC; RF ASRF; MLP; support vector machine; SVM; GB.

DOI: 10.1504/IJDMB.2026.150961

International Journal of Data Mining and Bioinformatics, 2026 Vol.30 No.1/2, pp.1 - 17

Received: 30 Jan 2024
Accepted: 15 May 2024

Published online: 06 Jan 2026 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article