Title: Integrating geolocation intelligence with ensemble machine learning models for enhanced darknet traffic classification
Authors: Ngaira Mandela; Nilay R. Mistry
Addresses: School of Digital Forensics and Cyber Security, National Forensic Sciences University, Gandhinagar, India ' School of Digital Forensics and Cyber Security, National Forensic Sciences University, Gandhinagar, India
Abstract: This study presents an innovative approach to darknet traffic classification, combining advanced machine learning techniques with hybrid LASSO-random forest (HLRF) feature selection and IP geolocation mapping. We propose a new ensemble model that significantly outperforms traditional classifiers, achieving an accuracy of 96.86% and an F1-score of 96.12%. Our research utilises an enhanced version of the CIC-Darknet2020 dataset, augmented with additional darknet traffic collected over a six-month period. The HLRF selector is employed to identify the most relevant features, improving the model's efficiency and interpretability. Furthermore, we incorporate IP geolocation mapping to provide insights into the global distribution of darknet activities. Our findings demonstrate the effectiveness of our ensemble method with HLRF feature selection in capturing complex darknet traffic patterns and highlight the challenges in geographical attribution due to sophisticated anonymisation techniques. This work contributes to the field of cybersecurity by offering an improved method for darknet traffic classification and providing a deeper understanding of the global nature of darknet operations.
Keywords: darknet traffic classification; darknet; machine learning; XGBoost; neural networks; darkweb.
DOI: 10.1504/IJICS.2025.148111
International Journal of Information and Computer Security, 2025 Vol.27 No.4, pp.515 - 535
Received: 02 Sep 2024
Accepted: 16 Jan 2025
Published online: 25 Aug 2025 *