Title: OptiPhish: URL-based phishing detection using an optimised feature framework

Authors: Kamaljeet Kaur; Ankit Kumar Jain

Addresses: National Institute of Technology Kurukshetra, 136119, Haryana, India ' National Institute of Technology Kurukshetra, 136119, Haryana, India

Abstract: Phishing, a social engineering cyberattack, poses a significant risk to an individual's privacy and finances. To launch these attacks, criminals trick consumers into giving their sensitive information. Existing techniques lack precise and fast identification of phishing websites. This work suggests a method to build a strong model with high accuracy and low false positive rates for categorising URLs as benign or dangerous. We found optimised features via feature engineering, which helped to eliminate less significant ones. This approach reduced processing time and raised all evaluation criteria by providing optimised features to the machine learning classifier. A balanced dataset prevents underfitting and overfitting, ensuring a reasonable and adequate dataset for the machine learning systems. The experiments show that Random Forest and XGBoost algorithms perform best with 97% accuracy, a 0.8% false positive rate, and a 96.9% F1 score. This suggested work represents significant progress in enhancing cybersecurity.

Keywords: machine learning; ML; website phishing; URL; support vector machine; SVM; extreme gradient boosting; XGBoost; feature engineering; optimisation.

DOI: 10.1504/IJSN.2025.146759

International Journal of Security and Networks, 2025 Vol.20 No.2, pp.93 - 109

Received: 11 Dec 2024
Accepted: 20 Apr 2025

Published online: 16 Jun 2025 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article