Forthcoming Articles
International Journal of Computer Applications in Technology

Forthcoming articles have been peer-reviewed and accepted for publication but are pending final changes, are not yet published and may not appear here in their final order of publication until they are assigned to issues. Therefore, the content conforms to our standards but the presentation (e.g. typesetting and proof-reading) is not necessarily up to the Inderscience standard. Additionally, titles, authors, abstracts and keywords may change before publication. Articles will not be published until the final proofs are validated by their authors.
Forthcoming articles must be purchased for the purposes of research, teaching and private study only. These articles can be cited using the expression "in press". For example: Smith, J. (in press). Article Title. Journal Title.
Articles marked with this shopping trolley icon are available for purchase - click on the icon to send an email request to purchase.
Online First articles are also listed here. Online First articles are fully citeable, complete with a DOI. They can be cited, read, and downloaded. Online First articles are published as Open Access (OA) articles to make the latest research available as early as possible.
Register for our alerting service, which notifies you by email when new issues are published online.
International Journal of Computer Applications in Technology (33 papers in press) Regular Issues
Abstract: This paper describes the application direction of DMVI processing technology and the acquisition and post-processing of ultra-high-definition quality data, explores the application of DMVI processing technology in image analysis, proposes a method for obtaining ultra-high definition quality video data, and discusses the reconstruction of ultra-high-definition quality video. According to the research results, satisfaction with the introduction of the five-dimensional light field function algorithm and CV technology reached over 21%; at 4K resolution, the processing time of the five-dimensional light field was 1.05 Keywords: digital media video image; image data processing; computer vision; ultra clear picture quality image. DOI: 10.1504/IJCAT.2025.10073493
Abstract: To elevate the quality of image enhancement for smart home product layout scenes and expedite processing times, a study focused on virtual reality-based enhancement of these scenes has been undertaken. Initially, a virtual reality framework is employed to create an indoor environment for smart homes, with the ISGSA algorithm model utilized to generate this environment. Subsequently, the attributes of each constituent element are amalgamated and fed into a generator to produce a novel indoor scene. Ultimately, a conditional generative adversarial network is devised to formulate a composite loss function, integrating channel color loss, structural feature loss, and smoothness loss. This loss function is instrumental in achieving image enhancement. Experimental findings reveal that the proposed method attains an average information entropy of 8.846, with an image enhancement processing duration of merely 3.9 s. Keywords: virtual reality; smart home products; layout scene; image enhancement; ISGSA algorithm; attention learning module; channel colour loss. DOI: 10.1504/IJCAT.2025.10073932
Abstract: This paper proposes an improved moving target tracking algorithm (TTA) based on the mean-shift (MS) method, which is suitable for complex industrial environments. The improved algorithm introduces the YOLO (You Only Look Once) model for moving target detection and uses its results as tracking input. In addition, the algorithm also introduces a twin network (SN) to extract the deep features of the target for re-identification after occlusion. In order to further improve the tracking stability, a Kalman filter is introduced to predict the next motion state of the target. Stability analysis shows that the algorithm achieves the best multi-target tracking accuracy (MOTA) index in various complex environments, outperforming other tracking methods and showing good multi-target tracking stability. In summary, the algorithm successfully overcomes the limitations of the traditional MS method and provides a novel solution for moving target tracking in industrial environments. The algorithm has important practical value and provides a valuable reference for future research on moving target tracking in dynamic and complex environments. Keywords: moving target tracking; mean-shift algorithm; YOLO model; Siamese network; Kalman Filter. DOI: 10.1504/IJCAT.2025.10074104
Abstract: To overcome the limitations of current mining algorithms and improve the effectiveness of resource mining, this paper proposes a multimodal teaching resource association resource mining algorithm for MOOC ideological and political learning. Firstly, the features of text, image, and audio modalities are extracted using the bag of words model, VGG16 network, and Mel frequency cepstral coefficient method. Secondly, the feature vectors of each modality are concatenated and fused. Due to the high dimensionality after fusion, principal component analysis is used for dimensionality reduction. Finally, feature fusion, dimensionality reduction, and association rule mining are used to optimize the association of multimodal teaching resources, and dynamic association rules are introduced to adapt to the dynamic needs of students' learning process, thereby improving the effectiveness of MOOC ideological and political learning. The experimental results show that the mining results of the proposed algorithm have diversity and strong correlation with the target topic. Keywords: MOOC; ideological and political education; multimodal; teaching resources; resource mining; principal component analysis; association rules. DOI: 10.1504/IJCAT.2025.10074404
Abstract: This paper studies a "Road to Waterway" model for medium and long-distance cargo transportation with consideration of transport efficiency. First, addressing the time-sensitive requirements of high-value-added cargo transportation faced by multimodal operators, a "Road to Waterway" model for medium and long-distance transportation is developed. Second, through cost analysis that quantifies various expenses while establishing objective functions and constraints, the model ensures reasonable transportation mode selection, transit connections, and flow balance. Finally, employing genetic algorithms to generate initial solutions and maintain population diversity, combined with ant colony algorithm's positive feedback mechanism for optimal solution search, the model demonstrates significantly improved solving efficiency and time performance. Experimental results indicate a stable on-time arrival rate exceeding 97.7% and cost savings reaching 9.3%. Keywords: transportation efficiency; medium to long distance; freight transportation; ‘Road to Waterway’ model. DOI: 10.1504/IJCAT.2025.10074405
Abstract: In actual manufacturing environments, electronic components often face occlusion problems, which makes it difficult for traditional point cloud segmentation methods to estimate the pose of objects accurately. To address this challenge, this paper introduces the multi-scale feature learning capability provided by PointNet++ to extract deep collective feature information in local areas of different scales and understand the overall morphology of components in a global context. According to experimental analysis, under the same occlusion level, PointNet++ outperforms the PointNet model, the RANSAC (Random Sample Consensus) algorithm, and the voxelisation method Point-Voxel CNN in terms of segmentation accuracy. The pose estimation method of electronic components studied in this paper is highly applicable in actual mechanical manufacturing environments, can process large-scale data, and meets real-time requirements. It provides the theoretical basis and technical support for solving the positioning and assembly problems of components in actual industrial production. Keywords: point cloud segmentation; pose estimation; PointNet++ Model; occlusion problems; mechanical manufacturing; random sample consensus. DOI: 10.1504/IJCAT.2025.10074466
Abstract: Traditional static risk assessment methods struggle to meet real-time processing demands for large-scale, multi-source heterogeneous data, showing sluggish responsiveness to emergencies and abnormal transactions. These approaches often suffer from poor early-warning accuracy and frequent false or missed alerts. To address these challenges, this study proposes a cloud-based security risk warning evaluation system for the digital economy. The system first establishes a multi-level risk indicator framework, utilizing fuzzy hierarchical analysis and information entropy to calculate weighted metrics that integrate qualitative and quantitative indicators. It then employs grey prediction algorithms for short-term risk trend forecasting. Through a cloud computing distributed architecture, the system achieves real-time collection, processing, and risk assessment of multi-source heterogeneous data, ensuring instant precision in warnings. Experimental results demonstrate that this method consistently outperforms existing approaches in both warning accuracy and Recall metrics, with significantly reduced average response time while maintaining reasonable control over false alarm rates and resource consumption. This research provides a practical technical solution for digital economy security risk management, offering both theoretical value and practical significance. Keywords: risk early warning; evaluation system construction; digital economy; economic security. DOI: 10.1504/IJCAT.2025.10075047
Abstract: Remote sensing object detection faces persistent challenges in accurately identifying small-scale targets embedded in high-resolution, cluttered scenes. Conventional detectors often suffer from feature dilution, scale variance, and high computational cost, limiting their applicability in real-time or edge-based remote sensing scenarios. To address these issues, we propose DAFPN, a lightweight Dynamic Attention-guided Feature Pyramid Network that integrates asymmetric multi-scale fusion and dual-branch attention, consisting of spatial and channel-wise attentions, into a unified architecture optimized via multi-objective constrained learning aimed at simultaneously maximizing detection accuracy, attention alignment, and architectural compactness. On DOTA-v2.0, our method improves mAP@0.75 by 4.5% and mAP@0.5 by 3.8% over YOLOv8, while achieving similar gains on FAIR1M, DIOR, and RSSOD. The results confirm DAFPNs robustness under variable input resolutions and dense object distributions, highlighting its practical value for deployment in real-time and resource-constrained remote sensing applications. Keywords: remote sensing; small object detection; multi-objective optimization; feature pyramid networks; dynamic attention; lightweight detection; aerial imagery; real-time inference. DOI: 10.1504/IJCAT.2026.10075464
Abstract: High-dimensional data has become increasingly prevalent in a wide range of fields, including cybersecurity, finance, healthcare and industrial monitoring. However, the sparsity, redundancy and complex inter-feature relationships inherent in such data significantly complicate anomaly detection and pattern recognition tasks. Traditional machine learning methods often suffer from poor scalability and limited generalisation in high-dimensional settings. To address these limitations, this paper proposes a novel deep learning framework specifically designed for high-dimensional anomaly detection and pattern recognition. The proposed model introduces three key innovations. First, a hierarchical representation module is developed to extract multilevel semantic features by integrating adaptive kernel transformations with semantic-preserving aggregation strategies. This design improves the models ability to capture both global patterns and local anomalies. Second, a dual-branch attention mechanism is introduced to jointly learn feature-level and instance-level relevance, enhancing the models robustness to noise and irrelevant dimensions. Third, an interpretable anomaly scoring strategy is constructed based on prototype deviation in latent space, offering transparency and actionable insights for decision support. Extensive experiments are conducted on multiple real-world high-dimensional data sets. Results demonstrate that the proposed method consistently outperforms existing approaches in terms of accuracy, robustness and interpretability. Keywords: high-dimensional data; anomaly detection; hierarchical representation learning; attention mechanism. DOI: 10.1504/IJCAT.2026.10075605
Abstract: In the analysis process of popular music singing audio, factors such as environmental noise interference and complex instrument accompaniment seriously affect the accuracy of audio feature extraction, resulting in the performance of traditional music beat extraction methods being difficult to meet practical needs. Therefore, this study innovatively proposes a popular music singing beat extraction method based on multi feature fusion. Performing preprocessing operations such as discretization, denoising, and normalization on the original singing audio signal effectively improves signal quality. Through joint time-frequency domain analysis, comprehensively extract the time-frequency characteristics of music signals. Adopting a feature fusion strategy, combined with beat cycle analysis and inter beat distance calculation, high-precision beat detection is achieved. Experimental data shows that the missed detection rate and false detection rate of this method are as low as 2.1% and 2.5%, respectively, significantly better than traditional methods, providing reliable technical support for pop music performance analysis. Keywords: audio features; pop music; singing rhythm; intelligent extraction model. DOI: 10.1504/IJCAT.2026.10075713
Abstract: To reduce peak lag deviation (PLD), enhance flow mutation responsiveness (FMR), and optimize hotspot overlap rate (HOR), this paper proposes a feature recursive elimination-based method for accurate daily tourist flow prediction in attractions. Firstly, integrate multidimensional historical data through data dimensionality reduction processing to reduce data complexity. Then, extract the daily average tourist volume and traffic fluctuation features, and use feature recursive elimination method combined with random forest to streamline the feature dataset and improve feature effectiveness. Finally, utilizing evolutionary strategies to optimize the BP network, overcoming its limitations, and achieving accurate prediction. The experiment shows that for weekdays, the PLD, FMR, and HOR of this method are 2.34%, 94.56%, and 92.34%, respectively. For holidays, the PLD, FMR, and HOR of this method are 5.62%, 83.21%, and 81.23%, respectively. The numerical results are superior to existing methods. Keywords: tourist attractions; daily tourist flow; prediction methods; feature extraction; recursive elimination; evolutionary strategy; BP network. DOI: 10.1504/IJCAT.2026.10075714
Abstract: In order to improve the accuracy of dynamic mining of English teaching resources and shorten response time, a dynamic mining method for English teaching resources based on dynamic minimum support is proposed. Firstly, convert unstructured data into structured features to achieve feature extraction of English teaching resources. Secondly, the mutual information matrix and diagonalisation method are used to calculate the eigenvalues and eigenvectors, and the principal component decision matrix is constructed to achieve dimensionality reduction. Finally, based on dynamic minimum support mining technology, the threshold is adjusted in real-time to meet teaching needs. The prefix span algorithm is used to process incremental data, and when changing support, the updated itemset is scanned and filtered. The effective sequence is integrated with confidence level. The experimental results show that the mining accuracy of our method is stable at over 95%, and the response time remains between 0.62 s and 0.92 s. Keywords: dynamic minimum support; English teaching resources; dynamic mining; mutual information matrix. DOI: 10.1504/IJCAT.2026.10076164
Abstract: In order to fill the theoretical gap in the field of personalized matching of multimodal learning resources, a cross modal recommendation method for English teaching resources based on deep fusion of interest information is studied. Firstly, collect and analyze user behavior data, attribute features, and learning characteristics on the English learning platform, and construct a multidimensional interest fusion model. Secondly, by integrating the BERT model with cross modal attention perception methods, a CCA-BERT recommendation model was constructed to achieve deep feature extraction and semantic association modeling of multimodal English teaching resources such as videos, texts, and audios. Finally, personalized resource recommendation is completed based on click probability, which breaks through the limitations of traditional single mode recommendation. Empirical findings demonstrate that our cross-modal recommendation approach achieves a user satisfaction level exceeding 94.7%, while simultaneously maintaining recommendation diversity above 0.87% across experimental evaluations. Keywords: interest information; information mining; English teaching resources; cross-modal recommendation. DOI: 10.1504/IJCAT.2026.10076165
Abstract: By optimising the high-frequency noise of analogue circuits, the anti-interference ability of analogue circuits can be effectively improved, ensuring the purity of signals during transmission and processing. Therefore, an optimisation method of high-frequency noise in analogue circuits considering the stochastic fluctuation characteristics of signals is proposed. Collect analogue circuit signals, extract wavelet packet energy features, composite multi-scale entropy features and stochastic fluctuation features, and use self-attention mechanism to enhance the features, thereby achieving high-frequency noise recognition in analogue circuits. After determining the presence of high-frequency noise in the signal, CEEMDAN is used to decompose the analogue circuit signal and optimise the high-frequency noise of the analogue circuit through signal filtering and reconstruction operations. The experimental results show that the high-frequency noise recognition accuracy of the proposed method remains stable at over 95%, with an average noise coefficient of about 0.15 and an optimisation time of less than 2 s. Keywords: stochastic fluctuation characteristics of signals; analogue circuit; high-frequency noise; enhance the features; high-frequency noise recognition; CEEMDAN. DOI: 10.1504/IJCAT.2026.10076166
Abstract: To improve the accuracy of customer churn prediction and ensure the quality of e-commerce platform services, a multi-feature fusion-based e-commerce platform customer churn prediction method is proposed. Firstly, the SMOTE algorithm is used to reduce data imbalance and construct a data set for predicting customer churn on e-commerce platforms; Then, select user characteristic attributes, analyse customer consumption behaviour based on consumption time characteristics, consumption value characteristics and consumption quantity characteristics, and integrate multiple characteristics of consumption behaviour. Finally, using the blending model and four base learners RF, GBTD, XGBoost and LightGBM, along with the secondary learner of the logistic regression model, to achieve customer churn prediction on e-commerce platforms. Experimental results have shown that the maximum error in predicting customer churn rate using the method proposed in this article does not exceed 0.1%, with an AUC value of 0.913 and a maximum accuracy of 0.92. Keywords: e-commerce platform; customer churn; multi-feature fusion; blending model; learner. DOI: 10.1504/IJCAT.2026.10076167
Abstract: The operating conditions of the power grid have strong nonlinear time-varying characteristics, and the line loss data presents non-stationary features, resulting in a decrease in the accuracy of line loss anomaly identification and a high false alarm rate. Therefore, a grey wolf algorithm is proposed to optimize the identification method of power grid line loss anomalies under SVR. Firstly, LSTM network is used to complete the missing power line loss data. Secondly, by simulating the encirclement mechanism and position update strategy of grey wolf hunting behavior, the SVR parameters are adaptively adjusted. Finally, with the maximum number of iterations as the termination condition, output the optimal parameter combination to identify the abnormal state of line loss. The experimental results show that the accuracy of the proposed method for identifying line loss anomalies always remains in the high range of 96% -98%, and the false alarm rate always remains below 2%. Keywords: grey wolf algorithm; support vector machine regression; power grid line loss; abnormal identification.
Abstract: External environmental factors such as weather can cause changes in the failure rate of power supply components in distributed distribution networks, resulting in spatiotemporal correlation failures, affecting self-healing performance, and reducing reliability. This article proposes a power supply reliability evaluation method for distributed distribution networks with self-healing performance. Firstly, a topology model of the distribution network is constructed using graph theory. The adjacency matrix and adjacency table are used to describe the node connection relationship, and topology shrinkage technology is combined to simplify the complex grid structure and remove redundant branch lines to improve computational efficiency. Secondly, a system of component reliability indicators, load point reliability indicators, system reliability indicators, and electricity reliability indicators was constructed. Finally, the sequential Monte Carlo simulation method was used to simulate the fault process with self-healing actions through component state sampling, fault affected area division, and load transfer analysis. Based on simplified topology and comprehensive index system, a comprehensive evaluation of the self-healing performance and reliability of distributed distribution networks was achieved. Keywords: self-healing performance; distributed distribution network; reliability evaluation. DOI: 10.1504/IJCAT.2026.10076168
Abstract: To address the issues of high Matthews correlation coefficient, high Brier scores, and low accuracy of perception in traditional methods, an accurate perception method for enterprise financial risk based on Stacking ensemble learning is proposed. Using the improved SMOTE to obtain a balanced dataset of enterprise financial data, and based on the Boruta algorithm to screen the features related to financial risk in the balanced dataset; Using Stacking ensemble learning technology and combining the prediction results of four base models to train a meta model, the obtained features are input into the trained meta model to obtain risk prediction results. Based on the prediction results, financial risks are classified into five levels to achieve enterprise financial risk perception. The experimental results show that the Matthews correlation coefficient of this method has never been lower than 0.90, the minimum Brier score is only 0.071, and the maximum perception accuracy is 98.36%. Keywords: enterprise financial risk; risk perception; SMOTE; Boruta algorithm; stacking ensemble learning. DOI: 10.1504/IJCAT.2026.10076169 FPGA implementation and Multisim simulation of a new four-dimensional two-scroll hyperchaotic system with coexisting attractors ![]() by Sundarapandian Vaidyanathan, Esteban Tlelo-Cuautle, Khaled Benkouider, Aceng Sambas, Ciro Fabian Bermudez-Marquez, Samy Abdelwahab Safaan Abstract: Field-programmable gate array (FPGA) design of a new four-dimensional two-scroll hyperchaotic system is investigated in this work. A detailed system modelling of the new system with a hyperchaotic attractor begins this work with phase plots, which is followed by a bifurcation study of the new system. Special dynamic properties such as multistability and symmetry are also investigated for the new system. Using Multisim software, a circuit model is designed and simulated for the new hyperchaotic system. FPGA design and Multisim simulation of the new system enable practical applications in science and engineering. The implementation of the FPGA design in this work is carried out by applying two numerical schemes, viz. Forward Euler and Trapezoidal methods. Experimental attractors observed in the oscilloscope show good match with the Matlab signal plots.The FPGA hardware resources are detailed for both numerical methods. Keywords: hyperchaos; bifurcation; symmetry; phase plots; hyperchaotic system;rnparameters; stability; multistability; circuit model; FPGA implementation. Improving hybrid-layer convolutional neural network system for lung cancer nodule classification using enhanced weight optimisation algorithm ![]() by Vikul Pawar, P. Premchand Abstract: In recent times, lung cancer is evolving as a highly life-threatening disease for human beings. According to the WHO, lung cancer disease is the second largest cause of deaths as compared to all other types of cancer. The prevailing available technology is striving to get more exposure in the field of medical science using Computer Assisted Diagnosis (CAD), where image processing is playing a crucial role for detecting the cancerous nodules in computer tomographic images. Augmenting the machine learning techniques with image processing algorithms is becoming a more comprehensive examination of cancer disease in proposed CAD systems. This paper is describes a heuristic approach for lung cancer nodule detection, and the proposed model predominantly consists of the following tasks, which are image enhancement, segmenting ROI (Region of Interest), features extraction, and nodule classification. In pre-processing, primarily the Adaptive Median Filter (AMF) filtering method is applied to eliminate the speckle noise from input CT images of Lung Image Database Consortium (LIDC) and Image Database Resource Initiative (IDRI): in the LIDC-IDRI dataset, the quality of input image is improved by applying Histogram Equalization (HE) technique with Contrast-Limited Adaptive (CLA) approach. Secondly, in the successive stage the Improved Level-Set (ILS) algorithm is used to segment the ROI. Furthermore, the third step of the projected work is applied to extract the definite learnable texture features and statistical features from the segmented ROI. The extracted features in the subsequent stage of classification are applied to Hybrid-Layer Convolutional Neural Network (HL-CNN) architecture to classify the lung cancer nodule as either benign or malignant. Principally this research is carried out by contributing to each stage of it, where the novel concept of the improved Hybrid-Layer Convolutional Neural Network (HL-CNN) is employed by optimising and selecting the optimal weight using the Enhanced Cat Swarm Optimisation (ECSO) algorithm. The experimental result of the proposed HL-CNN using the weight optimisation algorithm ECSO is achieved an accuracy of 93%, which is comparatively efficient with respect to existing models such as DBN, SVM, CNN, WOA, MFO, and CSO. Moreover, the proposed model conclusively gives a decision on the detected nodule as either benign or malignant. Keywords: Computer Assisted Diagnosis (CAD); Computer Vision; Cancer Diagnosis; Image Classification; Image Enhancement; Image Segmentation; Feature Extraction. Prediction model for total amount of coke oven gas generation based on FCM-RBF ![]() by Lili Feng, Jun Peng, Zhaojun Huang Abstract: The rational use of Coke Oven Gas (COG) is of great significance to improve the economic efficiency of enterprises. In this paper, a COG generation prediction model based on fuzzy C-mean clustering (FCM) and radial basis function (RBF) neural network is proposed to address the problems such as the difficulty of accurate modelling of COG generation process and the difficulty of real-time flow prediction. Firstly, the coke oven production process is analysed and correlation analysis is used to select the influencing factors. Secondly, the FCM is used to classify the working conditions of the coke oven, and the appropriate number of working conditions is selected through experiments. Finally, the prediction models under different working conditions are established separately by using RBF. The experiments were carried out using actual industrial production data, and the experimental results showed that the model could provide guidance reference for the dispatchers. Keywords: coking oven process; fuzzy C-means clustering; prediction model; radial basis function neural network. Hie-Graph-YOLOv9: a hierarchical YOLOv9 model with graph-based SE attention mechanism for vehicle detection in complex background by T. Selvamuthukumar, K. Vijayalakshmi, P. Dhanalakshmi, R. Abinaya Abstract: Advanced vehicle detection algorithms are key to Intelligent Transportation Systems (ITS), enabling real-time traffic analysis, congestion and security management. Existing models like YOLOv9 face challenges in feature selection and learning, especially in dynamic or cluttered environments. To address these limitations, this research proposes Hie-Graph-YOLOv9 which is an extended version of YOLOv9 based on improving the feature selecting, feature learning and loss function by incorporating Hiera Transformers, Graph-based GAN-SE attention mechanism and Geometric-based Weighted Smooth L1 loss function. Hiera Transformers, integrated into the backbone network across four stages, refine multi-scale feature learning, ensuring robust representation of fine-grained and global patterns. The Graph-based GAN-SE, embedded in the bottleneck module, emphasises critical regions of feature maps, enhancing detection accuracy. Additionally, a Geometric-based Weighted Smooth L1 loss function is employed for bounding box regression, improving convergence speed and training stability. Experimental evaluations demonstrate the superiority of Hie-Graph-YOLOv9, achieving an AP (0.5) of 79.5%, improvement of faster convergence by 120 Epochs and an increased inference speed of 41.95 FPS, outperforming state-of-the-art models. This work offers a significant step forward in vehicle detection under complex real-world conditions. Keywords: object detection; YOLO; vehicle; Hiera; graph; squeeze and excitation. Hie-Graph-YOLOv9: A Hierarchical YOLOv9 model with Graph-based SE attention mechanism for vehicle detection in complex background ![]() by T. Selvamuthukumar, K. Vijayalakshmi, P. Dhanalakshmi, R. Abinaya Abstract: Advanced vehicle detection algorithms are key to intelligent transportation systems (ITS), enabling real-time traffic analysis, congestion and security management. The proposed Hie-Graph-YOLOv9 method is an extended version of YOLOv9 based on improving the feature selecting, feature learning and loss function. In this YOLO architecture, we induced Hiera Transformers in the backbone network in four stages for improving the feature learning. We also introduced the Graph based GAN-SE attention mechanism in the bottleneck module for giving attention to essential feature map regions and utilized Geometric based Weighted Smooth L1 loss function for bounding box prediction for faster convergence, training stability and improved accuracy. Keywords: object detection; YOLO; vehicle;Hiera; graph; Squeeze and Excitation. DOI: 10.1504/IJCAT.2025.10072853 A monitoring and early warning of respiratory infectious disease symptoms based on multi-source information data fusion ![]() by Shengcong Tao, Yirong Guo Abstract: An oversight and alert methodology grounded in multi-source information data amalgamation is proposed to address the issues of elevated root mean square error and suboptimal alert efficacy in respiratory infectious disease symptom monitoring. First, manifestation data characteristics are delineated through time series analysis, and Support Vector Machines (SVM) are employed for feature extraction. Wavelet transformation technology is utilised to eliminate noise and rectify missing data. Subsequently, data level, feature level and decision level are progressively integrated to consolidate multi-source data characteristics, while Markov chain models are amalgamated to determine alert zones. The experimental results demonstrate that the proposed method achieves optimal performance in the root mean square error test of multi-source respiratory infectious disease symptom data fusion, with a minimum error of 0.11%. In the absolute accuracy value test for symptom monitoring and warning, the highest accuracy is observed to approach 100%. Keywords: data fusion; time series definition; SVM; decision level fusion; Markov chain. DOI: 10.1504/IJCAT.2025.10074468 A theoretical framework for integrating federated learning and transfer learning: advancing optimisation in decentralised systems ![]() by Mohammed Abdul Wajeed, Annavarapu Chandra Sekhara Rao Abstract: Federated Learning (FL) has transformed decentralised model training by enabling collaborative learning while protecting data privacy. Key challenges include non-iid data distributions, slow convergence and limited understanding of combining FL with other paradigms. This research introduces a theoretical framework establishing foundations for incorporating Transfer Learning (TL) into FL to address these issues. The Federated Transfer Optimisation (FTO) framework expands FL optimisation theories by introducing transfer-invariant initialisation metrics for efficient use of pre-trained models. We introduce a Transfer Learning Augmented Loss (TLAL) function combining global objectives and local transfer dynamics to control knowledge retention during fine-tuning. The framework presents adaptive task-alignment kernels to balance global and client-specific objectives in heterogeneous scenarios. Experimental evaluations on text classification data sets show FTO achieves better accuracy, reduced communication overhead and faster convergence compared to existing FL methods. This study provides a principled basis for integrating TL, enabling efficient learning systems for privacy-sensitive applications. Keywords: federated learning; transfer learning; federated transfer optimisation; distributed optimisation; adaptive task-alignment kernels; transfer learning augmented loss; TLAL; integrate federated transfer learning; text classification. DOI: 10.1504/IJCAT.2025.10074663 Multi-dimensional data mining of English online teaching platform based on improved decision tree ![]() by Jingping Du Abstract: In order to improve the acceleration ratio and mining accuracy of data mining, this paper proposes a new multi-dimensional data mining method for English online teaching platforms based on improved decision tree. Firstly, information granule technology is introduced for data reconstruction, utilizing neighborhood data relationships to improve clustering accuracy. Secondly, constructing an association rule mapping structure, using association matrix and difference coefficient matrix to present the relationships between datasets, introducing mining factors and relative errors to improve subsequent mining accuracy. Finally, the improved C4.5 decision tree algorithm is adopted, combined with principal component analysis to reduce dimensionality, and features are filtered through information gain rate to improve data mining accuracy and efficiency. The experimental results show that the mining performance of our method is significantly improved, with a data mining acceleration ratio maintained above 0.9 and a data mining accuracy maintained above 98.54%. Keywords: improve decision tree; English online teaching platform; multidimensional data; data mining. DOI: 10.1504/IJCAT.2025.10075111 An enhancement processing for smoke environment images of firefighting robots based on improved homomorphic filtering ![]() by Lei Zhang, Baochen Yang, Wenlian Guo Abstract: In order to improve the image processing effect and enhance the usability of images, this paper designs a smoke environment image enhancement processing method for fire extinguishing robots based on improved homomorphic filtering. Using fire extinguishing robots as carriers, high-speed cameras are installed to capture image information of smoke environments. Homomorphic filtering is used to eliminate interference information in the images, and the homomorphic filtering results are improved through a total variation model. Through smoothing processing, the edges of objects are well preserved. Extract global feature values of images using Retinex algorithm. Finally, the extracted image feature values are weighted and fused to construct an image enhancement model, which completes the image enhancement process through model calculations. The experimental results show that this method can effectively enhance the information in smoke environment images, with an average contrast gain of nearly 30% and a natural preservation of around 98%. Keywords: firefighting robot; smoke environment image; image enhancement; homomorphic filtering; total variational model; Retinex algorithm; feature extraction. DOI: 10.1504/IJCAT.2025.10075218 Integrating security within DevOps for continuous protection: securing software development through unified practices ![]() by Bahaa Eddine Elbaghazaoui, Tarik El Moudden, Salma El Omari, Soukaina Nai, Imane Moustati, Khalid Benabbes Abstract: DevSecOps integrates security into the DevOps pipeline, embedding it as a core part of the software development lifecycle. This paper examines its evolution from traditional DevOps, emphasizing principles such as Security as Code, Shift-Left Security, and Continuous Monitoring, which together enable proactive vulnerability management and resilient delivery. It explores challenges including cultural resistance, skill gaps, and the complexity of tool integration, while outlining practical solutions such as automating security checks, fostering a security-first culture, and leveraging metrics to track progress. Future trends shaping DevSecOps are also discussed, including AI-driven threat detection, Zero Trust Architecture, and Compliance-as-Code to streamline regulatory adherence. By addressing these aspects, organizations can achieve secure, agile, and adaptive software delivery. The paper contributes an actionable, stage-wise adoption view that couples culture, process, and CI/CD gate placement, illustrated with a small-business example and concrete outcome metrics to demonstrate practicality and measurable impact. Keywords: DevSecOps; shift-left security; AI; artificial intelligence; zero trust architecture; compliance-as-code. DOI: 10.1504/IJCAT.2026.10075793 Parallel classification method of remote sensing images based on fuzzy support vector machine ![]() by Youlin Cai Abstract: To enhance the accuracy and efficiency of remote sensing image classification, a parallel classification methodology based on fuzzy support vector machines (FSVM) is proposed. Fully dilated convolutional networks are employed for remote sensing image feature extraction, while both spatial and channel attention mechanisms are incorporated to capture subtle discriminative features. Then, the extracted features are subjected to dimensionality reduction and fusion processing to improve the efficiency of subsequent processing. Finally, fuzzy theory is introduced to construct a fuzzy support vector machine model. The processed features are fed into the system, where parallel computing is integrated into the fuzzy support vector machine (FSVM) framework for remote sensing image classification, thereby enabling parallelised image classification. The results show that the proposed method has a maximum classification accuracy of 97%, an average accuracy of over 90%, a Kappa coefficient of 0.96, and a maximum classification time of only 4.75s. The space complexity remains around 10GB, indicating strong classification performance. Keywords: remote sensing images; spatial and channel attention mechanism; fuzzy support vector machine; parallel classification. DOI: 10.1504/IJCAT.2025.10076151 Study on abnormal behaviour recognition of substation construction site based on dynamic feature fusion ![]() by Yun Zhao, Ziwen Cai, Yuxin Lu, Wei Cao Abstract: This paper proposed an abnormal behaviour recognition method of substation construction site based on dynamic feature fusion. Kinect camera is used to collect the video of substation construction site, and the key frames of the video are extracted by frame difference method. The obtained image is greyscaled and Gaussian filtered, and the image target is detected by convolutional neural network. The dynamic features of the target, such as texture features, motion features and shape features, are fused, and the abnormal behaviour is recognised and abnormal behaviour is recognised by combining the dynamic features with a multi-layer perceptron. Experimental results indicate that the feature extraction accuracy of the proposed method ranges from 95.27% to 97.48%, with a maximum accuracy rate of 98.63%, the recognition time varies from 0.16 s to 0.86 s. Keywords: dynamic feature fusion; substation; construction site; abnormal behaviour recognition; texture features; texture features; shape features. DOI: 10.1504/IJCAT.2025.10076152 A data classification algorithm for ideological and political teaching resources integrating density peak clustering and random forest ![]() by Liping Xu, Qinghua Zhang Abstract: In this paper, a data classification algorithm for ideological and political teaching resources that integrates density peak clustering and random forest is proposed. Firstly, process missing and outlier values through data cleaning steps to ensure data quality. Next, using correlation analysis and information gain based methods, important features for classification are selected to reduce the dimensionality of the data. Subsequently, the density peak clustering method was used to automatically discover the clustering structure in the data through adaptive distance measurement and local density calculation. Finally, the random forest algorithm is used for classification, and through a dynamic weighted voting mechanism, the voting weight of each decision tree is determined based on its classification accuracy on the validation set to optimize the classification performance. The experimental results show that this method effectively improves the efficiency and accuracy of data classification. Keywords: density peak clustering; random forest; teaching resources; dynamic weighted voting. DOI: 10.1504/IJCAT.2025.10076153 Text-guided product image editing based on multimodal feature fusion ![]() by Dianhui Mao, Zhongxin Zheng Abstract: In order to improve the multimodal consistency and semantic similarity of product image editing results, a text guided product image editing method based on multimodal feature fusion is proposed. Firstly, shape features are extracted through Hu moments, texture characteristics are described with a grey-level co-occurrence matrix, and edge features are detected via the Canny algorithm. Secondly, image features including shape, texture, and edges are integrated with target text information using a dual attention mechanism, thereby achieving multimodal feature fusion. Finally, text guided product image editing is achieved by employing a generative adversarial network model and combining the feature fusion results of target text with existing images. The experimental results demonstrate that a multimodal consistency coefficient of 0.98 and a visual semantic similarity of 0.990 can be achieved by the proposed method. Keywords: multimodal features; dual attention mechanism; feature fusion; text guided product images; image editing. DOI: 10.1504/IJCAT.2025.10076154 Security access control method for three-dimensional database of power grid engineering under high concurrent access ![]() by Qiang Wang, Yong Feng, Jingguo Rong, Su Zhang Abstract: In order to reduce the avalanche effect value during secure access to the threedimensional power grid engineering database and improve access throughput, this study designed a database security access control method for high-concurrency environments. Firstly, thread pool technology was introduced to implement user identity authentication in high-concurrency access scenarios. Secondly, based on the zero-trust principle, a secure access permission allocation architecture was designed to dynamically assign user access permissions by calculating user trust values, three-dimensional data quantification values and permission decision values. Finally, blockchain technology was employed to generate secure access control contracts. Experimental results demonstrate that after applying this method, the maximum avalanche effect value of the three-dimensional power grid engineering database was only 0.50, while the maximum database throughput reached 340 Tps. Keywords: high concurrency access; power grid engineering; 3D database; security access control; blockchain technology; identity authentication; permission allocation. DOI: 10.1504/IJCAT.2025.10076155 |
Open Access
