Forthcoming and Online First Articles

International Journal of Big Data Intelligence (IJBDI)

Forthcoming articles have been peer-reviewed and accepted for publication but are pending final changes, are not yet published and may not appear here in their final order of publication until they are assigned to issues. Therefore, the content conforms to our standards but the presentation (e.g. typesetting and proof-reading) is not necessarily up to the Inderscience standard. Additionally, titles, authors, abstracts and keywords may change before publication. Articles will not be published until the final proofs are validated by their authors.

Forthcoming articles must be purchased for the purposes of research, teaching and private study only. These articles can be cited using the expression "in press". For example: Smith, J. (in press). Article Title. Journal Title.

Articles marked with this shopping trolley icon are available for purchase - click on the icon to send an email request to purchase.

Online First articles are published online here, before they appear in a journal issue. Online First articles are fully citeable, complete with a DOI. They can be cited, read, and downloaded. Online First articles are published as Open Access (OA) articles to make the latest research available as early as possible.

Articles marked with this Open Access icon are Online First articles. They are freely available and openly accessible to all without any restriction except the ones stated in their respective CC licenses.

Register for our alerting service, which notifies you by email when new issues are published online.

We also offer RSS feeds, which provide timely updates of tables of contents, newly published articles and calls for papers.

International Journal of Big Data Intelligence (5 papers in press)

Regular Issues

  • Temporal outlier analysis of Online Civil Trial cases based on graph and process mining techniques
    by Beniamino Di Martino, Luigi Colucci Cante, Antonio Esposito, Pietro Lupi, Massimo Orlando 
    Abstract: Since the complete digitisation of civil processes that took place in Italy in 2008, a large amount of data on the life cycle of thousands of civil proceedings has been collected. However, despite the continuous monitoring that Italian courts are subjected to, and the regulatory changes to procedures enacted in recent years, the average duration of proceedings is still far from acceptable. To identify elements that could point to the causes of such slowness, data provided by the Court of Livorno have been analysed through process mining and graph techniques, in order to assess the coherence and correct application of the process model. A methodology to identify and analyse the outlier processes has been developed, as described in this work, which also detects characteristics that could justify delays in the processes' completion. In this work, process mining and statistical techniques have been applied to the analysis of the proceedings data, and outliers, with their characteristics, have been recognised.
    Keywords: process mining; data model enrichment; outlier analysis; graph-based techniques.
    DOI: 10.1504/IJBDI.2021.10037170
  • Formulation Of A Rational Option Pricing Model using Artificial Neural Networks
    by Kaustubh Yadav 
    Abstract: This paper investigates option pricing with artificial neural networks (ANNs), applied to pricing Apple's European call options. The model is based on the premise that ANNs can be used as function approximators and, to some extent, as an alternative to numerical methods, for a viable and faster solution. We evaluate our predictions against existing solutions for the same problem: the analytic solution of the Black-Scholes equation, the COS model for Heston's stochastic volatility model, and the standard Heston quasi-analytic formula. The aim of this study is to find a viable, time-efficient alternative to existing quantitative models for option pricing.
    Keywords: Black-Scholes equation; Heston model calibration; option pricing; stochastic processes; artificial neural networks.
    DOI: 10.1504/IJBDI.2021.10039070
  • Systematic Privacy Impact Assessment Scheme (SPIAS) for Smart Connected Toys Data Privacy Compliance
    by Benjamin Yankson, Patrick Hung, Farkhund Iqbal 
    Abstract: Children's privacy compliance assessment in the area of smart connected toys (SCTs) or play robots is a challenge, considering the plethora of device manufacturers using varied controls in an attempt to address diverse global children's privacy regulations. The systematic privacy impact assessment scheme (SPIAS) is an abstract model developed to assess children's SCT privacy compliance with known global child-centred legislation such as the Children's Online Privacy Protection Act (COPPA). SPIAS addresses this issue by integrating a global survey and analysis of existing children's privacy legislation, and by evaluating privacy theories to provide a means of classifying data processed by an SCT into set states using a finite state machine (FSM). SPIAS can be adapted as a tool to assess and address privacy risks and compliance arising from a new SCT or from the convergence of existing SCTs that collect, process, and transfer children's information.
    Keywords: smart connected toy; SCT; data; privacy; internet of things; IoT; assessment; compliance.
    DOI: 10.1504/IJBDI.2021.10040364

Special Issue on: SNTA2019 Systems and Network Telemetry and Analytics

  • Network Traffic Performance Analysis from Passive Measurements using Gradient Boosting Machine Learning
    by Astha Syal, Alina Lazar, Jinoh Kim, Alex Sim, Kesheng Wu 
    Abstract: Effective monitoring and analysis of network traffic are vital for scientific computing, since scientific applications often require moving massive data from one site to another. A body of statistical and machine learning techniques has been introduced for network traffic monitoring and analysis, but it remains a highly challenging task for several reasons, such as the unavailability of label information, the complexity of real-time analysis, the generalization property of machine learning models, and so forth. In this paper, we present a novel method that identifies the continuous time windows of low throughput for the purpose of network performance analysis and anomaly detection, in order to facilitate data transfers for high-performance scientific computing. The presented method is based on supervised learning techniques with an adaptive labeling function that automatically determines whether a time window is slow or normal. The presented method is validated on real large datasets collected from several data transfer nodes (DTNs) located in Science DMZ. Our experimental results show that the proposed method is able to quickly predict time windows of low-performing network transfers that require attention from network engineers.
    Keywords: Network traffic; TCP performance; UMAP; classification; Tstat; supervised machine learning; accuracy; cross-validation.

  • Predicting WAN Traffic Volumes using Fourier and Multivariate SARIMA Approaches
    by Bashir Mohammed, Mariam Kiran, Nandini Krishnaswamy 
    Abstract: Network traffic has been a vital research issue which has attracted huge attention both in the network operations research domain and in industry. It has become crucial to develop techniques to better understand and predict the behavior and performance of networks. Understanding how network links are used and how data moves can help network operations improve link utilization and capacity. In this paper, we tackle the need to understand traffic patterns across a large network topology by developing statistical models that use a multivariate data model, incorporating seasonality, peak frequencies, and link relationships to improve future predictions. Using Fourier Transforms to extract seasons and peak frequencies from individual traces, we perform seasonality tests and ARIMA measures to determine optimal parameters to use in our prediction model. Our study shows that network traffic is non-stationary and possesses seasonality, making the SARIMA prediction approach the most suitable for network traffic prediction. We also develop a multivariate model on real network traces that are collected from multiple time zones of a large R&E WAN. Our results indicate an improved prediction accuracy, with better RMSE and smaller confidence intervals, using a multivariate approach rather than univariate approaches. Our work provides key insights into studying network traffic, creating a deeper understanding of prediction methods necessary for future research in network capacity management and planning.
    Keywords: Traffic forecasting; Multi-variate Time-series analysis; Fourier Transforms; SARIMA.
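
The WAN traffic abstract above describes using Fourier Transforms to extract seasons and peak frequencies from individual traces. A minimal sketch of that general idea, under the assumption of an evenly sampled trace, is to compute a discrete Fourier transform and report the period of the strongest non-zero frequency. This is not the authors' implementation; the function name `dominant_period` and the synthetic trace are hypothetical, and a plain O(n^2) transform is used so the example needs only the standard library.

```python
import cmath
import math


def dominant_period(series):
    """Return the period (in samples) of the strongest periodic component.

    The mean is removed first so the zero-frequency term does not dominate,
    then the power at each frequency bin k = 1 .. n/2 is compared.
    """
    n = len(series)
    if n < 4:
        raise ValueError("series is too short")
    mean = sum(series) / n
    centered = [x - mean for x in series]

    best_k, best_power = 0, 0.0
    for k in range(1, n // 2 + 1):
        # Naive discrete Fourier transform coefficient for bin k.
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(centered))
        power = abs(coeff) ** 2
        if power > best_power:
            best_k, best_power = k, power

    if best_k == 0:
        raise ValueError("series has no variation")
    return n / best_k


# Synthetic hourly trace (made-up data): one week with a daily cycle.
trace = [10.0 + 3.0 * math.sin(2.0 * math.pi * t / 24.0) for t in range(168)]
print(dominant_period(trace))
```

A period found this way could serve as a candidate seasonal length when configuring a seasonal model such as SARIMA; for long traces an FFT routine would replace the naive loop.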