Title: Prediction models for ozone in metropolitan area of Mexico City based on artificial intelligence techniques

Authors: Gong Bing; Joaquín Ordieres-Meré; Claudia Barreto Cabrera

Addresses: Department of Industrial Engineering, Business Administration and Statistic, E.T.S. Industrial Engineering, Universidad Politécnica de Madrid, C/José Gutiérrez Abascal, 2-20086, Madrid, Spain ' Department of Industrial Engineering, Business Administration and Statistic, E.T.S. Industrial Engineering, Universidad Politécnica de Madrid, C/José Gutiérrez Abascal, 2-20086, Madrid, Spain ' Department of Industrial Management, Instituto Tecnologizo de Zacatepec, Calrada Tecnologizo 27, Plan Ayala 62780 Zacatepec del Hidalgo, Morelos, Mexico

Abstract: Ozone is one of the worst harmful pollutants nowadays which affects the public health, so it is necessary to predict ozone level accurately in order to prevent the public from exposing to the pollution when it exceeds the limits. This study aims to predict daily maximum ozone concentrations in the metropolitan area of Mexico City by using four individual artificial intelligence techniques: multiple linear regression, neural networks, support vector machine, random forest, and two ensemble techniques: linear ensemble and greedy ensemble. Results from the comparison among different artificial intelligence techniques clearly showed that ensemble models, especially linear ensemble model, outperformed the individual artificial intelligence techniques. Moreover, it is concluded that the performance of models is influenced by the time ahead factor for the predictors. The errors of prediction models related to the data of current day are only around 50% of ones corresponding to the data of the previous day. In addition, in order to select the input variables properly, analysis of variance (ANOVA) based on multiple linear regression models was performed. Best model prediction capability also depends on the ranges of input variables.

Keywords: ozone pollution; prediction modelling; Mexico City; artificial intelligence; air pollution; multiple linear regression; neural networks; support vector machines; SVM; random forest; linear ensemble; greedy ensemble; ANOVA; ozone concentrations; metropolitan areas.

DOI: 10.1504/IJIDS.2015.068756

International Journal of Information and Decision Sciences, 2015 Vol.7 No.2, pp.115 - 139

Published online: 12 Apr 2015 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article