Thursday 6 December 2012

Meteorology-based Forecasting of Air Quality Index Using Neural Network



Meteorology-based Forecasting of Air Quality Index Using Neural Network

Mukesh Sharma,* Sachin Aggarwal*, Purnendu Bose* and Ashok Deshpande+
*Department of Civil Engineering, Indian Institute of Technology
Kanpur, 208016, India
                   +Distinguished Professor SIES-Indian Institute of Environment Management,    
                                                      Navi Mumbai, 400 076 India

Abstract: Air Quality Index (AQI), a system for transforming air pollution levels into a single number, aims at providing information about air quality in simple terms to general public. Any advance information about AQI can forewarn the public of unhealthy air and encourage people to voluntarily reduce emissions-producing activities and avoid exposures to polluted environment. Two mathematical models (i) meteorology-based air quality level predictions and (ii) meteorology forecasting, have been developed (based on four year data) using neural network to forecast AQI for following three days. The AQI forecasting model was concluded as being satisfactory and useful for information dissemination to general public.





Introduction and Objective
Air pollutants at ground level can be harmful to human health if their concentrations exceed certain acceptable levels. As pollutants accumulate in or near large metropolitan areas, this typically exposes people to unhealthy pollutant concentrations In light of the health effects of air pollutants, environmental agencies (e.g. USEPA, 1998) have been using air quality index (AQI) for public information (about air quality) and data interpretation. AQI is defined as an overall scheme that transforms the weighted values of individual air pollution related parameters (e.g. SO2, CO, visibility, etc.) into a single number or set of numbers. An AQI is developed in Indian context (Sharma et al (2001)) which include three pollutants (sulphur dioxide (SO2), nitrogen dioxide (NO2) and Suspended Particulate Matter (SPM)). Segmented linear functions are used for relating the actual air pollution concentrations (of each pollutant) to a normalized number. The categories of index system are:
 0-100; Good, 101-200; Moderate, 201-300; Poor, 301-400; Very poor, 401-500; Severe
Ideally, the developed AQI should have accurate forecasting capability. Any advance information about AQI can forewarn the public of unhealthy air and encourage people to voluntarily reduce emissions-producing activities and avoid exposure to polluted environment. The purpose of this paper is to establish a mathematical model between the metrology and air quality in terms of air quality index and develop a forecasting mechanism using artificial neural network for the developed AQI.
A scheme utilizing neural network simulation is developed in the course of this work for prediction of AQI for a location in the city of Kanpur (longitude 88° 22' E and Latitude 26° 26' N), India. The neural network model is trained and tested using the air quality and meteorology data. Two neural network models were developed one for developing a meteorology -air quality linkage and second for forecasting the meteorology three days in advance. Sampling location at Agricultural University (AU), Kanpur was selected for this study as concurrent data of air quality (from National Ambient Air quality Monitoring Programme) and meteorology were available for last several years at this site, which were essential for this study.
Meteorology-Air Quality Model Development
Air pollution system has three components: emission source, transport medium (atmosphere) and receptor. Pollution reaching a receptor depends not only on the emitted quantity but also on the atmospheric dynamics. The impact on receptor can be estimated by developing source-receptor linkages through atmosphere.
A modeling system (source-receptor linkages) is necessary to predict impact on the receptor and translate the impact into an air quality index.
Steps in Model Formulation
The impact on receptor at any point is a function of source strength, and meteorological parameters like wind speed, wind direction, temperature, relative humidity, stability, rainfall etc. Thus, the AQI forecasting modelling completion is a three-step process.
(i) Air Quality Models: Establish the model that can predict the air quality for various pollutants in terms of meteorology
(ii) Meteorology Forecasting Models: Forecast meteorology for coming next days and combined the forecasted meteorology to forecast air quality using the model developed in step (i) above
(iii) AQI Forecasting: Estimate and forecast the air quality index based on forecasted air quality levels in step (ii) above.
Air Quality Models  
Since air quality data were available on three pollutants, SPM, SO2, and NO2, these pollutants are selected for model prediction and index formulation.
 Four-year air quality data for the site (Agriculture University, Kanpur) for the years 1997 to 2000 were collected from Central Pollution Control Board, Delhi. Similarly, meteorological data for the same time period and at the same location were also collected. Meteorological data included temperature, wind speed and direction, relative humidity and rainfall.
Initial data analysis showed significant seasonal variability in air pollutant concentrations. Monsoon period was the cleanest and winter months were the most polluted. Therefore, meteorological and air quality data were segregated as per the following seasons:
Winter: December, January, and February, march
Summer: April, may, June
Monsoon: July, August, September and
Post monsoon: October, November
In all, twelve models need to be developed for predicting air quality concentrations for each set of meteorological parameters. The twelve models include three models (for SO2, NO2 and SPM) for four seasons (winter, summer, monsoon and post monsoon). In other words, for a given season and meteorological data, the model should predict concentration of the pollutants. For this purpose, a neural network model (presented below) was developed.
Artificial Neural Network (ANN) Model for Air Quality Modelling
For construction of neural network model, the air pollution system is looked upon as a system that under varying sets of meteorological inputs (e.g. weather conditions) will respond by producing different sets of output. Such a model presupposes no prior knowledge about the structure of relationship that exists between input and output variables (e.g. pollutant concentrations).
Neural network comprises a number of interconnected entities, similar in many ways to biological neurons. The choice of the architecture of the network depends on the task to be performed. For modelling physical system such as air pollution system, a feed-forward layer is normally employed (Wasserman, 1989). It consists of layers of input neurons, and one and more hidden layers. For this study, a software "Neuro Genetic Optimizer" (NGO, version 2.6) is used. The modelling results are shown in Figure 1. The other details of model are given below.
ANN model  - Steps Followed
1.       The air quality data were categorized into four seasons as explained earlier.
2.       The meteorology data were regressed with air quality data. Only average wind speed and average temperature of the day showed significant correlation with air quality. Hence, average temperature and wind speed of day was only taken as input layer. Output layer consisted of concentration of SO2, NO2, and SPM concentration.
3.       Four models one for each season were developed for each pollutant. For this purpose, data set of each season for four years was randomly divided into training and testing record (50% for each).
4.       Input data was normalized between -1 and 1 and output was subsequently de-normalized
5.       Weights were randomly initialized between +0.3 to -0.3.
6.       Momentum constant was selected in the range 0.1 to 0.3
7.       Multiple hidden layers were used; their number was selected using genetic algorithms.
8.       Selection of hidden layer and number of hidden neurons was decided using genetic algorithms
9.       Limit on the number of hidden neurons was set as eight.
10.    The minimum number of passes for each network was 20 and maximum was 50.
11.   







Learning rate constants were selected in the range 0.1 to 0.4.



Figure 1 Neural Network Modelling of NO2, SO2 and SPM
Figure 1 shows model performance and values of coefficient of correlation (R). The R-values were found significant at 1% level of significance in a statistical sense and model was accepted for further analysis.  It can be concluded that the developed neural network model establishes a reasonable relationship between meteorological inputs and air concentrations.
Meteorology Forecasting Model
As identified that for predicting air concentration only average temperature and wind speed of the day were adequate for predicting air concentrations. It was decided that attempt should be made to forecast AQI for next three days to provide enough warning period to people likely to be exposed from air pollution. To this end, two neural networks were developed one for forecasting mean temperature and another for wind speed. It may be stated that meteorology is a very complicated phenomenon and explaining it on basis of simple mathematical models is very difficult and one has to resort to ANN. The following steps were undertaken to develop two neural networks-based forecasting model one each temperature and for wind speed.
·         Collection of data : four years of meteorological data for various parameters like  temperature,  wind speed and direction, relative humidity, rainfall etc were collected,
·         Preparation of data: for advance forecasting for three days, input data for each set of output was prepared. A value of previous four days was found adequate to forecast temperature and wind speed for next three days. Therefore, the data were arranged in a set of four; total 1100 such sets were made and fed to the ANN model, and
·         Other steps were identical to those presented in the section on air quality modelling.
Based on the above exercise two models were developed. Model for temperature showed much higher accuracy in prediction than wind speed (Figure 2)
The temperature forecasting results are excellent, and it demonstrates the high predictive capacity of ANN. Both for temperature and wind speed, the value of R was found significant at 1% of level of significance, and model was accepted for further analysis.
AQI Forecasting: Results and Discussion








Having established two models one fore predicting pollutant concentration based on wind speed and temperature and other model for forecasting wind speed and temperature, the values of model-computed AQI can then be estimated for all days of air quality monitoring. Figure 3 compares the AQI values based on actual measurements vis-à-vis model computed AQI.



Figure 2 Neural Modelling results for temperature and wind speed forecasting
The model does not show very high correlation between computed and observed AQI (Figure 3). The probable reason for such a behavior can be attributed high background concentrations and low values of pollutant concentration that may be fluctuating from one day to another and model is not capable of picking these subtle changes. However, the aim of this study is to develop a method, which could predict AQI into a band like good, moderate, poor, severe etc. The model strength must be tested on predictability of AQI in the right band rather than in terms of absolute values. 
Figure 4 compares the model-computed AQI band vis-a-vis observed AQI bands. Considering the AQI bands, it can be stated that the performance of the model improves and it can be used for public information as an advance warning system. Results of other seasons are not shown here. It may be mentioned that for summer and winter the model has performed better than monsoon season (Aggarwal, 2001).



                                                                                                Figure 3 Comparison of AQI predicted and Actual        
                                                                                          for Monsoon.



Figure 4. Comparison of Observed and predicted AQI Bands For Monsoon season.


References
Aggarwal Sachin (2001).  Application of Neural Network to Forecast Air Quality Index. Thesis submitted in partial fulfillment of requirements for a degree in Bachelor of Technology, April 2001.

NGO (version 2.6) (2000). Neuro-Genetic optimizer URL www.bio-comp.com

Sharma M., Sengupta, B., Shukla, B.P. and Maheshwari, M (2000). Air Quality Index for Data Interpretation and Public Information. Presented in International Conference, Centre for Science and Environment, New Delhi, June 6-8, 2000.

USEPA (1998). Federal Register Vol.63 No. 236/Wednesday, December 9, 1998

Wassermann P.D. (1989). Neural Computing theory and Practice. New York Van Nostrand Reinhold.

No comments:

Post a Comment