LaSVM-based big data learning system for dynamic prediction of air pollution in Tehran.

Z Ghaemi, A Alimohammadi, M Farnaghi
Author Information
  1. Z Ghaemi: Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, No. 1346, ValiAsr Street, Mirdamad cross, Tehran, 19967-15433, Iran.
  2. A Alimohammadi: Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, No. 1346, ValiAsr Street, Mirdamad cross, Tehran, 19967-15433, Iran.
  3. M Farnaghi: Faculty of Geodesy and Geomatics Engineering, K.N. Toosi University of Technology, No. 1346, ValiAsr Street, Mirdamad cross, Tehran, 19967-15433, Iran. farnaghi@kntu.ac.ir. ORCID

Abstract

Due to critical impacts of air pollution, prediction and monitoring of air quality in urban areas are important tasks. However, because of the dynamic nature and high spatio-temporal variability, prediction of the air pollutant concentrations is a complex spatio-temporal problem. Distribution of pollutant concentration is influenced by various factors such as the historical pollution data and weather conditions. Conventional methods such as the support vector machine (SVM) or artificial neural networks (ANN) show some deficiencies when huge amount of streaming data have to be analyzed for urban air pollution prediction. In order to overcome the limitations of the conventional methods and improve the performance of urban air pollution prediction in Tehran, a spatio-temporal system is designed using a LaSVM-based online algorithm. Pollutant concentration and meteorological data along with geographical parameters are continually fed to the developed online forecasting system. Performance of the system is evaluated by comparing the prediction results of the Air Quality Index (AQI) with those of a traditional SVM algorithm. Results show an outstanding increase of speed by the online algorithm while preserving the accuracy of the SVM classifier. Comparison of the hourly predictions for next coming 24 h, with those of the measured pollution data in Tehran pollution monitoring stations shows an overall accuracy of 0.71, root mean square error of 0.54 and coefficient of determination of 0.81. These results are indicators of the practical usefulness of the online algorithm for real-time spatial and temporal prediction of the urban air quality.

Keywords

References

  1. Environ Health Perspect. 2001 Jun;109(6):633-9 [PMID: 11445519]
  2. Chemosphere. 2005 Apr;59(5):693-701 [PMID: 15792667]
  3. Lancet. 2002 Oct 19;360(9341):1233-42 [PMID: 12401268]
  4. J Air Waste Manag Assoc. 2007 Aug;57(8):893-900 [PMID: 17824279]
  5. Environ Int. 2008 Jul;34(5):592-8 [PMID: 18237781]
  6. Hong Kong Med J. 2000 Dec;6(4):390-8 [PMID: 11177161]
  7. Sci Total Environ. 2015 Feb 1;505:1202-12 [PMID: 25461118]
  8. J Air Waste Manag Assoc. 2013 Jul;63(7):755-63 [PMID: 23926845]
  9. Neural Netw. 2004 Dec;17(10):1467-75 [PMID: 15541948]
  10. Sci Total Environ. 2011 Nov 15;409(24):5517-23 [PMID: 21962560]
  11. Sci Total Environ. 2003 Sep 1;313(1-3):1-13 [PMID: 12922056]
  12. Int J Environ Health Res. 2004 Aug;14(4):307-13 [PMID: 15369995]

MeSH Term

Air Pollutants
Air Pollution
Environmental Monitoring
Environmental Pollution
Forecasting
Iran
Neural Networks, Computer
Particulate Matter
Support Vector Machine
Weather

Chemicals

Air Pollutants
Particulate Matter

Word Cloud

Created with Highcharts 10.0.0airpredictionpollutiondataurbanTehransystemonlinealgorithmqualityspatio-temporalSVM0monitoringdynamicpollutantconcentrationmethodsshowLaSVM-basedresultsaccuracyDuecriticalimpactsareasimportanttasksHowevernaturehighvariabilityconcentrationscomplexproblemDistributioninfluencedvariousfactorshistoricalweatherconditionsConventionalsupportvectormachineartificialneuralnetworksANNdeficiencieshugeamountstreaminganalyzedorderovercomelimitationsconventionalimproveperformancedesignedusingPollutantmeteorologicalalonggeographicalparameterscontinuallyfeddevelopedforecastingPerformanceevaluatedcomparingAirQualityIndexAQItraditionalResultsoutstandingincreasespeedpreservingclassifierComparisonhourlypredictionsnextcoming24 hmeasuredstationsshowsoverall71rootmeansquareerror54coefficientdetermination81indicatorspracticalusefulnessreal-timespatialtemporalbiglearningBigLaSVMOnlineSpatio-temporalUrban

Similar Articles

Cited By