Handling high-dimensional data in air pollution forecasting tasks

作者: Diana Domańska , Szymon Łukasik

DOI: 10.1016/J.ECOINF.2016.04.007

关键词:

摘要: In the paper methods aimed at handling high-dimensional weather forecasts data used to predict concentrations of PM10, PM2.5, SO2, NO, CO and O3 are being proposed. The procedure employed pollution normally requires historical samples for a large number points in time — particularly forecast data, actual data. Likewise, it typically involves using numerous features related atmospheric conditions. Consequently analysis such datasets generate accurate becomes very cumbersome task. examines variety unsupervised dimensionality reduction obtaining compact yet informative set features. As an alternative, approach fractional distances tasks is considered as well. Both strategies were evaluated on real-world obtained from Institute Meteorology Water Management Katowice (Poland), with extended Air Pollution Forecast Model (e-APFM) underlying prediction tool. It was found that employing distance dissimilarity measure ensures best accuracy forecasting. Satisfactory results can be also Isomap, Landmark Isomap Factor Analysis techniques. These formulate universal mapping, ready-to-use gathered different geographical areas.

参考文章(74)
Szymon Łukasik, Piotr Kulczycki, An Algorithm for Sample and Data Dimensionality Reduction Using Fast Simulated Annealing Advanced Data Mining and Applications. pp. 152- 161 ,(2011) , 10.1007/978-3-642-25853-4_12
Ángela Fernández, Ana M. González, Julia Díaz, José R. Dorronsoro, Diffusion maps for the description of meteorological data hybrid artificial intelligence systems. pp. 276- 287 ,(2012) , 10.1007/978-3-642-28942-2_25
John Platt, FastMap, MetricMap, and Landmark MDS are all Nystrom Algorithms international conference on artificial intelligence and statistics. pp. 15- ,(2005)
˙Irem Uçal Sarı, Ba¸sar Öztay¸si, Forecasting Energy Demand Using Fuzzy Seasonal Time Series Atlantis Press, Paris. pp. 251- 269 ,(2012) , 10.2991/978-94-91216-77-0_12
Ferdinand Baer, Numerical weather prediction Advances in Computers. ,vol. 52, pp. 91- 157 ,(2000) , 10.1016/S0065-2458(00)80017-0
Michael E. Houle, Hans-Peter Kriegel, Peer Kröger, Erich Schubert, Arthur Zimek, Can shared-neighbor distances defeat the curse of dimensionality? statistical and scientific database management. pp. 482- 500 ,(2010) , 10.1007/978-3-642-13818-8_34
Bogumil Jakubiak, Richard Hodur, Piotr Flatau, Marcin Witek, Oskar Kapala, Leszek Herman-Izycki, Implementation and Research on the Operational Use of the Mesoscale Prediction Model COAMPS in Poland Defense Technical Information Center. ,(2006) , 10.21236/ADA631047
Nikolaos Avouris, Elias Kalapanidas, Feature selection for air quality forecasting: a genetic algorithm approach Ai Communications. ,vol. 16, pp. 235- 251 ,(2003)
Kevin Beyer, Jonathan Goldstein, Raghu Ramakrishnan, Uri Shaft, When Is ''Nearest Neighbor'' Meaningful? international conference on database theory. pp. 217- 235 ,(1999) , 10.1007/3-540-49257-7_15