A Novel Method for Air Quality Data Imputation by Nuclear Norm Minimization

作者: Xiaobo Chen , Yan Xiao

DOI: 10.1155/2018/7465026

关键词:

摘要: Missing data is a frequently encountered problem in environment research community. To facilitate the analysis and management of air quality data, for example, PM2.5 concentration this study, commonly adopted strategy handling missing values samples to generate complete set using imputation methods. Many methods based on temporal or spatial correlation have been developed purpose existing literatures. The difference various lies characterizing dependence relationship with different mathematical models, which crucial imputation. In paper, we propose two novel principled nuclear norm matrix since it measures such global fashion. first method, termed as minimization (GNNM), tries impute through directly minimizing whole sample matrix, thus at same time maximizing linear samples. second called local (LNNM), concentrates more each its most similar are estimated from results method. way, can be performed those highly correlated instead GNNM, reducing adverse impact irrelevant evaluated measured every 1 h by 22 monitoring stations. simulated percentages. imputed compared ground truth evaluate performance experimental verify effectiveness our methods, especially LNNM,

参考文章(26)
M.P. Gómez-Carracedo, J.M. Andrade, P. López-Mahía, S. Muniategui, D. Prada, A practical comparison of single and multiple imputation methods to handle complex missing data in air quality datasets Chemometrics and Intelligent Laboratory Systems. ,vol. 134, pp. 23- 33 ,(2014) , 10.1016/J.CHEMOLAB.2014.02.007
Xiao Feng, Qi Li, Yajie Zhu, Junxiong Hou, Lingyan Jin, Jingjie Wang, Artificial neural networks forecasting of PM2.5 pollution using air mass trajectory based geographic model and wavelet transformation Atmospheric Environment. ,vol. 107, pp. 118- 128 ,(2015) , 10.1016/J.ATMOSENV.2015.02.030
Carlos Real, J. Ángel Fernández, Jesús R. Aboal, Alejo Carballeira, Substituting missing data in compositional analysis Environmental Pollution. ,vol. 159, pp. 2797- 2800 ,(2011) , 10.1016/J.ENVPOL.2011.05.006
Philip RC Nelson, Paul A Taylor, John F MacGregor, None, Missing data methods in PCA and PLS: Score calculations with incomplete observations Chemometrics and Intelligent Laboratory Systems. ,vol. 35, pp. 45- 65 ,(1996) , 10.1016/S0169-7439(96)00007-X
Esther-Lydia Silva-Ramírez, Rafael Pino-Mejías, Manuel López-Coello, María-Dolores Cubiles-de-la-Vega, Missing value imputation on missing completely at random data using multilayer perceptrons Neural Networks. ,vol. 24, pp. 121- 129 ,(2011) , 10.1016/J.NEUNET.2010.09.008
Xiaobo Chen, Jian Yang, Jun Liang, Qiaolin Ye, Recursive robust least squares support vector regression based on maximum correntropy criterion Neurocomputing. ,vol. 97, pp. 63- 73 ,(2012) , 10.1016/J.NEUCOM.2012.05.004
Laura Folguera, Jure Zupan, Daniel Cicerone, Jorge F Magallanes, None, Self-organizing maps for imputation of missing data in incomplete data matrices Chemometrics and Intelligent Laboratory Systems. ,vol. 143, pp. 146- 151 ,(2015) , 10.1016/J.CHEMOLAB.2015.03.002
A PLAIA, A BONDI, Single imputation method of missing values in environmental pollution data sets Atmospheric Environment. ,vol. 40, pp. 7316- 7330 ,(2006) , 10.1016/J.ATMOSENV.2006.06.040
Xiaodong Sun, Long Chen, Zebin Yang, Huangqiu Zhu, Speed-Sensorless Vector Control of a Bearingless Induction Motor With Artificial Neural Network Inverse Speed Observer IEEE-ASME Transactions on Mechatronics. ,vol. 18, pp. 1357- 1366 ,(2013) , 10.1109/TMECH.2012.2202123
Xiaobo Chen, Jian Yang, Qiaolin Ye, Jun Liang, Recursive projection twin support vector machine via within-class variance minimization Pattern Recognition. ,vol. 44, pp. 2643- 2655 ,(2011) , 10.1016/J.PATCOG.2011.03.001