Towards missing electric power data imputation for energy management systems

作者: Chih-Fong Tsai , Ming-Chang Wang , Wei-Chao Lin , Wei-Chao Lin

DOI: 10.1016/J.ESWA.2021.114743

关键词:

摘要: Abstract Demand for electricity is gradually increasing in many countries. Efforts related studies have been made the application of data mining techniques over electric power development more effective energy management systems. However, one major challenge how to compensate parts collected dataset, such as consumption, voltage, or current that may be missing a specific period time. In literature, several methods employed imputation data, especially single feature value imputation. performance different types methods, i.e. statistical and machine learning multiple features has not fully explored. Moreover, variations their during summer/non-summer seasons peak/off-peak/semi-peak times investigated. this paper, five well-known processing two autoregressive integrated moving average (ARIMA) linear interpolation (LI) models, three k-nearest neighbor (K-NN), multilayer perceptron (MLP), support vector regression (SVR) compared. The experimental results, based on two-year Taiwan, show generally perform better than ones, with K-NN SVR performing best. particular, all produced higher error rates summer season non-summer seasons. (especially K-NN) are choices peak times, whereas LI) off-peak semi-peak times.

参考文章(47)
Piotr Indyk, Trevor Darrell, Gregory Shakhnarovich, Nearest-neighbor methods in learning and vision : theory and practice MIT Press. ,(2005)
Terence C. Mills, Time series techniques for economists ,(1990)
Christopher M. Bishop, Neural networks for pattern recognition ,(1995)
Heikki Junninen, Harri Niska, Kari Tuppurainen, Juhani Ruuskanen, Mikko Kolehmainen, Methods for imputation of missing values in air quality data sets Atmospheric Environment. ,vol. 38, pp. 2895- 2907 ,(2004) , 10.1016/J.ATMOSENV.2004.02.026
Rob J. Hyndman, Anne B. Koehler, Another look at measures of forecast accuracy International Journal of Forecasting. ,vol. 22, pp. 679- 688 ,(2006) , 10.1016/J.IJFORECAST.2006.03.001
Roderick JA Little, Donald B Rubin, None, Statistical Analysis with Missing Data ,(1987)
David F. Rogers, George G. Polak, Optimal Clustering of Time Periods for Electricity Demand-Side Management IEEE Transactions on Power Systems. ,vol. 28, pp. 3842- 3851 ,(2013) , 10.1109/TPWRS.2013.2252373
I. Žliobaitė, J. Hollmén, H. Junninen, Regression models tolerant to massively missing data: a case study in solar-radiation nowcasting Atmospheric Measurement Techniques. ,vol. 7, pp. 4387- 4399 ,(2014) , 10.5194/AMT-7-4387-2014
Corinna Cortes, Vladimir Vapnik, Support-Vector Networks Machine Learning. ,vol. 20, pp. 273- 297 ,(1995) , 10.1023/A:1022627411411
Mohamed Noor Norazian, Yahaya Ahmad Shukri, Ramli Nor Azam, Abdullah Mohd Mustafa Al Bakri, Estimation of missing values in air pollution data using single imputation techniques Science Society of Thailand. ,(2008) , 10.2306/SCIENCEASIA1513-1874.2008.34.341