Rare Event Analysis of High Dimensional Building Operational Data Using Data Mining Techniques

作者: Cheng Fan , Fu Xiao , Shengwei Wang , None

DOI:

关键词:

摘要: Today’s building automation systems (BASs) are becoming increasingly complex. A typical BAS usually stores hundreds of sensor measurements and control signals at each time step, which produces massive high dimensional data sets. Traditional analysis methods for only focus on a small subset the data, resulting in huge information loss. Data mining techniques more effective knowledge extraction data. This study develops holistic methodology analyzing using advanced techniques, with aim identifying rare events operation. Rare event helps to identify atypical operating patterns, detect diagnose faults, eventually improve operational performance. Two main challenges exist performing i.e. dimensionality complexity system The former results that conventional analytics, such as distance-based measures, lose their effectiveness, later negatively influences robustness reliability identification events. proposed method is specially designed tackle these by integrating power techniques. It consists four steps, i.e., preparation, detection, diagnosis, post-mining. adopted analyze tallest Hong Kong. successfully detected diagnosed, providing clues enhance

参考文章(22)
Oded Z. Maimon, Lior Rokach, Data Mining and Knowledge Discovery Handbook, 2nd ed ,(2010)
Arthur Zimek, Hans-Peter Kriegel, Erich Schubert, Peer Kröger, Interpreting and Unifying Outlier Scores siam international conference on data mining. pp. 13- 24 ,(2011)
Hans-Peter Kriegel, Peer Kröger, Erich Schubert, Arthur Zimek, Outlier Detection in Axis-Parallel Subspaces of High Dimensional Data Advances in Knowledge Discovery and Data Mining. pp. 831- 838 ,(2009) , 10.1007/978-3-642-01307-2_86
Jerome H. Friedman, Greedy function approximation: A gradient boosting machine. Annals of Statistics. ,vol. 29, pp. 1189- 1232 ,(2001) , 10.1214/AOS/1013203451
Carolin Strobl, Anne-Laure Boulesteix, Achim Zeileis, Torsten Hothorn, Bias in random forest variable importance measures: Illustrations, sources and a solution BMC Bioinformatics. ,vol. 8, pp. 25- 25 ,(2007) , 10.1186/1471-2105-8-25
M.R. Amin-Naseri, A.R. Soroush, Combined use of unsupervised and supervised learning for daily peak load forecasting Energy Conversion and Management. ,vol. 49, pp. 1302- 1308 ,(2008) , 10.1016/J.ENCONMAN.2008.01.016
Hans-Peter Kriegel, Matthias S hubert, Arthur Zimek, Angle-based outlier detection in high-dimensional data Proceeding of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD 08. pp. 444- 452 ,(2008) , 10.1145/1401890.1401946
Arthur Zimek, Matthew Gaudet, Ricardo J.G.B. Campello, Jörg Sander, Subsampling for efficient and effective unsupervised outlier detection ensembles knowledge discovery and data mining. pp. 428- 436 ,(2013) , 10.1145/2487575.2487676
Bing Dong, Cheng Cao, Siew Eang Lee, Applying support vector machines to predict building energy consumption in tropical region Energy and Buildings. ,vol. 37, pp. 545- 553 ,(2005) , 10.1016/J.ENBUILD.2004.09.009