Hierarchical feature selection based on relative dependency for gear fault diagnosis

Authors: Mariela Cerrada, René-Vinicio Sánchez, Fannia Pacheco, Diego Cabrera, Grover Zurita

DOI: 10.1007/S10489-015-0725-3

Abstract: Feature selection is an important aspect under study in machine learning based diagnosis; it aims to remove irrelevant features so that the diagnostic systems reach good performance. The behaviour of diagnostic models can be sensitive to the number of features, and the significant features can represent the problem better than the entire set. Consequently, algorithms that identify these significant features are valuable contributions. This work deals with the feature selection problem through attribute clustering. The proposed algorithm is inspired by existing approaches, where the relative dependency between attributes is used to calculate dissimilarity values. The centroids of the created clusters are selected as representative attributes. The selection algorithm uses a random process for proposing centroid candidates; in this way, the exploration inherent in random search is included. A hierarchical procedure is proposed for implementing this algorithm. In each level of the hierarchy, the entire set of available attributes is split into disjoint sets and the selection process is applied on each subset. Once the significant attributes are proposed for each subset, a new set of available attributes is created and the selection process runs again in the next level. The hierarchical implementation aims to refine the search space in each level on a reduced set of selected attributes, while the computational time consumption is also improved. The approach is tested with real data collected from a test bed, and the results show that the diagnosis precision using a Random Forest classifier is over 98 % with only 12
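
The hierarchical procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not give the dissimilarity formula, the clustering cost, or any parameter values, so everything below is an assumption made for illustration: the pairwise measure (one minus the mean of the two equivalence-class ratios), the cost used to rank random centroid candidates, the stopping rule, and the names `n_classes`, `dissimilarity`, `select_centroids`, `hierarchical_select`, `group_size`, `k`, `target` and `trials`. Attributes are assumed to be already discretized.

```python
import random
from typing import Dict, List, Sequence

Column = Sequence[int]  # one discretized attribute, one value per sample


def n_classes(*cols: Column) -> int:
    """Count distinct value tuples (equivalence classes) over the given columns."""
    return len(set(zip(*cols)))


def dissimilarity(a: Column, b: Column) -> float:
    """ASSUMED measure: 1 minus the mean of the two pairwise dependency ratios.

    Each ratio is |classes(x)| / |classes(a, b)|, which lies in (0, 1].
    Identical attributes therefore get dissimilarity 0.
    """
    joint = n_classes(a, b)
    return 1.0 - 0.5 * (n_classes(a) + n_classes(b)) / joint


def select_centroids(data: Dict[str, Column], names: List[str], k: int,
                     trials: int, rng: random.Random) -> List[str]:
    """Random search: propose k centroid candidates per trial, keep the cheapest."""
    if len(names) <= k:
        return list(names)
    best: List[str] = []
    best_cost = float("inf")
    for _ in range(trials):
        cand = rng.sample(names, k)
        # ASSUMED cost: each attribute is charged its distance to the nearest candidate.
        cost = sum(min(dissimilarity(data[n], data[c]) for c in cand) for n in names)
        if cost < best_cost:
            best, best_cost = cand, cost
    return best


def hierarchical_select(data: Dict[str, Column], group_size: int = 4, k: int = 2,
                        target: int = 3, trials: int = 20, seed: int = 0) -> List[str]:
    """Split the available attributes into disjoint groups, keep each group's
    centroids, and repeat on the survivors until at most `target` remain."""
    rng = random.Random(seed)
    available = list(data)
    while len(available) > target:
        rng.shuffle(available)
        groups = [available[i:i + group_size]
                  for i in range(0, len(available), group_size)]
        selected = [c for g in groups
                    for c in select_centroids(data, g, k, trials, rng)]
        if len(selected) == len(available):  # no further reduction possible
            break
        available = selected
    return available


# Toy data (hypothetical): 8 discretized attributes observed on 6 samples.
toy = {
    "f1": [0, 0, 1, 1, 2, 2], "f2": [0, 0, 1, 1, 2, 2],
    "f3": [0, 1, 0, 1, 0, 1], "f4": [1, 0, 1, 0, 1, 0],
    "f5": [0, 0, 0, 1, 1, 1], "f6": [0, 1, 2, 0, 1, 2],
    "f7": [0, 0, 0, 0, 0, 1], "f8": [2, 2, 1, 1, 0, 0],
}
print(hierarchical_select(toy))
```

Splitting into small disjoint groups keeps the number of pairwise dissimilarity evaluations per level low, which matches the abstract's remark that the hierarchy also reduces computation time. In a full pipeline, the surviving attributes would then be passed to a classifier such as a Random Forest.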
