Extensive Experimental Evaluation of Self-Organizing Maps for Automatic Classification of a Multi-Class Multi-Label Corpus

作者: Eleni Giannopoulou , Nikolas Mitrou

DOI: 10.1109/ACCESS.2018.2875497

关键词:

摘要: This paper aims at bridging the gap between feature selection and space size by utilizing both square non-square self-organizing maps under different configuration scenarios for classifying a multi-class multi-label corpus, Reuters Mod Apte’ split. The of is based on heuristic process finding suitable map. Vector construction simple, yet effective procedure aiming transforming vectors from to uni-label. proposed solution improves classification efficiency not only in terms accuracy but also computational resources needed time training. Extensive experiments were conducted, using configurations regarding map vector sizes, training cycles, context words, assess their impact classifier’s performance. Furthermore, an intelligent algorithm label being proposed, show that neighboring nodes affect labels specific node. According our approach achieves 10% increase Macro-Average F1 scores, $30\times $ decrease dimensionality, approximately $34\times smaller when compared baseline scenario.

参考文章(32)
Shengran Su, Zhenghui Hu, Qiang Lin, William Kongto Hau, Zhifan Gao, Heye Zhang, An artificial neural network method for lumen and media-adventitia border detection in IVUS. Computerized Medical Imaging and Graphics. ,vol. 57, pp. 29- 39 ,(2017) , 10.1016/J.COMPMEDIMAG.2016.11.003
Jundong Li, Huan Liu, Challenges of Feature Selection for Big Data Analytics IEEE Intelligent Systems. ,vol. 32, pp. 9- 15 ,(2017) , 10.1109/MIS.2017.38
Zhifan Gao, Huahua Xiong, Xin Liu, Heye Zhang, Dhanjoo Ghista, Wanqing Wu, Shuo Li, Robust estimation of carotid artery wall motion using the elasticity-based state-space approach. Medical Image Analysis. ,vol. 37, pp. 1- 21 ,(2017) , 10.1016/J.MEDIA.2017.01.004
Rui Zhao, Kezhi Mao, Fuzzy Bag-of-Words Model for Document Representation IEEE Transactions on Fuzzy Systems. ,vol. 26, pp. 794- 804 ,(2018) , 10.1109/TFUZZ.2017.2690222
Zhifan Gao, Yanjie Li, Yuanyuan Sun, Jiayuan Yang, Huahua Xiong, Heye Zhang, Xin Liu, Wanqing Wu, Dong Liang, Shuo Li, Motion Tracking of the Carotid Artery Wall From Ultrasound Image Sequences: a Nonlinear State-Space Approach IEEE Transactions on Medical Imaging. ,vol. 37, pp. 273- 283 ,(2018) , 10.1109/TMI.2017.2746879
Yufa Xia, Huailing Zhang, Lin Xu, Zhifan Gao, Heye Zhang, Huafeng Liu, Shuo Li, An Automatic Cardiac Arrhythmia Classification System With Wearable Electrocardiogram IEEE Access. ,vol. 6, pp. 16529- 16538 ,(2018) , 10.1109/ACCESS.2018.2807700
Ladislav Lenc, Pavel Král, Deep Neural Networks for Czech Multi-label Document Classification arXiv: Computation and Language. ,(2017) , 10.1007/978-3-319-75487-1_36
Julie Beth Lovins, Development of a Stemming Algorithm Mech. Transl. Comput. Linguistics. ,vol. 11, pp. 22- 31 ,(1968)
Tomáš Brychcín, Pavel Král, Novel Unsupervised Features for Czech Multi-label Document Classification mexican international conference on artificial intelligence. pp. 70- 79 ,(2014) , 10.1007/978-3-319-13647-9_8
Teuvo Kohonen, Hongbing Xing, Contextually self-organized maps of chinese words workshop on self organizing maps. pp. 16- 29 ,(2011) , 10.1007/978-3-642-21566-7_2