Chinese semantic document classification based on strategies of semantic similarity computation and correlation analysis

作者: Shuo Yang , Ran Wei , Jingzhi Guo , Hengliang Tan

DOI: 10.1016/J.WEBSEM.2020.100578

关键词: AmbiguityInformation systemSemantic similarityWord embeddingSemantic analysis (machine learning)Artificial intelligenceSynonym (database)Computer sciencePolysemyNatural language processingDocument classification

摘要: … words and multi-scene characteristics of synonyms simultaneously. … context-free, any document composed of iids would also be … Then, we compare the proposed strategies with several …

参考文章(57)
Maosong Sun, Fanchao Qi, Chenghao Yang, Zhendong Dong, Qiang Dong, Zhiyuan Liu, OpenHowNet: An Open Sememe-based Lexical Knowledge Base. arXiv: Computation and Language. ,(2019)
Hao Tian, Hua Wu, Yu Sun, Xin Tian, Danxiang Zhu, Shuohuan Wang, Han Zhang, Yukun Li, Shikun Feng, Xuyi Chen, ERNIE: Enhanced Representation through Knowledge Integration arXiv: Computation and Language. ,(2019)
Yoon Kim, Convolutional Neural Networks for Sentence Classification arXiv: Computation and Language. ,(2014)
Ilya Sutskever, Tomas Mikolov, Greg Corrado, Kai Chen, Jeffrey Dean, Distributed Representations of Words and Phrases and their Compositionality arXiv: Computation and Language. ,(2013)
M. Thangaraj, M Sivakami, Text Classification Techniques: A Literature Review Interdisciplinary Journal of Information, Knowledge, and Management. ,vol. 13, pp. 117- 135 ,(2018) , 10.28945/4066
Zhibiao Wu, Martha Palmer, None, Verb Semantics and Lexical Selection arXiv: Computation and Language. ,(1994)
Sosuke Kobayashi, Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers). ,vol. 2, pp. 452- 457 ,(2018) , 10.18653/V1/N18-2072
Renato Bruni, Gianpiero Bianchi, Website categorization: A formal approach and robustness analysis in the case of e-commerce detection Expert Systems With Applications. ,vol. 142, pp. 113001- ,(2020) , 10.1016/J.ESWA.2019.113001
Christof Monz, Arianna Bisazza, Marzieh Fadaee, Data Augmentation for Low-Resource Neural Machine Translation meeting of the association for computational linguistics. ,vol. 2, pp. 567- 573 ,(2017) , 10.18653/V1/P17-2090