Classification of Negative Information on Socially Significant Topics in Mass Media

作者: Ravil I Mukhamediev , Kirill Yakunin , Rustam Mussabayev , Timur Buldybayev , Yan Kuchin

DOI: 10.3390/SYM12121945

关键词:

摘要: Mass media not only reflect the activities of state bodies but also shape informational context, sentiment, depth, and significance level attributed to certain initiatives social events. Multilateral quantitative (to practicable extent) assessment activity is important for understanding their objectivity, role, focus, and, ultimately, quality society’s “fourth power”. The paper proposes a method evaluating in several modalities (topics, evaluation criteria/properties, classes), combining topic modeling text corpora multiple-criteria decision making. based on an analysis as follows: conditional probability distribution by topics, properties, classes calculated after formation model corpora. Several approaches are used obtain weights that describe how each relates criterion/property class described paper, including manual high-level labeling, multi-corpora approach, automatic approach. proposed approach suggests topical asymmetry describing topic’s relationship criterion/property. These weights, combined with model, can be applied evaluate document according considered criteria classes. was corpus 804,829 news publications from 40 Kazakhstani sources published 01 January 2018 31 December 2019, classify negative information socially significant topics. A BigARTM derived (200 topics) applied, fill table analytical hierarchical process (AHP) all necessary labeling procedures. Experiments confirm general possibility using corpora, because area under receiver operating characteristics curve (ROC AUC) score 0.81 achieved classification task, which comparable results obtained same task applying BERT (Bidirectional Encoder Representations Transformers) model.

参考文章(30)
David M Blei, Andrew Y Ng, Michael I Jordan, None, Latent dirichlet allocation Journal of Machine Learning Research. ,vol. 3, pp. 993- 1022 ,(2003) , 10.5555/944919.944937
Paul Hansen, Franz Ombler, A new method for scoring additive multi‐attribute value models using pairwise rankings of alternatives Journal of Multi-criteria Decision Analysis. ,vol. 15, pp. 87- 107 ,(2008) , 10.1002/MCDA.428
Yassine Charabi, Adel Gastli, PV site suitability analysis using GIS-based spatial fuzzy multi-criteria evaluation Renewable Energy. ,vol. 36, pp. 2554- 2561 ,(2011) , 10.1016/J.RENENE.2010.10.037
Young-Jou Lai, Ting-Yun Liu, Ching-Lai Hwang, TOPSIS for MODM European Journal of Operational Research. ,vol. 76, pp. 486- 500 ,(1994) , 10.1016/0377-2217(94)90282-8
Thomas Wanderer, Stefan Herle, Creating a spatial multi-criteria decision support system for energy related integrated environmental impact assessment Environmental Impact Assessment Review. ,vol. 52, pp. 2- 8 ,(2015) , 10.1016/J.EIAR.2014.09.002
Serafim Opricovic, Gwo-Hshiung Tzeng, Extended VIKOR method in comparison with outranking methods European Journal of Operational Research. ,vol. 178, pp. 514- 529 ,(2007) , 10.1016/J.EJOR.2006.01.020
A. Tatar, P. Antoniadis, M. D. de Amorim, S. Fdida, Ranking News Articles Based on Popularity Prediction advances in social networks analysis and mining. pp. 106- 110 ,(2012) , 10.1109/ASONAM.2012.28
R.R. Yager, On ordered weighted averaging aggregation operators in multicriteria decisionmaking systems man and cybernetics. ,vol. 18, pp. 183- 190 ,(1988) , 10.1109/21.87068
Abbas Mardani, Ahmad Jusoh, Edmundas Zavadskas, Fausto Cavallaro, Zainab Khalifah, Sustainable and Renewable Energy: An Overview of the Application of Multiple Criteria Decision Making Techniques and Approaches Sustainability. ,vol. 7, pp. 13947- 13984 ,(2015) , 10.3390/SU71013947