Applying clustering to analyze opinion diversity

作者: Mohammad Mahdi Hassan , Martin Blom

DOI: 10.1145/2745802.2745809

关键词:

摘要: In empirical software engineering research there is an increased use of questionnaires and surveys to collect information from practitioners. Typically, such data then analyzed based on overall, descriptive statistics. Even though this can capture the general trends a risk that opinions different (minority) sub-groups are lost. Here we propose clustering segment respondents so more detailed analysis be achieved. Our findings suggest it give better insight about survey population participants' opinions. This partitioning approach show precisely extent opinion differences between groups. also gives opportunity for minorities heard. Through process significant new may obtained. our example study regarding state testing requirement activities in industry, found several groups showed overall conclusion.

参考文章(27)
Charles X. Ling, Chenghui Li, Data mining for direct marketing: problems and solutions knowledge discovery and data mining. pp. 73- 79 ,(1998)
M. Hall, Correlation-based Feature Selection for Machine Learning PhD Thesis, Waikato Univer-sity. ,(1998)
Michael Gamon, Anthony Aue, Simon Corston-Oliver, Eric Ringger, Pulse: mining customer opinions from free text intelligent data analysis. ,vol. 3646, pp. 121- 132 ,(2005) , 10.1007/11552253_12
Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi, Toshikazu Fukushima, Mining product reputations on the Web Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '02. pp. 341- 349 ,(2002) , 10.1145/775047.775098
J. A. Hartigan, M. A. Wong, A K-Means Clustering Algorithm Journal of The Royal Statistical Society Series C-applied Statistics. ,vol. 28, pp. 100- 108 ,(1979) , 10.2307/2346830
John Moses, Malcolm Farrow, Tests for consistent measurement of external subjective software quality attributes Empirical Software Engineering. ,vol. 13, pp. 261- 287 ,(2008) , 10.1007/S10664-007-9058-0
T.K. Moon, The expectation-maximization algorithm IEEE Signal Processing Magazine. ,vol. 13, pp. 47- 60 ,(1996) , 10.1109/79.543975
Ron Kohavi, George H. John, Wrappers for feature subset selection Artificial Intelligence. ,vol. 97, pp. 273- 324 ,(1997) , 10.1016/S0004-3702(97)00043-X
John Moses, Benchmarking quality measurement Software Quality Journal. ,vol. 15, pp. 449- 462 ,(2007) , 10.1007/S11219-007-9025-4