Tests for consistent measurement of external subjective software quality attributes

作者: John Moses , Malcolm Farrow

DOI: 10.1007/S10664-007-9058-0

关键词:

摘要: One reason that researchers may wish to demonstrate an external software quality attribute can be measured consistently is so they validate a prediction system for the attribute. However, attempts at validating systems subjective attributes have tended rely on experts indicating values provided by informally agree with experts' intuition about These are undertaken without pre-defined scale which it known consistently. Consequently, valid unbiased estimate of predictive capability cannot given because measurement process not independent system's values. Usually, no justification checking see if measure It seems assumed that: isn't proper or quantified one knows true anyway and estimated. even though classification systems' artefacts' subjective, possible quantify measurements in terms conditional probabilities. then possible, using statistical approach, assess formally whether considered consistent. If consistent, also identify estimates values, system. used In this paper we use Bayesian inference, Markov chain Monte Carlo simulation missing data imputation develop tests consistent ordinal attributes.

参考文章(54)
A. P. Dawid, A. M. Skene, Maximum Likelihood Estimation of Observer Error‐Rates Using the EM Algorithm Journal of The Royal Statistical Society Series C-applied Statistics. ,vol. 28, pp. 20- 28 ,(1979) , 10.2307/2346806
Introducing Markov chain Monte Carlo Markov Chain Monte Carlo in Practice. pp. 19- 38 ,(1995) , 10.1201/B14835-6
S. W. Coniam, A. W. Diamond, Practical Pain Management ,(1995)
Henry Ely Kyburg, Theory and Measurement ,(1984)
Jarrett Rosenberg, Problems and Prospects in Quantifying Software Maintainability Empirical Software Engineering. ,vol. 2, pp. 173- 177 ,(1997) , 10.1023/A:1009761520228
Tomàs Aluja, Book review: Multiple correspondence analysis and related methods. Greenacre, M. and Blasius, J. Chapman & Hall/CRC, 2006. Sort-statistics and Operations Research Transactions. ,vol. 30, ,(2006)
Joseph L. Fleiss, Measuring nominal scale agreement among many raters. Psychological Bulletin. ,vol. 76, pp. 378- 382 ,(1971) , 10.1037/H0031619