Evaluation Evaluation: a Monte Carlo study

Author: David Martin Powers

Keywords: Conditional probability; Monte Carlo method; Recall; Common value auction; Noun; Cohen's kappa; Statistics; Computational linguistics; Computer science

Abstract: … Powers [4] also derived an unbiased accuracy measure to avoid the bias of Recall, Precision and Accuracy due to population Prevalence and label bias. The Bookmaker algorithm …
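
The prevalence bias mentioned in the abstract can be illustrated with a small sketch. It assumes the two-class case and takes the unbiased measure to be Informedness, computed as Recall + Inverse Recall - 1; the excerpt itself does not give the formula, and the function names and counts below are hypothetical, chosen only for illustration.

```python
def scores(tp, fn, fp, tn):
    """Return (recall, precision, accuracy, informedness) for a 2x2 contingency table.

    Assumption: informedness is taken as recall + inverse recall - 1
    (two-class case); this formula is not stated in the excerpt.
    """
    recall = tp / (tp + fn)              # true positive rate
    inverse_recall = tn / (tn + fp)      # true negative rate
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fn + fp + tn)
    informedness = recall + inverse_recall - 1
    return recall, precision, accuracy, informedness


def always_positive(positives, negatives):
    """Counts (tp, fn, fp, tn) for a classifier that labels every case positive."""
    return positives, 0, negatives, 0


# The same uninformed classifier, evaluated at two prevalence levels.
for positives, negatives in [(500, 500), (900, 100)]:
    r, p, a, i = scores(*always_positive(positives, negatives))
    print(f"prevalence={positives / (positives + negatives):.1f} "
          f"recall={r:.2f} precision={p:.2f} accuracy={a:.2f} informedness={i:.2f}")
```

Under these assumptions the always-positive classifier scores higher on Precision and Accuracy as positives become more common, although it never uses any information about the cases, while Informedness stays at zero in both runs.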

References (9)
David R. Shanks. Is human learning rational? Quarterly Journal of Experimental Psychology, vol. 48, pp. 257-279 (1995). doi:10.1080/14640749508401390
John S. Uebersax. Diversity of decision-making models and the measurement of interrater agreement. Psychological Bulletin, vol. 101, pp. 140-146 (1987). doi:10.1037/0033-2909.101.1.140
Jacob Cohen. A coefficient of agreement for nominal scales. Educational and Psychological Measurement, vol. 20, pp. 37-46 (1960). doi:10.1177/001316446002000104
T. P. Hutchinson. Kappa muddles together two sources of disagreement: Tetrachoric correlation is preferable. Research in Nursing & Health, vol. 16, pp. 313-316 (1993). doi:10.1002/NUR.4770160410
Pierre Perruchet, Ronald Peereman. The exploitation of distributional information in syllable processing. Journal of Neurolinguistics, vol. 17, pp. 97-119 (2004). doi:10.1016/S0911-6044(03)00059-9
Douglas G. Bonett, Robert M. Price. Inferential methods for the tetrachoric correlation coefficient. Journal of Educational and Behavioral Statistics, vol. 30, pp. 213-225 (2005). doi:10.3102/10769986030002213
Peter A. Flach. The geometry of ROC space: understanding machine learning metrics through ROC isometrics. International Conference on Machine Learning, pp. 194-201 (2003).
John D. Lafferty, Andrew McCallum, Fernando C. N. Pereira. Conditional random fields: probabilistic models for segmenting and labeling sequence data. International Conference on Machine Learning, pp. 282-289 (2001).
Jean Carletta. Assessing agreement on classification tasks: the kappa statistic. Computational Linguistics, vol. 22, pp. 249-254 (1996).