Evaluating presence-absence models in ecology: the need to account for prevalence

作者: Stéphanie Manel , H. Ceri Williams , S.J. Ormerod

DOI: 10.1046/J.1365-2664.2001.00647.X

关键词: MathematicsTest dataMode (statistics)Environmental dataCohen's kappaKappaSampling (statistics)StatisticLogistic regressionEcology

摘要: 1. Models for predicting the distribution of organisms from environmental data are widespread in ecology and conservation biology. Their performance is invariably evaluated percentage success at occurrence test locations. 2. Using logistic regression with real 34 families aquatic invertebrates 180 Himalayan streams, we illustrate how this measure predictive accuracy affected systematically by prevalence (i.e. frequency occurrence) target organism. Many evaluations presence-absence models ecologists inherently misleading. 3. With same invertebrate models, examined alternative measures used remote sensing medical diagnostics. We particularly explored receiver-operating characteristic (ROC) plots, which were derived (i) area under each curve (AUC), considered an effective indicator model independent threshold probability presence organism accepted, (ii) optimized thresholds that maximize true absences presences correctly identified. also Cohen's kappa, a proportion all possible cases or absence predicted after accounting chance effects. 4. AUC ROC plots prevalence, but highly significantly correlated much more easily computed kappa. Moreover, when applied mode to data, erroneously overestimated among scarcer organisms, often those greatest interest. advocate caution using methods optimize required prediction. 5. Our strongest recommendation reduce their reliance on prediction as modelling. kappa provides simple, effective, standardized appropriate statistic evaluating comparing even based different statistical algorithms. None tests significance accuracy, identify priority research development.

参考文章(63)
E. Arthur Robertson, Mark H. Zweig, Andre C. Van Steirteghem, Evaluating the clinical efficacy of laboratory tests. American Journal of Clinical Pathology. ,vol. 79, pp. 78- 86 ,(1983) , 10.1093/AJCP/79.1.78
Ty Andrew Ridenour, Andrew C. Heath, A note on issues in meta-analysis for behavioral genetic studies using categorical phenotypes. Behavior Genetics. ,vol. 29, pp. 155- 162 ,(1999) , 10.1023/A:1021635800770
P. J. Cheek, P. McCullagh, J. A. Nelder, Generalized Linear Models, 2nd Edn. Applied Statistics. ,vol. 39, pp. 385- 386 ,(1990) , 10.2307/2347392
Alan Fielding, Machine Learning Methods for Ecological Applications Kluwer Academic Publishers. ,(1999) , 10.1007/978-1-4615-5289-5
G. M. Foody, On the compensation for chance agreement in image classification accuracy assessment Photogrammetric Engineering and Remote Sensing. ,vol. 58, pp. 1459- 1460 ,(1992)
M H Zweig, E A Robertson, Why we need better test evaluations. Clinical Chemistry. ,vol. 28, pp. 1272- 1276 ,(1982) , 10.1093/CLINCHEM/28.6.1272
T. P. Milsom, S. D. Langton, W. K. Parkin, S. Peel, J. D. Bishop, J. D. Hart, N. P. Moore, Habitat models of bird species' distribution: an aid to the management of coastal grazing marshes. Journal of Applied Ecology. ,vol. 37, pp. 706- 727 ,(2000) , 10.1046/J.1365-2664.2000.00529.X