‘Size’ and ‘shape’ in the measurement of multivariate proximity

作者: Michael Greenacre

DOI: 10.1111/2041-210X.12776

关键词: Correspondence analysisUnivariateStatisticsBray–Curtis dissimilarityMultivariate analysisMeasure (mathematics)OrdinationSampling (statistics)MathematicsEconometricsMultivariate statistics

摘要: Summary Ordination and clustering methods are widely applied to ecological data that non-negative, for example, species abundances or biomasses. These rely on a measure of multivariate proximity quantifies differences between the sampling units (e.g. individuals, stations, time points), leading results such as: (i) ordinations units, where interpoint distances optimally display measured differences; (ii) into homogeneous clusters (iii) assessing pre-specified groups regions, periods, treatment–control groups). These all conceal fundamental question: To what extent computed according chosen function, capturing ‘size’ in observations, their ‘shape’? ‘Size’ means overall level measurements: some samples contain higher total more biomass, others less. ‘Shape’ relative levels have different abundances, i.e. compositions. answer this question, several well-known measures considered two datasets, one which is used simulation exercise ‘shape’ been eliminated by randomization. For any dataset measure, quantification achieved proportion variance capturing, as well confounds together. The consistently show Bray–Curtis coefficient incorporates both differences, varying degrees. components thus always confounded determination ordinations, clusters, group comparisons relations environmental variables. There implications these results, main being researchers should be aware issue when they choose measure. They compute particular can radically affect interpretation results. It recommended separate components: analysing other univariate methods, using analysis size has specifically excluded.

参考文章(32)
David I. Warton, Stephen T. Wright, Yi Wang, Distance‐based multivariate analyses confound location and dispersion effects Methods in Ecology and Evolution. ,vol. 3, pp. 89- 101 ,(2012) , 10.1111/J.2041-210X.2011.00127.X
Achim Zeileis, Christian Kleiber, Simon Jackman, Regression Models for Count Data in R Journal of Statistical Software. ,vol. 27, pp. 1- 25 ,(2008) , 10.18637/JSS.V027.I08
Daniel Borcard, Pierre Legendre, Pierre Drapeau, Partialling out the Spatial Component of Ecological Variation Ecology. ,vol. 73, pp. 1045- 1055 ,(1992) , 10.2307/1940179
Michael Greenacre, None, Fuzzy coding in constrained ordinations Ecology. ,vol. 94, pp. 280- 286 ,(2013) , 10.1890/12-0981.1
PJ Somerfield, Identification of the Bray-Curtis similarity index: Comment on Yoshioka (2008) Marine Ecology Progress Series. ,vol. 372, pp. 303- 306 ,(2008) , 10.3354/MEPS07841
David G. Kendall, A Survey of the Statistical Theory of Shape Statistical Science. ,vol. 4, pp. 87- 99 ,(1989) , 10.1214/SS/1177012582
Wolfgang Cramer, Håkan Hytteborn, The separation of fluctuation and long-term change in vegetation dynamics of a rising seashore Plant Ecology. ,vol. 69, pp. 157- 167 ,(1987) , 10.1007/978-94-009-4061-1_16
Michaela Aschan, Maria Fossheim, Michael Greenacre, Raul Primicerio, Change in Fish Community Structure in the Barents Sea PLoS ONE. ,vol. 8, pp. e62748- ,(2013) , 10.1371/JOURNAL.PONE.0062748
PM Yoshioka, Misidentification of the Bray-Curtis similarity index Marine Ecology Progress Series. ,vol. 368, pp. 309- 310 ,(2008) , 10.3354/MEPS07728