Statistical analysis of co-occurrence patterns in microbial presence-absence datasets.

Authors: Kumar P Mainali, Sharon Bewick, Peter Thielen, Thomas Mehoke, Florian P Breitwieser

DOI: 10.1371/JOURNAL.PONE.0187132

Keywords: Jaccard index, Spurious relationship, Statistics, Co-occurrence, Correlation coefficient, Null model, Rare species, Macroecology, Similarity (network science), Biology

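Abstract: Drawing on a long history in macroecology, correlation analysis of microbiome datasets is becoming common practice for identifying relationships or shared ecological niches among bacterial taxa. However, many of the statistical issues that plague such analyses in macroscale communities remain unresolved for microbial communities. Here, we discuss problems in the analysis of microbial species correlations based on presence-absence data. We focus on presence-absence data because this information is more readily obtainable from sequencing studies, especially whole-genome sequencing, where abundance estimation is still in its infancy. First, we show how Pearson’s correlation coefficient (r) and Jaccard’s index (J), two of the most common metrics for correlation analysis of presence-absence data, can contradict each other when applied to a typical microbiome dataset. In our dataset, for example, 14% of species-pairs predicted to be significantly correlated by r were not predicted to be significantly correlated using J, while 37.4% of species-pairs predicted to be significantly correlated by J were not predicted to be significantly correlated using r. Mismatch was particularly common among species-pairs with at least one rare species (<10% prevalence), explaining why r and J might differ more strongly in microbiome datasets, where there are large numbers of rare taxa. Indeed, 74% of all species-pairs in our study had at least one rare species. Next, we show how Pearson’s correlation coefficient can result in artificial inflation of positive taxon relationships and how this is a particular problem for microbiome studies. We then illustrate how Jaccard’s index of similarity (J) can yield improvements over Pearson’s correlation coefficient. However, the standard null model for Jaccard’s index is flawed, and thus introduces its own set of spurious conclusions. We thus identify a better null model based on a hypergeometric distribution, which appropriately corrects for species prevalence. This model is available from recent statistics literature, and can be used for evaluating the significance of any value of an empirically observed Jaccard’s index. The resulting simple, yet effective method for handling correlation analysis of microbial presence-absence data provides a robust means of testing and finding relationships and/or shared environmental responses among microbial taxa.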
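The comparison described in the abstract can be illustrated with a minimal Python sketch. It computes Pearson's r and Jaccard's J for two presence-absence vectors, then evaluates the observed co-occurrence count against a hypergeometric null that conditions on each species' prevalence. The function names and the toy data are hypothetical, chosen only for illustration, and the one-sided upper-tail test shown here is an assumption about how such a null could be applied, not the authors' exact procedure.

```python
from math import comb, sqrt


def jaccard(x, y):
    """Jaccard's J: samples with both species / samples with at least one."""
    both = sum(1 for a, b in zip(x, y) if a and b)
    either = sum(1 for a, b in zip(x, y) if a or b)
    return both / either if either else 0.0


def pearson_binary(x, y):
    """Pearson's r on 0/1 vectors (equivalent to the phi coefficient).

    Undefined (division by zero) if a species is present in all samples
    or in none; this sketch does not handle that case.
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def hypergeom_upper_tail(n_samples, n_a, n_b, k):
    """P(co-occurrences >= k) when species A occupies n_a of n_samples
    samples, species B occupies n_b, and placements are independent."""
    total = comb(n_samples, n_b)
    upper = min(n_a, n_b)
    favourable = sum(
        comb(n_a, i) * comb(n_samples - n_a, n_b - i)
        for i in range(k, upper + 1)
    )
    return favourable / total


# Toy data (hypothetical): 20 samples, two rare species (15% prevalence each)
# that share two samples.
species_a = [1, 1, 1, 0] + [0] * 16
species_b = [1, 1, 0, 1] + [0] * 16

shared = sum(a and b for a, b in zip(species_a, species_b))
j_value = jaccard(species_a, species_b)
r_value = pearson_binary(species_a, species_b)
p_value = hypergeom_upper_tail(len(species_a), sum(species_a), sum(species_b), shared)

print(f"J = {j_value:.3f}, r = {r_value:.3f}, hypergeometric p = {p_value:.4f}")
```

Because the hypergeometric tail depends on the prevalence of both species, the same value of J can be unremarkable for two common species and highly unlikely for two rare ones, which is the correction for prevalence that the abstract refers to.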
