Mash-based analyses of Escherichia coli genomes reveal 14 distinct phylogroups

Authors: Michael S. Robeson, David W. Ussery, Trudy M. Wassenaar, Zulema Udaondo, Visanu Wanchai

DOI: 10.1038/S42003-020-01626-5

Keywords:

Abstract: In this study, more than one hundred thousand Escherichia coli and Shigella genomes were examined and classified. This is, to our knowledge, the largest E. coli genome dataset analyzed to date. A Mash-based analysis of a cleaned set of 10,667 E. coli genomes from GenBank revealed 14 distinct phylogroups. A representative genome, or medoid, identified for each phylogroup was used as a proxy to classify 95,525 unassembled genomes from the Sequence Read Archive (SRA). We find that most of the sequenced E. coli genomes belong to four phylogroups (A, C, B1 and E2(O157)). Authenticity of the 14 phylogroups is supported by several different lines of evidence: phylogroup-specific core genes, a phylogenetic tree constructed with 2613 single copy core genes, and differences in the rates of gene gain/loss/duplication. The methodology used in this work is able to reproduce known phylogroups, as well as to identify previously uncharacterized phylogroups in E. coli species.
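The abstract describes choosing a medoid genome for each phylogroup and using it as a proxy to classify further genomes. The sketch below illustrates that general idea only; it is not the authors' pipeline. The function names (`medoid`, `assign_phylogroup`), the genome labels, and all distance values are hypothetical and invented for illustration. In practice the pairwise distances would come from a tool such as Mash.

```python
from typing import Dict, Sequence


def medoid(members: Sequence[str], dist: Dict[str, Dict[str, float]]) -> str:
    """Return the member whose summed distance to all members is smallest."""
    return min(members, key=lambda g: sum(dist[g][h] for h in members))


def assign_phylogroup(dist_to_medoids: Dict[str, float]) -> str:
    """Assign a query genome to the phylogroup with the nearest medoid."""
    return min(dist_to_medoids, key=dist_to_medoids.get)


# Toy symmetric distance matrix for three genomes in one group (made-up values).
toy_dist = {
    "g1": {"g1": 0.00, "g2": 0.01, "g3": 0.03},
    "g2": {"g1": 0.01, "g2": 0.00, "g3": 0.02},
    "g3": {"g1": 0.03, "g2": 0.02, "g3": 0.00},
}
group_medoid = medoid(["g1", "g2", "g3"], toy_dist)

# Toy distances from one query genome to two phylogroup medoids (made-up values).
query_group = assign_phylogroup({"A": 0.020, "B1": 0.005})
```

Comparing each query against one medoid per phylogroup, rather than against every reference genome, keeps the number of distance computations per query equal to the number of phylogroups.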

References (53)
Elizabeth W. Alm, Seth T. Walk, David M. Gordon. The Niche of Escherichia coli. ASM Press, pp. 69-89 (2011). DOI: 10.1128/9781555817114.CH6
Louis-Marie Bobay, Charles C. Traverse, Howard Ochman. Impermanence of bacterial clones. Proceedings of the National Academy of Sciences of the United States of America, vol. 112, pp. 8893-8900 (2015). DOI: 10.1073/PNAS.1501724112
Anja Struyf, Mia Hubert, Peter Rousseeuw. Clustering in an Object-Oriented Environment. Journal of Statistical Software, vol. 1, pp. 1-30 (1997). DOI: 10.18637/JSS.V001.I04
David M. Gordon, Olivier Clermont, Heather Tolley, Erick Denamur. Assigning Escherichia coli strains to phylogenetic groups: multi-locus sequence typing versus the PCR triplex method. Environmental Microbiology, vol. 10, pp. 2484-2496 (2008). DOI: 10.1111/J.1462-2920.2008.01669.X
Zulema Udaondo, Lázaro Molina, Ana Segura, Estrella Duque, Juan L. Ramos. Analysis of the core genome and pangenome of Pseudomonas putida. Environmental Microbiology, vol. 18, pp. 3268-3283 (2016). DOI: 10.1111/1462-2920.13015
Ruiting Lan, Peter R. Reeves. Escherichia coli in disguise: molecular origins of Shigella. Microbes and Infection, vol. 4, pp. 1125-1132 (2002). DOI: 10.1016/S1286-4579(02)01637-4
Olivier Tenaillon, David Skurnik, Bertrand Picard, Erick Denamur. The population genetics of commensal Escherichia coli. Nature Reviews Microbiology, vol. 8, pp. 207-217 (2010). DOI: 10.1038/NRMICRO2298
G. E. Sims, S.-H. Kim. Whole-genome phylogeny of Escherichia coli/Shigella group by feature frequency profiles (FFPs). Proceedings of the National Academy of Sciences of the United States of America, vol. 108, pp. 8329-8334 (2011). DOI: 10.1073/PNAS.1105168108
N. K. Petty, N. L. Ben Zakour, M. Stanton-Cook, E. Skippington, M. Totsika, B. M. Forde, M.-D. Phan, D. Gomes Moriel, K. M. Peters, M. Davies, B. A. Rogers, G. Dougan, J. Rodriguez-Bano, A. Pascual, J. D. D. Pitout, M. Upton, D. L. Paterson, T. R. Walsh, M. A. Schembri, S. A. Beatson. Global dissemination of a multidrug resistant Escherichia coli clone. Proceedings of the National Academy of Sciences of the United States of America, vol. 111, pp. 5694-5699 (2014). DOI: 10.1073/PNAS.1322678111
Jerónimo Rodríguez-Beltrán, Jérôme Tourret, Olivier Tenaillon, Elena López, Emmanuelle Bourdelier, Coloma Costas, Ivan Matic, Erick Denamur, Jesús Blázquez. High Recombinant Frequency in Extraintestinal Pathogenic Escherichia coli Strains. Molecular Biology and Evolution, vol. 32, pp. 1708-1716 (2015). DOI: 10.1093/MOLBEV/MSV072