A model-based clustering method to detect infectious disease transmission outbreaks from sequence variation

Authors: Rosemary M McCloskey, Art FY Poon

DOI: 10.1101/165357

Keywords: Data mining, Cluster analysis, Infectious disease transmission, Model based clustering, Biology, Outbreak, Nonparametric statistics, Infectious disease (medical specialty), CURE data clustering algorithm, Sequence variation

Abstract: Clustering infections by genetic similarity is a popular technique for identifying potential outbreaks of infectious disease, in part because sequences are now routinely collected for clinical management of many infectious diseases. A diverse number of nonparametric clustering methods have been developed for this purpose. These methods are generally intuitive, rapid to compute, and readily scale with large data sets. However, we found that nonparametric clustering methods can be biased towards identifying clusters of diagnosis (where individuals are sampled sooner post-infection) rather than the clusters of rapid transmission that are meant to be the foci of public health efforts. We develop a fundamentally new approach to genetic clustering based on fitting a Markov-modulated Poisson process (MMPP), which represents the evolution of transmission rates along the tree relating different infections. We evaluated this model-based method alongside five nonparametric clustering methods using both simulated and actual HIV sequence data sets. For simulated clusters of rapid transmission, the MMPP clustering method obtained higher mean sensitivity (85%) and specificity (91%) than the nonparametric methods. When applied to these published HIV-1 sequences from a study of a cohort of men who have sex with men in Seattle, USA, the MMPP method categorized about half (46%) as many individuals to clusters compared to the other methods, and the MMPP clusters were more consistent with transmission outbreaks. This new approach to genetic clustering has significant implications for the application of pathogen sequence analysis to public health, where it is critical to robustly and accurately identify outbreaks for the most cost-effective deployment of prevention services and resources.
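
To illustrate the kind of process the abstract names, the following is a minimal sketch of simulating a two-state Markov-modulated Poisson process in Python. It is not the authors' implementation, which fits an MMPP to a phylogenetic tree; the function name `simulate_mmpp`, the two-state restriction, and all rate values are assumptions chosen for illustration only.

```python
import random


def simulate_mmpp(rates, switch_rates, t_end, seed=0):
    """Simulate a two-state Markov-modulated Poisson process on [0, t_end).

    rates[s]        -- Poisson event rate while the hidden chain is in state s
    switch_rates[s] -- rate at which the hidden chain leaves state s
    Returns (event_times, event_states), where event_states[i] is the hidden
    state at the moment of event i.
    """
    rng = random.Random(seed)
    t, state = 0.0, 0          # start at time 0 in state 0 (an assumption)
    times, states = [], []
    while True:
        # Events and state switches compete as independent exponential clocks,
        # so the waiting time to whichever fires first has the summed rate.
        total = rates[state] + switch_rates[state]
        t += rng.expovariate(total)
        if t >= t_end:
            break
        if rng.random() < rates[state] / total:
            times.append(t)        # an event occurred in the current state
            states.append(state)
        else:
            state = 1 - state      # the hidden chain switched state
    return times, states


if __name__ == "__main__":
    # Hypothetical values: state 0 is "slow", state 1 is "fast".
    times, states = simulate_mmpp(rates=(0.5, 5.0), switch_rates=(0.2, 0.2), t_end=50.0)
    print(len(times), "events;", sum(states), "of them in the fast state")
```

In this sketch the hidden state plays the role of a slow or fast rate class, so runs of closely spaced events correspond to time spent in the fast state.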
