Orthology detection combining clustering and synteny for very large datasets.

Authors: Marcus Lechner, Maribel Hernandez-Rosales, Daniel Doerr, Nicolas Wieseke, Annelyse Thévenin

DOI: 10.1371/JOURNAL.PONE.0105015

Abstract: The elucidation of orthology relationships is an important step both in gene function prediction and towards understanding patterns of sequence evolution. Orthology assignments are usually derived directly from sequence similarities for large data because more exact approaches exhibit too high computational costs. Here we present PoFF, an extension for the standalone tool Proteinortho, which enhances orthology detection by combining clustering, sequence similarity, and synteny. In the course of this work, FFAdj-MCS, a heuristic that assesses pairwise gene order using adjacencies (a similarity measure related to the breakpoint distance), was adapted to support multiple linear chromosomes and extended to detect duplicated regions. PoFF largely reduces the number of false positives and enables more fine-grained predictions than purely similarity-based approaches. The extension maintains the low memory requirements and the efficient concurrency options of its basis Proteinortho, making the software applicable to very large datasets.
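To make the notion of adjacencies concrete, the following is a minimal sketch that counts conserved gene adjacencies between two genomes with multiple linear chromosomes. It is not FFAdj-MCS and not part of PoFF or Proteinortho: the function names, the unsigned (orientation-free) treatment of genes, and the one-to-one `orthologs` mapping are all assumptions made for illustration.

```python
from typing import Dict, FrozenSet, List, Set

Genome = List[List[str]]  # hypothetical layout: one list of gene names per linear chromosome


def adjacencies(genome: Genome) -> Set[FrozenSet[str]]:
    """Collect unordered pairs of neighbouring genes on each linear chromosome."""
    pairs: Set[FrozenSet[str]] = set()
    for chromosome in genome:
        for left, right in zip(chromosome, chromosome[1:]):
            if left != right:  # skip degenerate pairs of identical names
                pairs.add(frozenset((left, right)))
    return pairs


def conserved_adjacencies(genome_a: Genome, genome_b: Genome,
                          orthologs: Dict[str, str]) -> int:
    """Count adjacencies in genome A whose mapped genes are also adjacent in genome B.

    `orthologs` is an assumed one-to-one mapping from genes of A to genes of B.
    Gene orientation is ignored in this simplified version.
    """
    adj_b = adjacencies(genome_b)
    count = 0
    for pair in adjacencies(genome_a):
        g1, g2 = tuple(pair)
        if g1 in orthologs and g2 in orthologs:
            if frozenset((orthologs[g1], orthologs[g2])) in adj_b:
                count += 1
    return count


# Toy example with two chromosomes per genome (made-up gene names).
genome_a = [["a1", "a2", "a3"], ["a4", "a5"]]
genome_b = [["b2", "b1", "b4"], ["b3", "b5"]]
orthologs = {f"a{i}": f"b{i}" for i in range(1, 6)}

print(conserved_adjacencies(genome_a, genome_b, orthologs))
```

In the toy data only the pair (a1, a2) stays adjacent after mapping, so a higher count would indicate better conserved gene order between the two genomes. Handling duplicated regions and choosing the mapping itself, which the abstract attributes to the adapted heuristic, are outside the scope of this sketch.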
