作者: Alexander F Auch , Stefan R Henz , Barbara R Holland , Markus Göker
关键词: Selection (genetic algorithm) 、 Sequence alignment 、 Biology 、 Genetics 、 Sequence analysis 、 Genome 、 Phylogenetic tree 、 Distance matrices in phylogeny 、 Computational biology 、 Phylogenetics 、 UPGMA
摘要: Background: Phylogenetic methods which do not rely on multiple sequence alignments are important tools in inferring trees directly from completely sequenced genomes. Here, we extend the recently described Genome BLAST Distance Phylogeny (GBDP) strategy to compute phylogenetic all plastid genomes currently available and a selection of mitochondrial representing major eukaryotic lineages. BLASTN, TBLASTX, or combinations both used locate high-scoring segment pairs (HSPs) between two sequences pairwise similarities distances computed different ways resulting total 96 GBDP variants. The suitability these distance formulae for phylogeny reconstruction is estimated by computing measure "treelikeness", so-called δ value, respective matrices. Additionally, compare inferred matrices using UPGMA, NJ, BIONJ, FastME, STC, respectively, with NCBI taxonomy tree taxa under study. Results: Our results indicate that, at this taxonomic level, much more valuable phylogenies than genomes, that based breakpoints little use. Distances proportion "matched" HSP length average genome were best estimation. Additionally found TBLASTX instead BLASTN and, particularly, combining leads small but significant increase accuracy. Other factors significantly affect outcome. BIONJ algorithm most accordance current taxonomy, NJ FastME performing insignificantly worse, STC as well if applied high quality values be reliable predictor Conclusion: Using treelike matrices, judged their values, able recover plant lineages, Apicomplexa organelles being derived "green" plastids "red" type. GBDP-like can reliably infer kinds genomic data. A framework established further develop improve such methods. topologyindependent tool general use development assessment inference.