Innovative assembly strategy contributes to the understanding of evolution and conservation genetics of the critically endangered Solenodon paradoxus from the island of Hispaniola

Authors: Kirill Grigorev, Sergey Kliver, Pavel Dobrynin, Aleksey Komissarov, Walter Wolfsberger

DOI: 10.1101/164574

Keywords: Paradoxus, Phylogenetic tree, Subspecies, Population, Conservation genetics, Zoology, Solenodon, Biology, Reference genome, Genome

Abstract: Solenodons are insectivores living on the Caribbean islands, with few surviving related taxa. The genus occupies one of the most ancient branches among placental mammals. The study of the history, unique biology, and adaptations of these enigmatic venomous species can be greatly advanced given the availability of genome data, but whole-genome assembly for solenodons has never been previously performed, partially due to the difficulty of obtaining samples from the field. Island isolation has likely resulted in extreme homozygosity within the Hispaniolan solenodon (Solenodon paradoxus), so we tested the performance of several assembly strategies on genetically impoverished species' genomes. The string-graph based assembly strategy seems a better choice compared to the conventional de Bruijn graph approach, due to the high levels of homozygosity, which is often a hallmark of endemic or endangered species. A consensus reference genome was assembled from sequences of five individuals from the southern subspecies (S. p. woodi). In addition, we obtained additional sequence from the northern subspecies. The resulting assemblies were compared to each other and annotated for genes, with specific emphasis on repeats, variable microsatellite loci, and other genomic variants. Phylogenetic positioning and selection signatures were inferred from 4,416 single-copy orthologs from 10 other mammals. Patterns of SNP variation allowed us to infer population demography, which indicated a split within the species at least 300 Kya.
