Lost branches on the tree of life.

作者: Bryan T. Drew , Romina Gazis , Patricia Cabezas , Kristen S. Swithers , Jiabin Deng

DOI: 10.1371/JOURNAL.PBIO.1001636

关键词: SystematicsEvolutionary taxonomyGenBankPhylogeneticsBiologyData scienceData managementTree of lifeDryad (programming)GeneticsPhylogenetic tree

摘要: Given that reproducibility is a pillar of scientific research, the preservation knowledge (underlying data) paramount importance. The standard can be evaluated based on criteria methodological rigor and legitimacy, which sometimes used to distinguish “hard” from “soft” sciences. In phylogenetics, discipline routinely uses DNA sequences build trees reflecting organismal relationships, scale data collection complexity analytical software have both increased dramatically during past decade. Consequently, ability navigate publications reproduce analyses more challenging than ever. When sequencing was initially employed in systematics late 1980s, there some reluctance deposit nucleotide open repositories such as GenBank [1]. This ultimately changed when high-impact journals (e.g., Proceedings National Academy Sciences, Nature, Science) began requiring submission prerequisite for publication [1],[2]; now virtually every evolutionary biology journal observes this requirement (but see [3]). Until recently, uploading (or EMBL) generally considered sufficient ensure phylogenetic studies using sequence data. Increasingly, however, community realizing archiving raw not adequate, underlying alignments well resulting are pivotal reproducibility, comparative purposes, meta-analyses, synthesis. Indeed, has been growing clamor adopt enforce rigorous practices across diverse disciplines [4]–[8]. As result, about 35 [5],[9] adopted policies encourage or require authors upload alignments, trees, other files requisite study [5] TreeBASE (http://treebase.org/) and/or public Dryad (http://datadryad.org). Unfortunately, enforcement deposition lax, most evolution still do alignment tree deposition. published papers systematics/phylogenetics remain inaccessible at large [8],[10].

参考文章(23)
J. M. Mercer, The Effects of Cenozoic Global Change on Squirrel Phylogeny Science. ,vol. 299, pp. 1568- 1572 ,(2003) , 10.1126/SCIENCE.1079705
R. D. Gray, A. J. Drummond, S. J. Greenhill, Language Phylogenies Reveal Expansion Pulses and Pauses in Pacific Settlement Science. ,vol. 323, pp. 479- 483 ,(2009) , 10.1126/SCIENCE.1166858
C. Burks, L. J. Tomlinson, Submission of data to GenBank Proceedings of the National Academy of Sciences of the United States of America. ,vol. 86, pp. 408- 408 ,(1989) , 10.1073/PNAS.86.2.408
Bryan T Drew, None, Data deposition: Missing data mean holes in tree of life. Nature. ,vol. 493, pp. 305- 305 ,(2013) , 10.1038/493305F
Jelte M. Wicherts, Denny Borsboom, Judith Kats, Dylan Molenaar, The poor availability of psychological research data for reanalysis American Psychologist. ,vol. 61, pp. 726- 728 ,(2006) , 10.1037/0003-066X.61.7.726
Alawi A. Alsheikh-Ali, Waqas Qureshi, Mouaz H. Al-Mallah, John P. A. Ioannidis, Public Availability of Published Research Data in High-Impact Journals PLoS ONE. ,vol. 6, pp. e24357- ,(2011) , 10.1371/JOURNAL.PONE.0024357
KATHERINE W. McCAIN, Mandating Sharing: Journal Policies in the Natural Sciences Science Communication. ,vol. 16, pp. 403- 431 ,(1995) , 10.1177/1075547095016004003
Heather A. Piwowar, Roger S. Day, Douglas B. Fridsma, Sharing Detailed Research Data Is Associated with Increased Citation Rate PLoS ONE. ,vol. 2, pp. e308- ,(2007) , 10.1371/JOURNAL.PONE.0000308
Caroline J. Savage, Andrew J. Vickers, Empirical Study of Data Sharing by Authors Publishing in PLoS Journals PLoS ONE. ,vol. 4, pp. e7078- ,(2009) , 10.1371/JOURNAL.PONE.0007078
Mohamed A. F Noor, Katherine J Zimmerman, Katherine C Teeter, Data Sharing: How Much Doesn't Get Submitted to GenBank? PLOS Biology. ,vol. 4, ,(2006) , 10.1371/JOURNAL.PBIO.0040228