Text analysis tools for identification of emerging topics and research gaps in conservation science.

Authors: Martin J. Westgate, Philip S. Barton, Jennifer C. Pierson, David B. Lindenmayer

DOI: 10.1111/COBI.12605

Keywords: Statistical hypothesis testing, Ecology, Biology, Topic model, Data science, Task (project management), Latent Dirichlet allocation, Identification (information), Scientific literature, Popularity, Systematic review

Abstract: Keeping track of conceptual and methodological developments is a critical skill for research scientists, but this task is increasingly difficult due to the high rate of academic publication. As a crisis discipline, conservation science is particularly in need of tools that facilitate rapid yet insightful synthesis. We show how a common text-mining method (latent Dirichlet allocation, or topic modeling) and statistical tests familiar to ecologists (cluster analysis, regression, and network analysis) can be used to investigate trends and identify potential research gaps in the scientific literature. We tested these methods on the literature on ecological surrogates and indicators. Analysis of topic popularity within this corpus showed a strong emphasis on monitoring and management of fragmented ecosystems, while analysis of research gaps suggested a greater role for genetic surrogates and indicators. Our results show that automated text analysis methods need to be used with care, but can provide information that is complementary to that given by systematic reviews and meta-analyses, increasing scientists' capacity for research synthesis.
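
The workflow outlined in the abstract (fit a topic model, then test whether each topic is gaining or losing popularity over time) can be illustrated with a minimal sketch. This is not the authors' code: the toy corpus, the publication years, the function names `fit_lda` and `slope`, and all parameter values are hypothetical, and the trend test is reduced to an ordinary least-squares slope of per-document topic weight against year. The topic model itself is a basic collapsed Gibbs sampler for latent Dirichlet allocation, written with the standard library only.

```python
import random


def fit_lda(docs, n_topics, n_iter=50, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA by collapsed Gibbs sampling.

    docs is a list of token lists. Returns (doc_topic, vocab, topic_word_counts),
    where doc_topic[d][k] is the smoothed share of topic k in document d.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)

    ndk = [[0] * n_topics for _ in docs]             # document-topic counts
    nkw = [[0] * n_words for _ in range(n_topics)]   # topic-word counts
    nk = [0] * n_topics                              # tokens assigned to each topic
    z = []                                           # topic assignment of every token

    # Random initial assignment of each token to a topic.
    for d, doc in enumerate(docs):
        assignments = []
        for w in doc:
            k = rng.randrange(n_topics)
            assignments.append(k)
            ndk[d][k] += 1
            nkw[k][word_id[w]] += 1
            nk[k] += 1
        z.append(assignments)

    # Resample each token's topic conditional on all other assignments.
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = z[d][i]
                ndk[d][k] -= 1
                nkw[k][v] -= 1
                nk[k] -= 1
                weights = [
                    (ndk[d][t] + alpha) * (nkw[t][v] + beta) / (nk[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                z[d][i] = k
                ndk[d][k] += 1
                nkw[k][v] += 1
                nk[k] += 1

    doc_topic = [
        [(count + alpha) / (len(doc) + n_topics * alpha) for count in row]
        for row, doc in zip(ndk, docs)
    ]
    return doc_topic, vocab, nkw


def slope(xs, ys):
    """Ordinary least-squares slope of ys regressed on xs."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx


# Hypothetical toy corpus: (publication year, tokenised abstract).
corpus = [
    (2001, "monitoring indicator species forest fragment".split()),
    (2004, "indicator species monitoring management fragment".split()),
    (2008, "genetic diversity surrogate population".split()),
    (2012, "genetic surrogate marker population diversity".split()),
]
years = [year for year, _ in corpus]
doc_topic, vocab, topic_words = fit_lda([tokens for _, tokens in corpus], n_topics=2)

# A positive slope suggests a topic is gaining share over time; a negative one, losing it.
trends = [slope(years, [row[k] for row in doc_topic]) for k in range(2)]
print(trends)
```

With a real corpus, the number of topics, the priors `alpha` and `beta`, and the number of iterations would all need tuning, and a regression with uncertainty estimates would be more appropriate than a bare slope.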
