Detecting Positive Selection in Populations Using Genetic Data

作者: Angelos Koropoulis , Nikolaos Alachiotis , Pavlos Pavlidis

DOI: 10.1007/978-1-0716-0199-0_5

关键词:

摘要: High-throughput genomic sequencing allows to disentangle the evolutionary forces acting in populations. Among forces, positive selection has received a lot of attention because it is related adaptation populations their environments, both biotic and abiotic. Positive selection, also known as Darwinian occurs when an allele favored by natural selection. The frequency increases population and, due genetic hitchhiking, neighboring linked variation diminishes, creating so-called selective sweeps. Such process leaves traces genomes that can be detected future time point. Detecting achieved searching for signatures introduced sweeps, such regions reduced variation, specific shift site spectrum, particular linkage disequilibrium (LD) patterns region. A variety approaches used detecting ranging from simple implementations compute summary statistics more advanced statistical approaches, e.g., Bayesian maximum-likelihood-based methods, machine learning methods. In this chapter, we discuss sweep detection methodologies on basis capacity analyze whole or just subgenomic regions, polymorphism they exploit signatures. We summarize results comparisons among five open-source software releases (SweeD, SweepFinder, SweepFinder2, OmegaPlus, RAiSD) regarding sensitivity, specificity, execution times. Furthermore, test methods present thorough performance analysis. equilibrium neutral models mild bottlenecks, most are able detect sweeps accurately. Methods tools rely rather than single SNPs exhibit higher true rates spectrum (SFS)-based under model recurrent hitchhiking. However, false rate elevated misspecified demographic build distribution statistic null hypothesis. Both LD SFS-based suffer decreased accuracy localizing target bottleneck scenarios. extensive analysis effects gene flow detection, problem been understudied literature.

参考文章(97)
Nikolaos Alachiotis, Pavlos Pavlidis, Alexandros Stamatakis, Exploiting Multi-grain Parallelism for Efficient Selective Sweep Detection Algorithms and Architectures for Parallel Processing. pp. 56- 68 ,(2012) , 10.1007/978-3-642-33078-0_5
Montserrat Aguadé, Charles H. Langley, Polymorphism and Divergence in Regions of Low Recombination in Drosophila Springer US. pp. 67- 76 ,(1994) , 10.1007/978-1-4615-2383-3_6
Laurent A F Frantz, Joshua G Schraiber, Ole Madsen, Hendrik-Jan Megens, Alex Cagan, Mirte Bosse, Yogesh Paudel, Richard P M A Crooijmans, Greger Larson, Martien A M Groenen, Evidence of long-term gene flow and selection during domestication from analyses of Eurasian wild and domestic pig genomes. Nature Genetics. ,vol. 47, pp. 1141- 1148 ,(2015) , 10.1038/NG.3394
Bernhard Schölkopf, Alexander J. Smola, Learning with Kernels The MIT Press. pp. 626- ,(2018) , 10.7551/MITPRESS/4175.001.0001
J. G. Carbonell, T. M. Mitchell, R. S. Michalski, Machine Learning: An Artificial Intelligence Approach Springer Publishing Company, Incorporated. ,(2013)
Christopher M. Bishop, Pattern Recognition and Machine Learning ,(2006)
W P Maddison, R R Hudson, M Slatkin, Estimation of levels of gene flow from DNA sequence data. Genetics. ,vol. 132, pp. 583- 589 ,(1992) , 10.1093/GENETICS/132.2.583
Valeria Montano, Xavier Didelot, Matthieu Foll, Bodo Linz, Richard Reinhardt, Sebastian Suerbaum, Yoshan Moodley, Jeffrey D. Jensen, Worldwide Population Structure, Long-Term Demography, and Local Adaptation of Helicobacter pylori Genetics. ,vol. 200, pp. 947- 963 ,(2015) , 10.1534/GENETICS.115.176404
C H Langley, R R Hudson, N L Kaplan, The "hitchhiking effect" revisited. Genetics. ,vol. 123, pp. 887- 899 ,(1989) , 10.1093/GENETICS/123.4.887