作者: R. C. Lewontin , Hua Tang
DOI: 10.1093/GENETICS/153.1.485
关键词:
摘要: In the comparison of DNA and protein sequences between species or paralogues among individuals within a population, there is often some indication that different regions sequence are divergent polymorphic to degrees, indicating differential constraint diversifying selection operating in sequence. The problem test statistically whether observed regional differences density variant sites represent real then estimate as accurately possible location regions. A method given for testing locating variation. consists calculating G(x(k)) = k/n - x(k)/N, where x(k) position kth site along sequence, n total number sites, N length. estimated region longest stretch adjacent which monotonically increasing (a hot spot) decreasing cold spot). Critical values this length tests significance given, sequential developed multiple regions, power against various alternatives explored. locates endpoints spots variation with high accuracy.