Locating regions of differential variability in DNA and protein sequences.

作者: R. C. Lewontin , Hua Tang

DOI: 10.1093/GENETICS/153.1.485

关键词:

摘要: In the comparison of DNA and protein sequences between species or paralogues among individuals within a population, there is often some indication that different regions sequence are divergent polymorphic to degrees, indicating differential constraint diversifying selection operating in sequence. The problem test statistically whether observed regional differences density variant sites represent real then estimate as accurately possible location regions. A method given for testing locating variation. consists calculating G(x(k)) = k/n - x(k)/N, where x(k) position kth site along sequence, n total number sites, N length. estimated region longest stretch adjacent which monotonically increasing (a hot spot) decreasing cold spot). Critical values this length tests significance given, sequential developed multiple regions, power against various alternatives explored. locates endpoints spots variation with high accuracy.

参考文章(6)
Carol J. Feltz, Gerald A. Goldin, Generalization of the kolmogorov-smirnov goodness-of-fit test, usinggroup invariance Journal of Nonparametric Statistics. ,vol. 1, pp. 357- 370 ,(1992) , 10.1080/10485259208832535
R. C. Lewontin, Peter J. E. Goss, Detecting heterogeneity of substitution along DNA and protein sequences. Genetics. ,vol. 143, pp. 589- 602 ,(1996) , 10.1093/GENETICS/143.1.589
Jeffrey S. Simonoff, Smoothing Methods in Statistics ,(1996)
Manyuan Long, R. C. Lewontin, Eiji Nitasaka, Brent Richter, Nucleotide Variation and Conservation at the dpp Locus, a Gene Controlling Early Development in Drosophila Genetics. ,vol. 145, pp. 311- 323 ,(1997) , 10.1093/GENETICS/145.2.311
Nicolaas H. Kuiper, Tests concerning random points on a circle Indagationes Mathematicae (Proceedings). ,vol. 63, pp. 38- 47 ,(1960) , 10.1016/S1385-7258(60)50006-0
D. V. HINKLEY, Inference about the change-point from cumulative sum tests Biometrika. ,vol. 58, pp. 509- 523 ,(1971) , 10.1093/BIOMET/58.3.509