Statistical algorithms in the study of mammalian DNA methylation

作者: Meromit Singer

DOI:

关键词:

摘要: DNA methylation is a dynamic chemical modification that abundant on sequences and plays central role in the regulatory mechanisms of cells. This can be inherited across cell divisions generations, providing ``memory mechanism for programs more flexible than coded sequence. In recent years, high-throughput sequencing technologies have enabled genome-wide annotation methylation. Coupled with novel computational machinery, these developments unperceivable insight to characteristics, biological function disease association this phenomenon. The collaborations between experimental researches who take part efforts has been closer ever before due need involve methodologies throughout entire research pipeline, from design through bias correction analysis large datasets. first thesis we present contributions field We introduce statistically sound criteria detection signatures sequence, an algorithm informative non-overlapping subset such regions optimal under biologically motivated assumptions. Our method outputs sequence-generated list are interest respect their states. then Bayesian network infer corrected site-specific states favorable but biased method, describe its incorporation software package. Along calls our package annotates experiment-specific by considering both state inferences genomic These serve as basis comparative studies. last chapter section bring results genome-scale study conducted humans, chimpanzees orangutan, evidence differences propagate generations distinguish closely related species. second concerns error course studying discovered systematic false-positive variant significantly affect variety classifier correct errors show it performs very well sensitivity specificity.

参考文章(177)
Fabio Mohn, Michael Weber, Dirk Schübeler, Tim-Christoph Roloff, Methylated DNA immunoprecipitation (MeDIP). Methods of Molecular Biology. ,vol. 507, pp. 55- 64 ,(2009) , 10.1007/978-1-59745-522-0_5
Kevin Atteson, Calculating the Exact Probability of Language-Like Patterns in Biomolecular Sequences intelligent systems in molecular biology. ,vol. 6, pp. 17- 24 ,(1998)
Nir Friedman, Daniel L. Koller, Probabilistic graphical models : principles and techniques The MIT Press. ,(2009)
Hugh D. Morgan, Heidi G.E. Sutherland, David I.K. Martin, Emma Whitelaw, Epigenetic inheritance at the agouti locus in the mouse Nature Genetics. ,vol. 23, pp. 314- 318 ,(1999) , 10.1038/15490
Daniel Yekutieli, Yoav Benjamini, THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY Annals of Statistics. ,vol. 29, pp. 1165- 1188 ,(2001) , 10.1214/AOS/1013699998
Lior Pachter, Models for transcript quantification from RNA-Seq arXiv: Genomics. ,(2011)
Christoph Bock, Analysing and interpreting DNA methylation data. Nature Reviews Genetics. ,vol. 13, pp. 705- 719 ,(2012) , 10.1038/NRG3273
Rasmus Nielsen, In search of rare human variants Nature. ,vol. 467, pp. 1050- 1051 ,(2010) , 10.1038/4671050A
Ian R. Henderson, Steven E. Jacobsen, Epigenetic inheritance in plants Nature. ,vol. 447, pp. 418- 424 ,(2007) , 10.1038/NATURE05917
John G. Fleagle, Primate Adaptation and Evolution ,(1998)