作者: Jes Stollberg , Johann Urschitz , Zsolt Urban , Charles D Boyd
DOI: 10.1101/GR.10.8.1241
关键词:
摘要: Serial Analysis of Gene Expression (SAGE) is an innovative technique that offers the potential cataloging both identity and relative frequencies mRNA transcripts in a given poly(A+) RNA preparation. Although it very effective approach for determining expression populations, there are significant biases observed results inherent experimental process. These caused by sampling error, sequencing nonuniqueness, nonrandomness tag sequences. The quantitative information desired from SAGE experiments consists estimates number genes frequency distribution transcript copy numbers. Of additional concern extent to which sequence can be assumed unique its gene. present study takes these mathematical into account presents basis maximum likelihood estimation gene set results. true state genomic markedly different those based directly on observations underlying experiments. It also shown while many cases probable within genome, larger genomes this cannot safely assumed.