Recognition of protein coding regions in DNA sequences

Author: James W. Fickett

DOI: 10.1093/NAR/10.17.5303

Keywords: DNA sequencing; Computational biology; Open reading frame; Biology; Genetics; DNA; Noncoding DNA; Gene; Intergenic region; Coding (social sciences); Protein coding

Abstract: We give a test for protein coding regions which is based on simple and universal differences between protein-coding and noncoding DNA. The test is simple enough to use without a computer and is completely objective. The test has been thoroughly proven on 400,000 bases of sequence data: it misclassifies 5% of the regions tested and gives an answer of "No Opinion" one fifth of the time. We predict some new coding regions in published sequences.
