作者: Jure Zupan , Milan Randić
DOI: 10.1021/CI040104J
关键词:
摘要: An algorithm for encoding long strings of building blocks, like 4 DNA bases (adenine - A, cytosine C, thymine T, and guanidine G), 20 natural amino acids (from Alanine Ala to Valine Val, plus the stop triplet), or all 64 possible base triplets AAA TTT), into “zigzag” “spectrum-like” representations is suggested. The new scheme can be derived in 3-, 2-, 1-dimensional form depending on user's wishes. only information, besides string which representation sought, initial positioning complete set units from composed, i.e., four positions G, stop, etc. This initialized either 1-D form. As an illustration suggested visual chemometric comparison first 10 exon beta globin gene different species, each consisting about 100 basic a...