Analysis of Genomic and Proteomic Sequences using DSP Techniques

作者: Raja Sekhar Kakumani

DOI:

关键词:

摘要: Analysis of biological sequences by detecting the hidden periodicities and symbolic patterns has been an active area research since couple decades. The periodic components help locating biologically relevant motifs such as protein coding regions (exons), CpG islands (CGI) hot-spots that characterize various functions. discrete nature prompted many researchers to use digital signal processing (DSP) techniques for their analysis. After mapping numerical sequences, DSP using filters, wavelets, neural networks, filter banks etc. have developed detect recurring in these sequences. This thesis attempts develop effective based solve some important problems sequence Specifically, statistically optimal null filters (SONF), matched networks algorithms are analysis deoxyribonucleic acid (DNA), ribonucleic (RNA) sequences. In first part this study, DNA investigated order identify locations CGIs regions, i.e., exons. SONFs, which known ability efficiently estimate short-duration signals embedded noise combining maximum signal-to-noise ratio least squares optimization criteria, utilized problems. Basis characterizing exons formulated be used SONF technique solving problems. In second RNA analyzed predict secondary structures. For purpose, on 2-dimensional convolution stem loop structure. knowledge thus obtained then presence pseudoknot, leading determination entire Finally, third thesis, predicting structure identifying hot-spots. a two-stage network scheme is developed, whereas approach proposed. Hot-spots proteins exhibit characteristic frequency corresponding function. A basis function SONFs belonging functional group. Extensive experiments performed throughout demonstrate effectiveness validity schemes investigation. performance proposed compared with previously reported results validated databases containing annotations. It shown result superior those existing techniques.

参考文章(83)
Michael Zuker, Prediction of RNA Secondary Structure by Energy Minimization Methods of Molecular Biology. ,vol. 25, pp. 267- 294 ,(1994) , 10.1385/0-89603-276-0:267
Eugene I. Plotkin, Signal-Controlled Time-Series Modeling Based on ARMA Blocks, and Separation of Superimposed,Overlapping Spectra Signals (Special Section on Nonlinear Theory and its Applications) IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences. ,vol. 79, pp. 1676- 1681 ,(1996)
Jean Garnier, Jean-François Gibrat, Barry Robson, GOR method for predicting protein secondary structure from amino acid sequence. Methods in Enzymology. ,vol. 266, pp. 540- 553 ,(1996) , 10.1016/S0076-6879(96)66034-0
Carl Ivar Branden, John Tooze, Introduction to protein structure ,(1991)
Harvey F. Lodish, Molecular Cell Biology ,(1986)
Jitender S Deogun, Ruben Donis, Olga Komina, Fangrui Ma, None, RNA secondary structure prediction with simple pseudoknots asia pacific bioinformatics conference. pp. 239- 246 ,(2004)
Gary D. Stormo, Robert B. Cary, Graph-theoretic approach to RNA modeling using comparative data. intelligent systems in molecular biology. ,vol. 3, pp. 75- 80 ,(1995)
Jean-Pierre Issa, CpG island methylator phenotype in cancer Nature Reviews Cancer. ,vol. 4, pp. 988- 993 ,(2004) , 10.1038/NRC1507
F. H. D. van Batenburg, PseudoBase: structural information on RNA pseudoknots Nucleic Acids Research. ,vol. 29, pp. 194- 195 ,(2001) , 10.1093/NAR/29.1.194