Bioinformatics Tools for the Analysis of Gene-Phenotype Relationships Coupled with a Next Generation ChIP-Sequencing Data Analysis Pipeline

作者: Erinija Pranckeviciene

DOI: 10.20381/RUOR-2702

关键词: GenePhenotypeDNA sequencingChIP-sequencingChromatin immunoprecipitationBioinformaticsRegulation of gene expressionProcess (engineering)Pipeline (software)Biology

摘要: The rapidly advancing high-throughput and next generation sequencing technologies facilitate deeper insights into the molecular mechanisms underlying expression of phenotypes in living organisms. Experimental data scientific publications following this technological advancement have accumulated public databases. Meaningful analysis currently available genomic databases requires sophisticated computational tools algorithms, presents considerable challenges to biologists without specialized training bioinformatics. To study their phenotype interest must prioritize large lists poorly characterized genes generated experiments. date, prioritization primarily been designed work with human diseases as defined by known be associated those diseases. There is therefore a need for more which are not related generally or no yet particular. Chromatin immunoprecipitation followed (ChIP-Seq) method choice gene regulation processes responsible cellular phenotypes. Among publicly pipelines processing ChIP-Seq data, there lack downstream composite motifs preferred binding distances DNA proteins. This thesis aimed address gap existing process provide rapid interpretation genes. Additionally, programs transcription factors were integrated pipeline expedited results. A algorithm linking non-disease described meaningful keywords was developed. can used candidate genetic targets factor produced analysis.

参考文章(194)
Carolina Perez-Iratxeta, Harindar S. Keer, Peer Bork, Miguel A. Andrade, Computing fuzzy associations for the analysis of biological literature. BioTechniques. ,vol. 32, pp. 1380- 1385 ,(2002) , 10.2144/02326BC03
Martin Krallinger, Florian Leitner, Alfonso Valencia, Analysis of biological processes and diseases using text mining approaches. Methods of Molecular Biology. ,vol. 593, pp. 341- 382 ,(2010) , 10.1007/978-1-60327-194-3_16
Ivan V. Kulakovskiy, Vsevolod J. Makeev, DNA Sequence Motif: A Jack of All Trades for ChIP-Seq Data Advances in Protein Chemistry. ,vol. 91, pp. 135- 171 ,(2013) , 10.1016/B978-0-12-411637-5.00005-6
Mark Halling-Brown, Adrian J. Shepherd, Constructing computational pipelines Methods of Molecular Biology. ,vol. 453, pp. 451- 470 ,(2008) , 10.1007/978-1-60327-429-6_24
Eric Sayers, The E-utilities In-Depth: Parameters, Syntax and More National Center for Biotechnology Information (US). ,(2015)
Hagit Shatkay, Mark Craven, Mining the Biomedical Literature ,(2012)
Carolina Perez-Iratxeta, Matthias Wjst, Peer Bork, Miguel A Andrade, G2D: a tool for mining genes associated with disease BMC Genetics. ,vol. 6, pp. 45- 45 ,(2005) , 10.1186/1471-2156-6-45
Pentao Liu, Peng Li, Shannon Burke, Critical roles of Bcl11b in T-cell development and maintenance of T-cell identity Immunological Reviews. ,vol. 238, pp. 138- 149 ,(2010) , 10.1111/J.1600-065X.2010.00953.X
Jianxing Feng, Tao Liu, Yong Zhang, None, Using MACS to Identify Peaks from ChIP-Seq Data Current protocols in human genetics. ,vol. 34, ,(2011) , 10.1002/0471250953.BI0214S34