作者: Shuchang Liu
DOI:
关键词:
摘要: More data provide more possibilities. Growing number of genomic new perspectives to understand some complex biological problems. Many algorithms for single-study have been developed, however, their results are not stable small sample size or overwhelmed by study-specific signals. Taking the advantage high throughput from multiple cohorts, in this dissertation, we able detect novel fusion transcripts, explore gene regulations and discovery disease subtypes within an integrative analysis framework. In first project, evaluated 15 transcript detection tools paired-end RNA-seq data. Though no single method had distinguished performance over others, several top were selected according F-measures. We further developed a meta-caller algorithm combining methods re-prioritize candidate transcripts. The showed that our can successfully balance precision recall compared any tool. In second extended liquid association two meta-analytic frameworks (MetaLA MetaMLA). Liquid is dynamic gene-gene correlation depending on expression level third gene. Our MetaLA MetaMLA provided stronger signals consistent analysis. When applied five Yeast datasets related environmental changes, genes triplets highly enriched fundamental processes corresponding changes. In plaid model cohorts bicluster detection. meta-biclustering biclusters with higher Jaccard accuracy toward large noise size. also introduced concept gap statistic pruning parameter estimation. In addition, detected breast cancer mRNA select associated many pathways split samples significantly different survival behaviors. In conclusion, improved transcripts detection, through integrative-analysis frameworks. These strong evidence structure variation, three-way regulation subtype thus contribute better understanding mechanism ultimately.