Authors: Muhammad Sardaraz, Muhammad Tahir, Ataul Aziz Ikram, Hassan Bajwa
DOI: 10.1016/J.YGENO.2014.08.007
Keywords:
Abstract: The growth of Next Generation Sequencing technologies presents significant research challenges, specifically the design of bioinformatics tools that handle massive amounts of data efficiently. The cost of storing biological sequence data has become a noticeable proportion of the total cost of generation and analysis. In particular, the increase in the DNA sequencing rate is significantly outstripping the growth in disk storage capacity and may exceed the limit of available storage. It is therefore essential to develop algorithms that handle large data sets via better memory management. This article presents a DNA sequence compression algorithm, SeqCompress, that copes with the space complexity of biological sequences. The algorithm is based on lossless data compression and uses a statistical model as well as arithmetic coding to compress DNA sequences. The proposed algorithm is compared with recent specialized compression tools for biological sequences. Experimental results show that the proposed algorithm achieves a compression gain over other existing algorithms.