作者: Deloula Mansouri , Xiaohui Yuan , Abdeldjalil Saidani
DOI: 10.3390/A13040099
关键词:
摘要: With the emergent evolution in DNA sequencing technology, a massive amount of genomic data is produced every day, mainly sequences, craving for more storage and bandwidth. Unfortunately, managing, analyzing specifically storing these large amounts become major scientific challenge bioinformatics. Therefore, to overcome challenges, compression has necessary. In this paper, we describe new reference-free compressor abbreviated as DNAC-SBE. DNAC-SBE lossless hybrid that consists three phases. First, starting from largest base (Bi), positions each Bi are replaced with ones other bases have smaller frequencies than zeros. Second, encode generated streams, propose single-block encoding scheme (SEB) based on exploitation position neighboring bits within block using two different techniques. Finally, proposed algorithm dynamically assigns shorter length code block. Results show outperforms state-of-the-art compressors proves its efficiency terms special conditions imposed compressed data, space transfer rate regardless file format or size data.