Data compression method

作者: Manabu Kawabe , Takuro Sato , Yoshihito Shimazaki

DOI:

关键词: Computer scienceSpeech recognitionNatural language processingSeries (mathematics)Word (computer architecture)Artificial intelligenceCode wordData compressionAssociation (psychology)

摘要: In a method of compressing text data including codewords characters, an input word each composed series one or more characters is extracted from the data, dictionary containing, as entries, words made up provided, codeword stored in association with word, occurrence counts respective are also stored, searched to find whether not matches any words, assigned which has been found match produced, count updated; and when introduced new word.

参考文章(4)
David J. Van Maren, David W. Ruska, Jeff J. Kato, Performance-based reset of data compression dictionary ,(1988)
R. Gallager, Variations on a theme by Huffman IEEE Transactions on Information Theory. ,vol. 24, pp. 668- 674 ,(1978) , 10.1109/TIT.1978.1055959