On the nature of long-range letter correlations in texts

Author: Dmitrii Y. Manin

Abstract: The origin of long-range letter correlations in natural texts is studied using random walk analysis and Jensen-Shannon divergence. It is concluded that they result from slow variations in letter frequency distribution, which are a consequence of slow variations in lexical composition within the text. These correlations are preserved by random letter shuffling within a moving window. As such, they do reflect structural properties of the text, but in a very indirect manner.
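
The two tools named in the abstract can be illustrated with a minimal sketch. It is not the author's code. The function names are hypothetical, the divergence is the standard equal-weight Jensen-Shannon divergence in bits, and the shuffle uses non-overlapping blocks as a simplifying assumption in place of the paper's moving window.

```python
import math
import random
from collections import Counter


def letter_distribution(text):
    """Relative frequency of each alphabetic character (case-folded)."""
    letters = [c for c in text.lower() if c.isalpha()]
    counts = Counter(letters)
    total = len(letters)
    return {c: n / total for c, n in counts.items()}


def entropy(p):
    """Shannon entropy in bits of a distribution given as a dict."""
    return -sum(v * math.log2(v) for v in p.values() if v > 0)


def js_divergence(p, q):
    """Equal-weight Jensen-Shannon divergence: H(m) - (H(p) + H(q)) / 2."""
    keys = set(p) | set(q)
    m = {k: 0.5 * (p.get(k, 0.0) + q.get(k, 0.0)) for k in keys}
    return entropy(m) - 0.5 * (entropy(p) + entropy(q))


def shuffle_within_windows(text, window, seed=0):
    """Shuffle characters inside consecutive blocks of length `window`.

    Assumption: non-overlapping blocks stand in for the moving window
    described in the abstract. Letter counts per block are unchanged,
    so slow drifts in letter frequency survive the shuffle.
    """
    rng = random.Random(seed)
    blocks = []
    for i in range(0, len(text), window):
        block = list(text[i:i + window])
        rng.shuffle(block)
        blocks.append("".join(block))
    return "".join(blocks)
```

Comparing `js_divergence` between the letter distributions of two distant segments, before and after `shuffle_within_windows`, gives the same value whenever each segment is a whole number of blocks, because the shuffle leaves the letter counts of every block untouched.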
