Segment predictability as a cue in word segmentation

Author: C. Anton Rytting

DOI: 10.3115/1622153.1622163

Abstract: Several computational simulations of how children solve the word segmentation problem have been proposed, but most have been applied only to a limited number of languages. One model with some experimental support uses distributional statistics of sound sequence predictability (Saffran et al. 1996). However, the experimental design does not fully specify how predictability is best measured or modeled in a simulation. Saffran et al. (1996) assume transitional probability, but Brent (1999a) claims mutual information (MI) is more appropriate. Both assume predictability is measured locally, relative to neighboring segment-pairs. This paper replicates Brent's (1999a) mutual-information model on a corpus of child-directed speech in Modern Greek, and introduces a variant model using a global threshold. Brent's finding regarding the superiority of MI is confirmed; the relative performance of local comparisons and global thresholds depends on the evaluation metric.
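
The two predictability measures and the two boundary criteria named in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the function names (bigram_scores, segment_local, segment_global, split_at) are hypothetical, utterances are assumed to be sequences of segment symbols, "local comparison" is assumed here to mean a score lower than that of both neighboring segment-pairs, and scores are assumed to be estimated from the same utterances that are segmented.

```python
import math
from collections import Counter


def bigram_scores(utterances):
    """Return (tp, mi): two dicts mapping each adjacent segment pair to a score."""
    unigrams, bigrams, firsts = Counter(), Counter(), Counter()
    for u in utterances:
        unigrams.update(u)
        for a, b in zip(u, u[1:]):
            bigrams[(a, b)] += 1
            firsts[a] += 1  # how often a is followed by any segment
    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())
    tp, mi = {}, {}
    for (a, b), c in bigrams.items():
        tp[(a, b)] = c / firsts[a]  # transitional probability, estimate of P(b | a)
        p_a, p_b = unigrams[a] / n_uni, unigrams[b] / n_uni
        mi[(a, b)] = math.log2((c / n_bi) / (p_a * p_b))  # pointwise MI in bits
    return tp, mi


def split_at(u, cuts):
    """Split sequence u at the given cut positions."""
    bounds = [0, *cuts, len(u)]
    return [u[i:j] for i, j in zip(bounds, bounds[1:])]


def segment_local(u, score):
    """Assumed local criterion: cut where a pair scores below both neighboring pairs.

    The first and last pairs have only one neighbor and are never cut here.
    """
    s = [score[pair] for pair in zip(u, u[1:])]
    cuts = [i + 1 for i in range(1, len(s) - 1) if s[i] < s[i - 1] and s[i] < s[i + 1]]
    return split_at(u, cuts)


def segment_global(u, score, threshold):
    """Global criterion: cut wherever a pair's score falls below a fixed threshold."""
    s = [score[pair] for pair in zip(u, u[1:])]
    cuts = [i + 1 for i, v in enumerate(s) if v < threshold]
    return split_at(u, cuts)
```

Either score dictionary (tp or mi) can be passed to either segmentation function, so the four combinations of measure and criterion can be compared on the same input. The threshold value is a free parameter in this sketch; how it is chosen is not specified in the abstract.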

References (18)
Anja Belz. An Approach to the Automatic Acquisition of Phonotactic Constraints. Meeting of the Association for Computational Linguistics (1998).
Carl G. De Marcken, Robert C. Berwick. Unsupervised language acquisition. PhD thesis (1996).
Irene Philippaki-Warburton, David Holton, Peter A. Mackridge. Greek: A Comprehensive Grammar of the Modern Language (1997).
Mehryar Mohri, Fernando Pereira, Michael Riley. A rational design for a weighted finite-state transducer library. Lecture Notes in Computer Science, pp. 144-158 (1998). DOI: 10.1007/BFB0031388
J. R. Saffran, R. N. Aslin, E. L. Newport. Statistical Learning by 8-Month-Old Infants. Science, vol. 274, pp. 1926-1928 (1996). DOI: 10.1126/SCIENCE.274.5294.1926
C. Anton Rytting. Greek word segmentation using minimal information. Proceedings of the Student Research Workshop at HLT-NAACL 2004, pp. 43-48 (2004). DOI: 10.3115/1614038.1614046