Segment predictability as a cue in word segmentation

Author: C. Anton Rytting

Abstract: Several computational simulations of how children solve the word segmentation problem have been proposed, but most have been applied only to a limited number of languages. One model with some experimental support uses distributional statistics of sound sequence predictability (Saffran et al. 1996). However, the experimental design does not fully specify how predictability is best measured or modeled in a simulation. Saffran et al. (1996) assume transitional probability, but Brent (1999a) claims that mutual information (MI) is more appropriate. Both assume that predictability is measured locally, relative to neighboring segment-pairs. This paper replicates Brent's (1999a) mutual-information model on a corpus of child-directed speech in Modern Greek, and introduces a variant model using a global threshold. Brent's finding regarding the superiority of MI is confirmed; the relative performance of local comparisons and global thresholds depends on the evaluation metric.
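The contrast the abstract draws (transitional probability versus mutual information, and local comparison versus a global threshold) can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names, the toy corpus of three invented "words", and the threshold value are all assumptions made for illustration, and the counting scheme is deliberately simplified.

```python
import math
from collections import Counter

def train(utterances):
    """Count segments and adjacent segment pairs in unsegmented utterances."""
    unigrams, bigrams = Counter(), Counter()
    for u in utterances:
        unigrams.update(u)
        bigrams.update(zip(u, u[1:]))
    return unigrams, bigrams

def transitional_probability(x, y, unigrams, bigrams):
    # Approximates P(y | x) as count(xy) / count(x). Utterance-final
    # occurrences of x are included in count(x), a simplification.
    return bigrams[(x, y)] / unigrams[x] if unigrams[x] else 0.0

def mutual_information(x, y, unigrams, bigrams):
    # Pointwise MI: log2( P(xy) / (P(x) * P(y)) ); -inf for unseen pairs.
    if bigrams[(x, y)] == 0:
        return float("-inf")
    n_uni, n_bi = sum(unigrams.values()), sum(bigrams.values())
    p_xy = bigrams[(x, y)] / n_bi
    return math.log2(p_xy / ((unigrams[x] / n_uni) * (unigrams[y] / n_uni)))

def pair_scores(u, score, unigrams, bigrams):
    """Score every adjacent segment pair in utterance u."""
    return [score(x, y, unigrams, bigrams) for x, y in zip(u, u[1:])]

def boundaries_local(scores):
    """Local comparison: cut where a pair scores lower than both neighbours."""
    return [i for i in range(1, len(scores) - 1)
            if scores[i] < scores[i - 1] and scores[i] < scores[i + 1]]

def boundaries_global(scores, threshold):
    """Global threshold: cut wherever a pair scores below a fixed value."""
    return [i for i, s in enumerate(scores) if s < threshold]

def segment(u, cuts):
    """Split u after each pair index in cuts (pair i spans u[i], u[i+1])."""
    words, start = [], 0
    for i in cuts:
        words.append(u[start:i + 1])
        start = i + 1
    words.append(u[start:])
    return words

# Toy corpus (assumed): every ordering of three invented words.
toy = ["abcdefghi", "defabcghi", "ghidefabc",
       "abcghidef", "defghiabc", "ghiabcdef"]
uni, bi = train(toy)
tp = pair_scores("abcdefghi", transitional_probability, uni, bi)
mi = pair_scores("abcdefghi", mutual_information, uni, bi)
print(segment("abcdefghi", boundaries_local(tp)))
print(segment("abcdefghi", boundaries_local(mi)))
print(segment("abcdefghi", boundaries_global(tp, 0.5)))
```

On this artificial corpus the two measures and the two decision rules agree, because within-word pairs are perfectly predictable. On natural child-directed speech they can diverge, which is the comparison the paper evaluates.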

References (18)

Anja Belz, An Approach to the Automatic Acquisition of Phonotactic Constraints. Meeting of the Association for Computational Linguistics (1998)

Matthew Harold Davis, Lexical Segmentation in Spoken Word Recognition (2000)

Carl G. De Marcken, Robert C. Berwick, Unsupervised Language Acquisition. PhD thesis (1996)

Irene Philippaki-Warburton, David Holton, Peter A. Mackridge, Greek: A Comprehensive Grammar of the Modern Language (1997)

Mehryar Mohri, Fernando Pereira, Michael Riley, A Rational Design for a Weighted Finite-State Transducer Library. Lecture Notes in Computer Science, pp. 144-158 (1998), 10.1007/BFB0031388

Eleanor Olds Batchelder, Bootstrapping the Lexicon: A Computational Model of Infant Speech Segmentation. Cognition, vol. 83, pp. 167-206 (2002), 10.1016/S0010-0277(02)00002-1

J. R. Saffran, R. N. Aslin, E. L. Newport, Statistical Learning by 8-Month-Old Infants. Science, vol. 274, pp. 1926-1928 (1996), 10.1126/SCIENCE.274.5294.1926

Brian MacWhinney, The CHILDES Project: Tools for Analyzing Talk (1991)

C. Anton Rytting, Greek Word Segmentation Using Minimal Information. Proceedings of the Student Research Workshop at HLT-NAACL 2004, pp. 43-48 (2004), 10.3115/1614038.1614046

Michael R. Brent, An Efficient, Probabilistically Sound Algorithm for Segmentation and Word Discovery. Machine Learning, vol. 34, pp. 71-105 (1999), 10.1023/A:1007541817488
