Using Statistics in Lexical Analysis

作者： K.W. Church

DOI:

关键词: Linguistics 、 sort 、 Salient 、 Lexical analysis 、 Introspection 、 Key Word in Context 、 Lexical functional grammar 、 Computer science 、 Syntax 、 Lexical density

摘要: The computational tools available for studying machine-readable corpora are at present still rather primitive. In the more advanced lexicographic organizations, there concordancing programs (see figure below), which basically KWIC (key word in context (Aho et al., 1988, p. 122), (Salton, 1989, 384)) indexes with additional features such as ability to extend context, sort leftwards well rightwards, and so on. There is very little interactive software. lack of software perhaps part reason why dictionaries produced United States pay attention corpora, based on collections selected citations, augmented by introspection, than analysis whole texts. situation somewhat different Britain. British lexicographers, especially those working foreign learners, beginning depend heavily corpora. They use these basic tool mentioned above fill detailed syntactic descriptions (prompting a move, that will probably dominate lexicography 1990s, towards thorough lexical syntax). Cobuild project 1980s, example, typical procedure was lexicographer given concordances or group words, marked up printout colored pens order identify salient senses, then wrote definitions.

nii.ac.jp 本地加速

uni-leipzig.de 本地加速

hjkm.dk PDF 下载加速

参考文章(20)

Jong-Nae Wang, Jing-Shin Chang, Keh-Yih Su, Mei-Hui Su, A Sequential Truncation Parsing Algorithm Based on the Score Function international workshop/conference on parsing technologies. pp. 95- 104 ,(1989)

Stephanie Seneff, Probabilistic Parsing for Spoken Language Applications international workshop/conference on parsing technologies. pp. 209- 218 ,(1989)

Leonore Crary Hauck, Stuart Berg Flexner, The Random House dictionary of the English language Random House. ,(1968)

Zellig Sabbettai Harris, Mathematical structures of language ,(1968)

Philip Babcock Gove, Webster's third new international dictionary of the English language, unabridged : utilizing all the experience and resources of more than one hundred years of Merriam-Webster dictionaries G. & C. Merriam. ,(1961)

Alinda NELSON, Collins COBUILD English Language Dictionary ,(1987)

Gerald Salton, Automatic text processing ,(1988)

Steven J. DeRose, Grammatical category disambiguation by statistical optimization Computational Linguistics. ,vol. 14, pp. 31- 39 ,(1988) , 10.5555/49084.49087

Patrick Hanks, Kenneth Ward Church, Word association norms, mutual information, and lexicography Computational Linguistics. ,vol. 16, pp. 22- 29 ,(1990) , 10.5555/89086.89095

10.

Robert Burchfield, Frequency Analysis of English Usage: Lexicon and Grammar. By W. Nelson Francis and Henry Kučera with the assistance of Andrew W. Mackie. Boston: Houghton Mifflin. 1982. x + 561: Journal of English Linguistics. ,vol. 18, pp. 64- 70 ,(1985) , 10.1177/007542428501800107

Using Statistics in Lexical Analysis

来源期刊

我的账户

Using Statistics in Lexical Analysis

来源期刊

相似文章 10

我的账户