Computerizing a machine readable dictionary

Author: G. Jan Wilms

DOI: 10.1145/98949.99149

Abstract: Current research in natural language processing is characterized by the development of theories of grammar which strongly depend on the lexicon to drive parsing systems (e.g. Lexical Functional Grammar, Generalized Phrase Structure Grammar, Functional Unification Grammar). These requirements go far beyond the typical small, hand-coded vocabularies developed for theoretical or demonstration purposes. Many researchers have independently discovered the rich, though unstructured, knowledge sources that machine readable dictionaries (MRDs) offer. This paper reports on an attempt to impose structure on the Funk and Wagnalls Dictionary by means of a parser written in Turbo Pascal, using a mixed approach of pattern matching and transition networks. The resulting computerized dictionary is 95% accurate, but correcting the final 5% of incorrectly parsed entries involves painstakingly scrutinizing the output, modifying the parser to handle exceptional cases that occur only once or twice in the entire MRD, and editing the MRD to remove errors introduced by the OCR process.
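The mixed approach named in the abstract can be illustrated with a minimal sketch. It is not the author's Turbo Pascal parser: the entry layout (`<headword> <pos>. 1 <definition> 2 <definition> ...`), the names `HEAD_PATTERN`, `TRANSITIONS` and `parse_entry`, and the token classes are all assumptions made for illustration. A regular expression handles the fixed head of an entry (pattern matching), and a small state table walks the remaining tokens to split numbered senses (a transition network).

```python
import re

# Assumed, simplified entry layout (not the real Funk & Wagnalls format):
#   "<headword> <pos>. 1 <definition> 2 <definition> ..."
HEAD_PATTERN = re.compile(
    r"^(?P<headword>[A-Za-z\-]+)\s+(?P<pos>n|v|adj|adv)\.\s+(?P<body>.+)$"
)

# Hypothetical transition network: state -> {token class -> next state}.
TRANSITIONS = {
    "START": {"NUM": "SENSE"},
    "SENSE": {"WORD": "DEF"},
    "DEF": {"WORD": "DEF", "NUM": "SENSE"},
}
ACCEPTING = {"DEF"}


def classify(token):
    """Map a raw token to the token class used by the network."""
    return "NUM" if token.isdigit() else "WORD"


def parse_entry(line):
    """Return a structured entry, or None if the line does not parse."""
    match = HEAD_PATTERN.match(line.strip())
    if match is None:
        return None  # head of the entry did not fit the pattern

    state = "START"
    senses = []
    for token in match.group("body").split():
        next_state = TRANSITIONS[state].get(classify(token))
        if next_state is None:
            return None  # no arc for this token: flag for manual review
        if next_state == "SENSE":
            senses.append([])  # a sense number opens a new definition
        else:
            senses[-1].append(token)  # a word extends the current definition
        state = next_state

    if state not in ACCEPTING:
        return None  # entry ended on a sense number with no definition
    return {
        "headword": match.group("headword"),
        "pos": match.group("pos"),
        "senses": [" ".join(words) for words in senses],
    }
```

Under these assumptions, `parse_entry("lexicon n. 1 A dictionary. 2 The vocabulary of a language.")` yields two senses, while an entry without sense numbers returns `None`. A bare digit inside a definition would be misread as a new sense number, which is the kind of exceptional case the abstract describes as needing parser changes or manual correction.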

References (11)
Sidney I. Landau. Funk & Wagnalls Standard Desk Dictionary. Funk & Wagnalls (1976).
Ted Briscoe, Bran Boguraev. Large lexicons for natural language processing: utilising the grammar coding system of LDOCE. Computational Linguistics, vol. 13, pp. 203-218 (1987).
Jean-Louis Binot, Karen Jensen. Disambiguating prepositional phrase attachments by using on-line dictionary definitions. Computational Linguistics, vol. 13, pp. 251-260 (1987).
Theresa A. Waldspurger, Bran Boguraev, Ted Briscoe. Computational lexicography for natural language processing. Language, vol. 66, pp. 626- (1989). DOI: 10.2307/414634
Karen Jensen, Jean-Louis Binot. Dictionary text entries as a source of knowledge for syntactic and other disambiguations. Conference on Applied Natural Language Processing, pp. 152-159 (1988). DOI: 10.3115/974235.974262
Kenneth Ward Church, Patrick Hanks. Word association norms, mutual information, and lexicography. Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics, pp. 76-83 (1989). DOI: 10.3115/981623.981633
Robert A. Amsler. A taxonomy for English nouns and verbs. Proceedings of the 19th Annual Meeting of the Association for Computational Linguistics, pp. 133-138 (1981). DOI: 10.3115/981923.981959
Doug Lenat, Mayank Prakash, Mary Shepherd. CYC: Using common sense knowledge to overcome brittleness and knowledge acquisition bottlenecks. AI Magazine, vol. 6, pp. 65-85 (1986). DOI: 10.1609/AIMAG.V6I4.510
Edward J. Briscoe. Computational lexicography for natural language. Halsted Press (1988).
David Palermo, James Jenkins. Word association norms (1964).