Automatic acquisition of subcategorization frames from tagged text

作者: Michael R. Brent , Robert C. Berwick

DOI: 10.3115/112405.112478

关键词: Computer scienceArtificial intelligenceSpeech recognitionSubcategorizationVerbCompleteness (order theory)Natural language processingText corpus

摘要: This paper describes an implemented program that takes a tagged text corpus and generates partial list of the subcategorization frames in which each verb occurs. The completeness output increases monotonically with total occurrences training corpus. False positive rates are one to three percent. Five currently detected we foresee no impediment detecting many more. Ultimately, expect provide large dictionary NLP community train dictionaries for specific corpora.

参考文章(10)
Patrick Hanks, Kenneth Ward Church, Word association norms, mutual information, and lexicography Computational Linguistics. ,vol. 16, pp. 22- 29 ,(1990) , 10.5555/89086.89095
Michael R. Brent, Automatic semantic classification of verbs from their syntactic contexts Proceedings of the fifth conference on European chapter of the Association for Computational Linguistics -. pp. 222- 226 ,(1991) , 10.3115/977180.977219
Eric Brill, David Magerman, Mitchell Marcus, Beatrice Santorini, Deducing linguistic structure from the statistics of large corpora human language technology. pp. 275- 282 ,(1990) , 10.3115/116580.116670
Steven Pinker, Learnability and Cognition Language. ,vol. 68, pp. 402- ,(2013) , 10.7551/MITPRESS/9700.001.0001
Lila Gleitman, The Structural Sources of Verb Meanings Language Acquisition. ,vol. 1, pp. 3- 55 ,(1990) , 10.1207/S15327817LA0101_2
Kenneth Ward Church, A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text conference on applied natural language processing. pp. 136- 143 ,(1988) , 10.3115/974235.974260
Carl G. de Marcken, Parsing the LOB corpus Proceedings of the 28th annual meeting on Association for Computational Linguistics -. pp. 243- 251 ,(1990) , 10.3115/981823.981854
Donald Hindle, Noun classification from predicate-argument structures Proceedings of the 28th annual meeting on Association for Computational Linguistics -. pp. 268- 275 ,(1990) , 10.3115/981823.981857
Frank A. Smadja, Kathleen R. McKeown, Automatically extracting and representing collocations for language generation Proceedings of the 28th annual meeting on Association for Computational Linguistics -. pp. 252- 259 ,(1990) , 10.3115/981823.981855
Arnold M. Zwicky, In a Manner of Speaking Ohio State University. Department of Linguistics. ,(1971)