FASTUS: A System for Extracting Information from Natural-Language Text

Authors: Jerry R. Hobbs, Douglas Appelt Sr, John S. Bear, David Israel Sr, W. M. Tyson

DOI: 10.21236/ADA259435

Abstract: FASTUS is a system for extracting information from free text in English, and potentially other languages as well, for entry into a database, and potentially for other applications. It works essentially as a cascaded, nondeterministic finite state automaton. There are four steps in the operation of FASTUS. In Step 1, sentences are scanned for certain trigger words to determine whether further processing should be done. In Step 2, noun groups, verb groups, and prepositions and some other particles are recognized. The input to Step 3 is the sequence of phrases recognized in Step 2; patterns of interest are identified in Step 3 and corresponding "incident structures" are built up. In Step 4, incident structures that derive from the same incident are identified and merged, and these are used in generating database entries. FASTUS is an order of magnitude faster than any comparable system; it can process a news report in an average of less than eleven seconds. This translates directly into fast development time. In the three and a half weeks between its first use and the MUC-4 evaluation in May 1992, we were able to build up the domain knowledge to the point where FASTUS was among the leaders in the evaluation.
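
The four steps described in the abstract can be illustrated with a toy cascade. The following is only a minimal sketch under stated assumptions, not the FASTUS implementation: the trigger words, the lexicon, the single pattern, the slot names (perpetrator, action, target, location), and every function name are hypothetical and chosen for illustration. It also uses deterministic grouping and one regular expression where the abstract describes a nondeterministic finite state automaton.

```python
import re

# Hypothetical trigger words and lexicon for a toy domain (not from FASTUS).
TRIGGERS = {"bombed", "attacked", "kidnapped"}
LEXICON = {
    "the": "DET", "a": "DET",
    "guerrillas": "N", "rebels": "N", "embassy": "N", "mayor": "N",
    "lima": "N", "bogota": "N",
    "bombed": "V", "attacked": "V", "kidnapped": "V",
    "in": "P", "on": "P",
}

def tokenize(sentence):
    """Lowercase the sentence and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", sentence.lower())

def step1_filter(sentences):
    """Step 1: keep only sentences that contain a trigger word."""
    return [s for s in sentences if TRIGGERS & set(tokenize(s))]

def step2_phrases(sentence):
    """Step 2: group tokens into noun groups (N), verb groups (V), prepositions (P)."""
    groups = []  # each entry is [phrase_type, [words]]
    for word in tokenize(sentence):
        tag = LEXICON.get(word)
        if tag == "DET":
            groups.append(["N", []])          # a determiner opens a new noun group
        elif tag in ("N", "V"):
            if groups and groups[-1][0] == tag:
                groups[-1][1].append(word)    # extend the current group
            else:
                groups.append([tag, [word]])
        elif tag == "P":
            groups.append(["P", [word]])
        # Words missing from the toy lexicon are skipped (a simplification).
    return [(ptype, " ".join(words)) for ptype, words in groups]

# Step 3 pattern over phrase types: noun group, verb group, noun group,
# optionally followed by a preposition and another noun group.
PATTERN = re.compile(r"NVN(PN)?")

def step3_incident(phrases):
    """Step 3: match the pattern over the phrase sequence and fill an incident structure."""
    types = "".join(ptype for ptype, _ in phrases)
    match = PATTERN.search(types)
    if match is None:
        return None
    i = match.start()
    return {
        "perpetrator": phrases[i][1],
        "action": phrases[i + 1][1],
        "target": phrases[i + 2][1],
        "location": phrases[i + 4][1] if match.group(1) else None,
    }

def compatible(a, b):
    """Two structures are compatible if no filled slot conflicts."""
    return all(a[k] is None or b[k] is None or a[k] == b[k] for k in a)

def step4_merge(incidents):
    """Step 4: merge compatible incident structures, filling empty slots."""
    merged = []
    for inc in incidents:
        for existing in merged:
            if compatible(existing, inc):
                for key, value in inc.items():
                    if existing[key] is None:
                        existing[key] = value
                break
        else:
            merged.append(dict(inc))
    return merged

def extract(sentences):
    """Run the four toy steps in sequence and return merged incident structures."""
    incidents = []
    for sentence in step1_filter(sentences):
        incident = step3_incident(step2_phrases(sentence))
        if incident is not None:
            incidents.append(incident)
    return step4_merge(incidents)
```

Calling extract on the sentences "The guerrillas bombed the embassy." and "The guerrillas bombed the embassy in Lima." yields one merged structure, because the two incident structures agree on every filled slot and the second supplies the missing location. Each stage consumes only the output of the previous one, which is the sense of "cascaded" that this sketch tries to convey.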

References (5)
Alan W Black, Finite state machines from feature grammars. International Workshop/Conference on Parsing Technologies, pp. 277-285 (1989).
Fernando Pereira, Finite-state approximations of grammars. Human Language Technology, pp. 20-25 (1990). DOI: 10.3115/116580.116592
Jerry R. Hobbs, Douglas E. Appelt, John Bear, Mabry Tyson, Robust Processing of Real-World Natural-Language Texts. Conference on Applied Natural Language Processing, pp. 186-192 (1992). DOI: 10.3115/974499.974533
Fernando C. N. Pereira, Rebecca N. Wright, Finite-State Approximation of Phrase Structure Grammars. Meeting of the Association for Computational Linguistics, pp. 246-255 (1991). DOI: 10.3115/981344.981376