Automatic identification of causal relations in text and their use for improving precision in information retrieval

作者: Christopher Soo-Guan Khoo

DOI:

关键词: Identification (information)Natural language processingDocument retrievalInformation retrievalWildcardArtificial intelligenceComputational linguisticsThesaurus (information retrieval)Information extractionMatching (statistics)SentenceComputer science

摘要: This study represents one attempt to make use of relations expressed in text improve information retrieval effectiveness. In particular, the investigated whether obtained by matching causal documents with users' queries could be used document results comparison using just term without considering relations. An automatic method for identifying and extracting cause-effect Wall Street Journal was developed. The uses linguistic clues identify recourse knowledge-based inferencing. successful about 68% that were clearly within a sentence or between adjacent sentences text. Of instances computer program identified as relations, 72% can considered correct. The an experimental system database full-text documents. Causal relation found yield small but significant improvement when weights combining scores from different types customized each query--as SDI routing situation. best combined word proximity (matching pairs causally related words query co-occur sentences). An analysis manually indicate bigger improvements expected more accurate identification relations. kind which member (either cause effect) represented wildcard match any term. The also Roget's International Thesaurus (3rd ed.) expand terms synonymous would Using Roget category codes addition keywords did give better results. However, at nonrelevant than relevant ones.

参考文章(141)
Maria Sidiropoulou, John Kontos, On the acquisition of causal knowledge from scientific texts with attribute grammars International Journal of Applied Expert Systems archive. ,vol. 4, pp. 31- 48 ,(1991)
Donna Harman, Overview of the First Text REtrieval Conference (TREC-1). text retrieval conference. pp. 1- 20 ,(1992)
Jim Daniell, Dan Simmons, Mallory Selfridge, Learning Causal Models by Understanding Real-World Natural Language Explanations. CAIA. pp. 378- 383 ,(1985)
David E. Rumelhart, Metaphor and Thought: Some problems with the notion of literal meanings Cambridge University Press. pp. 71- 82 ,(1993) , 10.1017/CBO9781139173865.007
Swanson Dr, Medical literature as a potential source of new knowledge. Bulletin of The Medical Library Association. ,vol. 78, pp. 29- 37 ,(1990)
D K Harman, The first text REtrieval conference (TREC-1) Special Publication (NIST SP) - 500-207. ,(1993) , 10.6028/NIST.SP.500-207
Richard Henry Wojcik, The expression of causation in English clauses Univ. Microfilms, a Xerox. ,(1973)
Xin Lu, On application of case relations to document retrieval An application of case relations to document retrieval. pp. 127- 127 ,(1992)
Shiyali Ramamrita Ranganathan, The colon classification Graduate School of Library Service, Rutgers, the State Univ.. ,(1965)
M. J. Cresswell, Adverbs of Causation Adverbial Modification. pp. 173- 192 ,(1981) , 10.1007/978-94-009-5414-4_7