Learning to Refine Expansion Terms for Biomedical Information Retrieval Using Semantic Resources

作者: Bo Xu , Hongfei Lin , Yuan Lin

DOI: 10.1109/TCBB.2018.2801303

关键词:

摘要: With the rapid development of biomedicine, number biomedical articles has increased accordingly, which presents a great challenge for biologists trying to keep up with latest research. Information retrieval seeks meet this by searching among large based on given queries and providing most relevant ones fulfill information needs. As an effective technique, query expansion some room improvement achieve desired performance when directly applied because there exist many domain-related terms both in users' related articles. To solve problem, we propose framework learning-to-rank methods, refine candidate training term-ranking models select terms. train models, first pseudo-relevance feedback method MeSH then represent as feature vectors defining corpus-based term features resource-based features. Experimental results obtained TREC genomics datasets show that our can capture more expand original effectively improve performance.

参考文章(38)
Mike Gatford, Micheline Hancock-Beaulieu, Susan Jones, Stephen E. Robertson, Steve Walker, Okapi at TREC text retrieval conference. pp. 109- 123 ,(1994)
Alan R. Aronson, Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program american medical informatics association annual symposium. pp. 17- 21 ,(2001)
Jerome H. Friedman, Greedy function approximation: A gradient boosting machine. Annals of Statistics. ,vol. 29, pp. 1189- 1232 ,(2001) , 10.1214/AOS/1013203451
Matthew Lease, James Allan, W. Bruce Croft, Regression Rank: Learning to Meet the Opportunity of Descriptive Queries Lecture Notes in Computer Science. pp. 90- 101 ,(2009) , 10.1007/978-3-642-00958-7_11
Chengxiang Zhai, John Lafferty, Model-based feedback in the language modeling approach to information retrieval Proceedings of the tenth international conference on Information and knowledge management - CIKM'01. pp. 403- 410 ,(2001) , 10.1145/502585.502654
Zongcheng Ji, Bin Wang, Learning to rank for question routing in community question answering conference on information and knowledge management. pp. 2363- 2368 ,(2013) , 10.1145/2505515.2505670
Dongqing Zhu, Stephen Wu, Ben Carterette, Hongfang Liu, Using large clinical corpora for query expansion in text-based cohort identification Journal of Biomedical Informatics. ,vol. 49, pp. 275- 281 ,(2014) , 10.1016/J.JBI.2014.03.010
Yuan Lin, Hongfei Lin, Song Jin, Zheng Ye, Social annotation in query expansion: a machine learning approach international acm sigir conference on research and development in information retrieval. pp. 405- 414 ,(2011) , 10.1145/2009916.2009972