Dialect Topic Modeling for Improved Consumer Medical Search

Authors: Hongyuan Zha, Steven P Crain, Shuang-Hong Yang, Yu Jiao

Abstract: Access to health information by consumers is hampered by a fundamental language gap. Current attempts to close the gap leverage consumer-oriented health information, which does not, however, have good coverage of slang medical terminology. In this paper, we present a Bayesian model to automatically align documents with different dialects (slang, common and technical) while extracting their semantic topics. The proposed diaTM model enables effective retrieval, even when the query contains slang words, by explicitly modeling the mixtures of dialects in documents and the joint influence of dialects and topics on word selection. Simulations using consumer questions to retrieve medical information from a corpus of medical documents show that diaTM achieves a 25% improvement in information retrieval relevance, measured by nDCG@5, over an LDA baseline.
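
For readers unfamiliar with the evaluation metric, nDCG@5 can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the function names are hypothetical, the gain is assumed to be the raw graded relevance (some formulations use 2^rel - 1 instead), and the ideal ranking is assumed to be a reordering of the same judged results.

```python
import math

def dcg_at_k(relevances, k):
    # Discounted cumulative gain over the top-k results.
    # Assumption: linear gain (rel), log2 discount by rank position.
    return sum(rel / math.log2(rank + 2)
               for rank, rel in enumerate(relevances[:k]))

def ndcg_at_k(relevances, k=5):
    # Normalize by the DCG of the best possible ordering
    # of the same relevance judgments.
    ideal_dcg = dcg_at_k(sorted(relevances, reverse=True), k)
    if ideal_dcg == 0:
        return 0.0  # no relevant results at all
    return dcg_at_k(relevances, k) / ideal_dcg

# Hypothetical graded judgments for one query, in ranked order.
print(ndcg_at_k([2, 0, 3, 1, 0], k=5))
```

A score of 1.0 means the top five results are already in the best possible order; a relative improvement such as the one reported is then computed from the average score across queries for each system.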