Detecting high-quality posts in community question answering sites

作者: Yuan Yao , Hanghang Tong , Tao Xie , Leman Akoglu , Feng Xu

DOI: 10.1016/J.INS.2014.12.038

关键词:

摘要: Community question answering (CQA) has become a new paradigm for seeking and sharing information. In CQA sites, users can ask answer questions, provide feedback (e.g., by voting or commenting) to these questions/answers. this article, we propose the early detection of high-quality Such help discover high-impact that would be widely recognized in as well identify useful gain much positive from site users. particular, view post quality perspective outcome. First, our key intuition is score an strongly positively correlated with its question, verify such correlation two real data sets. Second, armed verified correlation, family algorithms jointly detecting questions answers soon after they are posted sites. We conduct extensive experimental evaluations demonstrate effectiveness efficiency approaches. Overall, outperform best competitor prediction performance, while enjoying linear scalability respect total number posts.

参考文章(32)
Tao Xie, Jian Lu, Hanghang Tong, Leman Akoglu, Yuan Yao, Feng Xu, Want a Good Answer? Ask a Good Question First! arXiv: Databases. ,(2013)
Swapna Gottipati, David Lo, Jing Jiang, Finding relevant answers in software forums automated software engineering. pp. 323- 332 ,(2011) , 10.1109/ASE.2011.6100069
Qiaoling Liu, Eugene Agichtein, Gideon Dror, Evgeniy Gabrilovich, Yoelle Maarek, Dan Pelleg, Idan Szpektor, Predicting web searcher satisfaction with existing community-based answers international acm sigir conference on research and development in information retrieval. pp. 415- 424 ,(2011) , 10.1145/2009916.2009974
Baoli Li, Yandong Liu, Eugene Agichtein, CoCQA Proceedings of the Conference on Empirical Methods in Natural Language Processing - EMNLP '08. pp. 937- 946 ,(2008) , 10.3115/1613715.1613836
Eugene Agichtein, Carlos Castillo, Debora Donato, Aristides Gionis, Gilad Mishne, Finding high-quality content in social media web search and data mining. pp. 183- 194 ,(2008) , 10.1145/1341531.1341557
Seyed Mehdi Nasehi, Jonathan Sillito, Frank Maurer, Chris Burns, What makes a good code example?: A study of programming Q&A in StackOverflow international conference on software maintenance. pp. 25- 34 ,(2012) , 10.1109/ICSM.2012.6405249
Anton Barua, Stephen W. Thomas, Ahmed E. Hassan, What are developers talking about? An analysis of topics and trends in Stack Overflow Empirical Software Engineering. ,vol. 19, pp. 619- 654 ,(2014) , 10.1007/S10664-012-9231-Y
Chirag Shah, Jefferey Pomerantz, Evaluating and predicting answer quality in community QA international acm sigir conference on research and development in information retrieval. pp. 411- 418 ,(2010) , 10.1145/1835449.1835518
Ripon K. Saha, Avigit K. Saha, Dewayne E. Perry, Toward understanding the causes of unanswered questions in software information sites: a case study of stack overflow foundations of software engineering. pp. 663- 666 ,(2013) , 10.1145/2491411.2494585
Maggy Anastasia Suryanto, Ee Peng Lim, Aixin Sun, Roger H. L. Chiang, Quality-aware collaborative question answering Proceedings of the Second ACM International Conference on Web Search and Data Mining - WSDM '09. pp. 142- 151 ,(2009) , 10.1145/1498759.1498820