Discovering genres of online discussion threads via text mining

作者: Fu-Ren Lin , Lu-Shih Hsieh , Fu-Tai Chuang

DOI: 10.1016/J.COMPEDU.2008.10.005

关键词:

摘要: As course management systems (CMS) gain popularity in facilitating teaching. A forum is a key component to facilitate the interactions among students and teachers. Content analysis most popular way study discussion forum. But content human labor intensity process; for example, coding process relies heavily on manual interpretation; it time energy consuming. In an asynchronous virtual learning environment, instructor needs keep monitoring from order maintain quality of However, consuming difficult instructors fulfill this need especially K12 This research proposes genre classification system, called GCS, automatic process. We treat as document task via modern data mining techniques. The posting can be perceived announcement, question, clarification, interpretation, conflict, assertion, etc. examines coherence between GCS experts' judgment terms recall precision, discusses how we adjust parameters improve coherence. Based empirical results, adopts cascade model achieve evaluation classified genres repository postings online earth science senior high school shows that effectively process, proposed deal with imbalanced distribution nature postings. These results imply based perform system.

参考文章(26)
Nathalie Japkowicz, Learning from Imbalanced Data Sets: A Comparison of Various Strategies * International Workshop on Learning from Imbalanced Data Sets. ,(2000)
Erping Zhu, Meaning Negotiation, Knowledge Construction, and Mentoring in a Distance Learning Course Selected Research and Development Presentations at the 1996 National Convention of the Association for Educational Communications and Technology. ,vol. 1996, ,(1996)
Sarah Schrire, A Model for Evaluating the Process of Learning in Asynchronous Computer Conferencing. The Journal of Instruction Delivery Systems. ,vol. 17, pp. 6- 12 ,(2003)
Judith Pena-Shaff, Wendy Martin, An Epistemological Framework for Analyzing Student Interactions in Computer-Mediated Communication Environments The Journal of Interactive Learning Research. ,vol. 12, pp. 41- 68 ,(2001)
Sam Scott, Stan Matwin, Feature Engineering for Text Classification international conference on machine learning. pp. 379- 388 ,(1999)
Noriko Hara, Curtis Jay Bonk, Charoula Angeli, Content analysis of online discussion in an applied educational psychology course Instructional Science. ,vol. 28, pp. 115- 152 ,(2000) , 10.1023/A:1003764722829
Stephan Bloehdorn, Andreas Hotho, Boosting for text classification with semantic features web mining and web usage analysis. pp. 149- 166 ,(2004) , 10.1007/11899402_10
Mark A. Hall, Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques ,(1999)
Fuchun Peng, Xiangji Huang, Dale Schuurmans, Shaojun Wang, Text Classification in Asian Languages without Word Segmentation Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages. ,vol. 11, pp. 41- 48 ,(2003) , 10.3115/1118935.1118941