Automatic Text Categorization of Mathematical Word Problems

作者: Suleyman Cetintas , Luo Si , Joo Young Park , Yan Ping Xin , Dake Zhang

DOI:

关键词: Discriminative modelComputer scienceWord problem (mathematics education)Natural language processingArtificial intelligenceProbabilistic logicPart of speechCategorizationPreprocessorSpeech recognitionSupport vector machineText processing

摘要: This paper describes a novel application of text categorization for mathematical word problems , namely Multiplicative Compare and Equal Group problems. The empirical results analysis show that common processing techniques such as stopword removal stemming should be selectively used. It is highly beneficial not to remove stopwords do stemming. Part speech tagging also used distinguish words in discriminative parts from the non-discriminative which only fail help but even mislead decision An SVM classifier with these outperforms an default setting (i.e. stemming). Furthermore, probabilistic meta proposed combine weighted two classifiers different problem representations generated by preprocessing techniques. further improves accuracy.

参考文章(17)
Erik de Corte, G. B. Greer, Lieven Verschaffel, Making sense of word problems ,(2000)
Sam Scott, Stan Matwin, Feature Engineering for Text Classification international conference on machine learning. pp. 379- 388 ,(1999)
Thorsten Joachims, Making large-scale support vector machine learning practical Advances in kernel methods. pp. 169- 184 ,(1999)
Belur V. Dasarathy, Nearest neighbor (NN) norms: NN pattern classification techniques Los Alamitos: IEEE Computer Society Press. ,(1991)
r;ribeiro-neto bueza-yates (b), Modern Information Retrieval ,(1999)
James P Callan, W.Bruce Croft, John Broglio, TREC and TIPSTER experiments with INQUERY text retrieval conference. ,vol. 31, pp. 327- 343 ,(1995) , 10.1016/0306-4573(94)00050-D
William B. Frakes, Ricardo Baeza-Yates, Information Retrieval: Data Structures and Algorithms ,(1992)
Rwey-Lin Shiah, Margo A. Mastropieri, Thomas E. Scruggs, Barbara J. Mushinski Fulk, The Effects of Computer-Assisted Instruction on the Mathematical Problem Solving of Students With Learning Disabilities Exceptionality. ,vol. 5, pp. 131- 161 ,(1994) , 10.1207/S15327035EX0503_2
Yiming Yang, Xin Liu, A re-examination of text categorization methods international acm sigir conference on research and development in information retrieval. pp. 42- 49 ,(1999) , 10.1145/312624.312647