Learning and Evaluation in the Presence of Class Hierarchies: Application to Text Categorization

作者: Svetlana Kiritchenko , Stan Matwin , Richard Nock , A. Fazel Famili

DOI: 10.1007/11766247_34

关键词:

摘要: This paper deals with categorization tasks where categories are partially ordered to form a hierarchy. First, it introduces the notion of consistent classification which takes into account semantics class Then, presents novel global hierarchical approach that produces classification. algorithm AdaBoost as underlying learning procedure significantly outperforms corresponding “flat” approach, i.e. does not take information. In addition, proposed surpasses local top-down on many synthetic and real tasks. For evaluation purposes, we use measure has some attractive properties: is simple, requires no parameter tuning, gives credit correct discriminates errors by both distance depth in

参考文章(10)
Andrew McCallum, Ronald Rosenfeld, Thomas Mitchell, Andrew Y Ng, None, Improving Text Classification by Shrinkage in a Hierarchy of Classes international conference on machine learning. pp. 359- 367 ,(1998)
Yu He, Ke Wang, Senqiang Zhou, Hierarchical Classification of Real Life Documents. siam international conference on data mining. pp. 1- 16 ,(2001)
Mehran Sahami, Daphne Koller, Hierarchically Classifying Documents Using Very Few Words international conference on machine learning. pp. 170- 178 ,(1997)
Miguel E. Ruiz, Padmini Srinivasan, Hierarchical Text Categorization Using Neural Networks Information Retrieval. ,vol. 5, pp. 87- 118 ,(2002) , 10.1023/A:1012782908347
Robert E. Schapire, Yoram Singer, Improved boosting algorithms using confidence-rated predictions conference on learning theory. ,vol. 37, pp. 80- 91 ,(1998) , 10.1145/279943.279960
Susan Dumais, Hao Chen, Hierarchical classification of Web content international acm sigir conference on research and development in information retrieval. ,vol. 34, pp. 256- 263 ,(2000) , 10.1145/345508.345593
Jin Huang, C.X. Ling, Using AUC and accuracy in evaluating learning algorithms IEEE Transactions on Knowledge and Data Engineering. ,vol. 17, pp. 299- 310 ,(2005) , 10.1109/TKDE.2005.50
Aixin Sun, Ee-Peng Lim, Hierarchical text classification and evaluation international conference on data mining. pp. 521- 528 ,(2001) , 10.1109/ICDM.2001.989560
Ioannis Tsochantaridis, Thomas Hofmann, Thorsten Joachims, Yasemin Altun, Support vector machine learning for interdependent and structured output spaces international conference on machine learning. pp. 104- ,(2004) , 10.1145/1015330.1015341
Hendrik Blockeel, Maurice Bruynooghe, Sašo Džeroski, Jan Ramon, Jan Struyf, None, Hierarchical multi-classification knowledge discovery and data mining. pp. 21- 35 ,(2002)