Medical text categorization using SEBLA and Kernel Discriminant Analysis

Authors: Muhammad Atif Tahir, Emdad Khan, Adel Al Salem

DOI: 10.1109/WSWAN.2015.7210310

Keywords: Text categorization; Tree kernel; Component (UML); Natural language processing; Computer science; Machine learning; Categorization; Field (computer science); Text mining; Educational data; Artificial intelligence; Kernel Fisher discriminant analysis

Abstract: Educational data mining (EDM) is an emerging field that takes advantage of natural language processing, data mining, statistics, and machine learning algorithms for different types of educational data, especially data related to medical e-learning. Medical text categorization is one major component that helps students search e-learning material more easily and effectively. In this paper, spectral-based Kernel Discriminant Analysis is introduced for medical text categorization. We evaluated the proposed approach on the 10 most frequent categories of the cardiovascular diseases group from the Ohsumed data sets. Compared with existing approaches, the results indicate a significant increase in performance. In order to further refine the search, a Semantic Engine that uses a Brain-Like Approach (SEBLA) is also presented in this paper.
