Tree-based models for inductive classification on the Web Of Data

作者: Giuseppe Rizzo , Claudia d’Amato , Nicola Fanizzi , Floriana Esposito

DOI: 10.1016/J.WEBSEM.2017.05.001

关键词:

摘要: Abstract The Web of Data, which is one the dimensions Semantic (SW), represents a tremendous source information, motivates increasing attention to formalization and application machine learning methods for solving tasks such as concept learning, link prediction, inductive instance retrieval in this context. However, Data also characterized by various forms uncertainty, owing its inherent incompleteness (missing uneven data distributions) noise, may affect open distributed architectures. In paper, we focus on task regarded classification problem. proposed solution framework Terminological Decision Trees from examples described an ontological knowledge base, be used performing classifications. For purpose, suitable pruning strategies new prediction procedure are proposed. Furthermore, order tackle class-imbalance distribution problem, extended ensembles called Random Forests . has been evaluated, comparative experiments, with main state art solutions grounded similar approach, showing that: (1) employment formalized can improve model predictiveness; (2) outperform usage single Tree , particularly when base endowed large number concepts roles; (3) exploited related problems, predicting values given properties finite ranges.

参考文章(43)
Giuseppe Rizzo, Claudia d’Amato, Nicola Fanizzi, Floriana Esposito, Assertion Prediction with Ontologies through Evidence Combination URSW (LNCS Vol.). pp. 282- 299 ,(2013) , 10.1007/978-3-642-35975-0_15
Linh Anh Nguyen, Andrzej Szałas, Logic-Based Roughification Rough Sets and Intelligent Systems (1). pp. 517- 543 ,(2013) , 10.1007/978-3-642-30344-9_19
Gary M. Weiss, The Impact of Small Disjuncts on Classifier Learning Annals of Information Systems. pp. 193- 226 ,(2010) , 10.1007/978-1-4419-1280-0_9
I Bratko, T. Niblett, Learning decision rules in noisy domains Proceedings of Expert Systems '86, The 6Th Annual Technical Conference on Research and development in expert systems III. pp. 25- 34 ,(1987)
Achim Rettinger, Uta Lösch, Volker Tresp, Claudia d’Amato, Nicola Fanizzi, Mining the Semantic Web Data Mining and Knowledge Discovery. ,vol. 24, pp. 613- 662 ,(2012) , 10.1007/S10618-012-0253-2
Bin Fu, Zhihai Wang, Rong Pan, Guandong Xu, Peter Dolog, An Integrated Pruning Criterion for Ensemble Learning Based on Classification Accuracy and Diversity Advances in Intelligent Systems and Computing. pp. 47- 58 ,(2013) , 10.1007/978-3-642-30867-3_5
Claudia d’Amato, Nicola Fanizzi, Floriana Esposito, Query Answering and Ontology Population: An Inductive Approach Lecture Notes in Computer Science. pp. 288- 302 ,(2008) , 10.1007/978-3-540-68234-9_23
Claudia d'Amato, Nicola Fanizzi, Floriana Esposito, Inductive learning for the Semantic Web: What does it buy? Social Work. ,vol. 1, pp. 53- 59 ,(2010) , 10.3233/SW-2010-0007
Nicola Fanizzi, Claudia d’Amato, Floriana Esposito, Induction of concepts in web ontologies through terminological decision trees european conference on machine learning. pp. 442- 457 ,(2010) , 10.1007/978-3-642-15880-3_34
Nicola Fanizzi, Claudia d’Amato, Floriana Esposito, ReduCE: A Reduced Coulomb Energy Network Method for Approximate Classification Lecture Notes in Computer Science. pp. 323- 337 ,(2009) , 10.1007/978-3-642-02121-3_26