Ontology Learning Based on Word Embeddings for Text Big Data Extraction

作者: Nesma Mahmoud , Heba Elbeh , Hatem M. Abdlkader

DOI: 10.1109/ICENCO.2018.8636154

关键词:

摘要: Big Data term describes data that exists everywhere in humongous volumes, raw forms, and heterogenous types. Unstructured uncategorized forms 95% of big data. Text lacks to efficiently extract domain-relevant a suitable time. Thus, text stills barrier for integration subsequently analytics. Because can’t consider its process preparing On the other side, ontology represents information knowledge graph schema provides shareable, reusing domain-specific fits needs extracting domain relevant So, this paper proposes an learning (OL) methodology extraction. OL aims algorithms, techniques, tools automatic construction from text. The proposed method exploits deep approach i.e., word embeddings, advanced hierarchical clustering BIRCH. utilization embeddings improve quality extraction reduce processing Also, unsupervisory learns massive amount unlabeled This great benefit solves analytical challenge In evaluation, precision, recall, f – value work running time performance are measured. is evaluated by comparing results with gold standard datasets results. Experimental evaluation demonstrate

参考文章(18)
Cheikh Kacfah Emani, Nadine Cullot, Christophe Nicolle, Understandable Big Data Computer Science Review. ,vol. 17, pp. 70- 81 ,(2015) , 10.1016/J.COSREV.2015.05.002
Qian Zhang, Bin Wu, Juan Yang, Parallelization of ontology construction and fusion based on MapReduce international conference on cloud computing. pp. 439- 443 ,(2014) , 10.1109/CCIS.2014.7175775
H. V. Jagadish, Johannes Gehrke, Alexandros Labrinidis, Yannis Papakonstantinou, Jignesh M. Patel, Raghu Ramakrishnan, Cyrus Shahabi, Big data and its technical challenges Communications of The ACM. ,vol. 57, pp. 86- 94 ,(2014) , 10.1145/2611567
Tian Zhang, Raghu Ramakrishnan, Miron Livny, BIRCH: an efficient data clustering method for very large databases international conference on management of data. ,vol. 25, pp. 103- 114 ,(1996) , 10.1145/233269.233324
Elias Zavitsanos, George Paliouras, George A. Vouros, Gold Standard Evaluation of Ontology Learning Methods through Ontology Transformation and Alignment IEEE Transactions on Knowledge and Data Engineering. ,vol. 23, pp. 1635- 1648 ,(2011) , 10.1109/TKDE.2010.195
Maryam M Najafabadi, Flavio Villanustre, Taghi M Khoshgoftaar, Naeem Seliya, Randall Wald, Edin Muharemagic, Deep learning applications and challenges in big data analytics Journal of Big Data. ,vol. 2, pp. 1- 21 ,(2015) , 10.1186/S40537-014-0007-7
A. Maedche, S. Staab, Ontology Learning for the Semantic Web ,(2002)
Tian Zhang, Raghu Ramakrishnan, Miron Livny, BIRCH: A New Data Clustering Algorithm and Its Applications Data Mining and Knowledge Discovery. ,vol. 1, pp. 141- 182 ,(1997) , 10.1023/A:1009783824328
Ilya Sutskever, Tomas Mikolov, Greg S Corrado, Kai Chen, Jeff Dean, Distributed Representations of Words and Phrases and their Compositionality neural information processing systems. ,vol. 26, pp. 3111- 3119 ,(2013)
Klaas Dellschaft, Steffen Staab, On how to perform a gold standard based evaluation of ontology learning international semantic web conference. pp. 228- 241 ,(2006) , 10.1007/11926078_17