Classifying natural-language spatial relation terms with random forest algorithm

Authors: Shihong Du, Xiaonan Wang, Chen-Chieh Feng, Xiuyuan Zhang

DOI: 10.1080/13658816.2016.1212356

Abstract: The exponential growth of natural language text data in social media has contributed a rich data source for geographic information. However, incorporating such data into GIS analysis faces tremendous challenges, as existing GIS data tend to be geometry based while natural language text data rely on natural-language spatial relation (NLSR) terms. To alleviate this problem, one critical step is to translate geometric configurations into NLSR terms, but the methods available to date (e.g. the mean value or decision tree algorithm) are insufficient to obtain a precise translation. This study addresses the issue by adopting the random forest (RF) algorithm to automatically learn a robust mapping model from a large number of samples and to evaluate the importance of each variable for each NLSR term. Because the semantic similarity of the collected terms reduces classification accuracy, different grouping schemes of NLSR terms are used, with their influences on the classification results being evaluated. The experiment results demonstrate that the learned model can accurately transform geometric configurations into NLSR terms, and that recognizing different groups of terms requires different sets of variables. More importantly, the results of the variable importance evaluation indicate that the topology types determined by the 9-intersection model are weaker than the metric variables in defining NLSR terms, which contrasts with the assertion that 'topology matters, metric refines' in existing studies.
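The two ideas the abstract combines, a forest of randomized trees that maps geometric variables to a term and a per-variable importance score, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the variable names (`topology_code`, `distance`, `noise`), the three terms, the synthetic labelling rule, and the simplified importance measure (accuracy drop on held-out data after shuffling one variable, pooled over all terms rather than computed per term). It is not the authors' implementation, data, or variable set, and its output says nothing about the paper's findings.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def majority(labels):
    return Counter(labels).most_common(1)[0][0]

def build_tree(X, y, n_try, rng, depth=0, max_depth=4):
    """Grow one tree; each node considers only n_try randomly chosen variables."""
    if depth == max_depth or len(set(y)) == 1:
        return majority(y)
    best = None
    for f in rng.sample(range(len(X[0])), n_try):
        # Candidate thresholds: every distinct value except the largest,
        # so both sides of the split are non-empty.
        for t in sorted({row[f] for row in X})[:-1]:
            left = [lab for row, lab in zip(X, y) if row[f] <= t]
            right = [lab for row, lab in zip(X, y) if row[f] > t]
            score = len(left) * gini(left) + len(right) * gini(right)
            if best is None or score < best[0]:
                best = (score, f, t)
    if best is None:  # every candidate variable is constant at this node
        return majority(y)
    _, f, t = best
    lo = [i for i, row in enumerate(X) if row[f] <= t]
    hi = [i for i, row in enumerate(X) if row[f] > t]
    return (f, t,
            build_tree([X[i] for i in lo], [y[i] for i in lo], n_try, rng, depth + 1, max_depth),
            build_tree([X[i] for i in hi], [y[i] for i in hi], n_try, rng, depth + 1, max_depth))

def predict_tree(node, row):
    while isinstance(node, tuple):  # internal nodes are tuples, leaves are labels
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node

def fit_forest(X, y, n_trees=15, n_try=2, seed=0):
    rng = random.Random(seed)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(X)) for _ in X]  # bootstrap sample with replacement
        forest.append(build_tree([X[i] for i in idx], [y[i] for i in idx], n_try, rng))
    return forest

def predict(forest, row):
    return majority([predict_tree(tree, row) for tree in forest])  # majority vote

def accuracy(forest, X, y):
    return sum(predict(forest, r) == lab for r, lab in zip(X, y)) / len(y)

def permutation_importance(forest, X, y, seed=1):
    """Accuracy drop when one variable's column is shuffled (simplified measure)."""
    rng = random.Random(seed)
    base = accuracy(forest, X, y)
    drops = []
    for f in range(len(X[0])):
        col = [row[f] for row in X]
        rng.shuffle(col)
        shuffled = [row[:f] + [v] + row[f + 1:] for row, v in zip(X, col)]
        drops.append(base - accuracy(forest, shuffled, y))
    return drops

# Synthetic, hypothetical data: the labelling rule below is invented for this sketch.
NAMES = ["topology_code", "distance", "noise"]
data_rng = random.Random(42)

def make_sample():
    topo = data_rng.choice([0, 1])   # assumed code: 0 = disjoint, 1 = intersecting
    dist = data_rng.random()         # assumed normalized distance
    noise = data_rng.random()        # irrelevant variable
    term = "crosses" if topo == 1 else ("near" if dist < 0.5 else "far_from")
    return [topo, dist, noise], term

train = [make_sample() for _ in range(150)]
test = [make_sample() for _ in range(100)]
X_train, y_train = [r for r, _ in train], [t for _, t in train]
X_test, y_test = [r for r, _ in test], [t for _, t in test]

forest = fit_forest(X_train, y_train)
acc = accuracy(forest, X_test, y_test)
importance = dict(zip(NAMES, permutation_importance(forest, X_test, y_test)))
print("held-out accuracy:", acc)
print("importance:", importance)
```

Because the toy labels depend only on `topology_code` and `distance`, the sketch should rank `noise` lowest. That ranking is a property of the invented data, not evidence about which variables matter for real NLSR terms.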
