Web-a-where

Authors: Einat Amitay, Nadav Har'El, Ron Sivan, Aya Soffer

DOI: 10.1145/1008992.1009040

Abstract: We describe Web-a-Where, a system for associating geography with Web pages. Web-a-Where locates mentions of places and determines the place each name refers to. In addition, it assigns to each page a geographic focus: a locality that the page discusses as a whole. The tagging process is simple and fast, aimed to be applied to large collections of Web pages and to facilitate a variety of location-based applications and data analyses. Geotagging involves arbitrating two types of ambiguities: geo/non-geo and geo/geo. A geo/non-geo ambiguity occurs when a place name also has a non-geographic meaning, such as a person name (e.g., Berlin) or a common word (Turkey). Geo/geo ambiguity arises when distinct places have the same name, as in London, England vs. London, Ontario. An implementation of the tagger within the framework of the WebFountain data mining system is described, and evaluated on several corpora of real Web pages. Precision of up to 82% on individual geotags is achieved. We also evaluate the relative contribution of various heuristics the tagger employs, and evaluate the focus-finding algorithm using a corpus pretagged with localities, showing that as many as 91% of the foci reported are correct up to the country level.
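The two ambiguity types and the notion of a page focus can be illustrated with a small sketch. This is not the tagger described in the paper: the gazetteer entries, the `NON_GEO_CUES` word lists, the "first candidate is the default" rule, and the majority-vote focus are all hypothetical simplifications chosen only to make the ideas concrete.

```python
import re
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Place:
    name: str
    region: str
    country: str


# Toy gazetteer (assumption): the first candidate for a name is its default reading.
GAZETTEER = {
    "London": [Place("London", "England", "United Kingdom"),
               Place("London", "Ontario", "Canada")],
    "Berlin": [Place("Berlin", "Berlin", "Germany")],
    "Turkey": [Place("Turkey", "", "Turkey")],
    "Toronto": [Place("Toronto", "Ontario", "Canada")],
}

# Hypothetical context words that suggest a non-geographic reading of a name.
NON_GEO_CUES = {
    "Berlin": {"irving", "composer"},
    "Turkey": {"roast", "sandwich", "dinner"},
}


def geotag(text):
    """Return one resolved Place per gazetteer name mentioned in the text."""
    tokens = re.findall(r"[A-Za-z]+", text)
    context = {t.lower() for t in tokens}
    tags = []
    for tok in tokens:
        candidates = GAZETTEER.get(tok)
        if not candidates:
            continue
        # Geo/non-geo: skip the name if a non-geographic cue word appears in the text.
        if context & NON_GEO_CUES.get(tok, set()):
            continue
        # Geo/geo: prefer a candidate whose region is also mentioned in the text,
        # otherwise fall back to the default (first) candidate.
        chosen = candidates[0]
        for cand in candidates:
            if cand.region and cand.region != cand.name and cand.region.lower() in context:
                chosen = cand
                break
        tags.append(chosen)
    return tags


def country_focus(tags):
    """Country-level focus as a simple majority vote over the resolved tags."""
    if not tags:
        return None
    return Counter(p.country for p in tags).most_common(1)[0][0]


if __name__ == "__main__":
    page = "We flew from Toronto to London, Ontario."
    print(country_focus(geotag(page)))  # Canada
```

In this sketch "Irving Berlin wrote many songs." yields no geotags, because the cue word "irving" triggers the non-geo reading, while a bare "London" falls back to the default entry for London, England.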

References (18)
Per Lidén, Lars Asker, Kristofer Franzén, Fredrik Olsson, Gunnar Eriksson. Exploiting Syntax when Detecting Protein Names in Text. EFMI Workshop on Natural Language Processing in Biomedical Applications, March 8-9, 2002, Nicosia, Cyprus (2002).
Narayanan Shivakumar, Luis Gravano, Junyan Ding. Computing Geographical Scopes of Web Resources. Very Large Data Bases, pp. 545-556 (2000). DOI: 10.7916/D8PV6XJG
Nina Wacholder, Yael Ravin, T. J. Watson. Extracting Names from Natural-Language Text (2000).
David A. Smith, Gregory Crane. Disambiguating Geographic Names in a Historical Digital Library. European Conference on Research and Advanced Technology for Digital Libraries, pp. 127-136 (2001). DOI: 10.1007/3-540-44796-2_12
Erik Rauch, Michael Bukatin, Kenneth Baker. A confidence-based framework for disambiguating geographic terms. North American Chapter of the Association for Computational Linguistics, pp. 50-54 (2003). DOI: 10.3115/1119394.1119402
Jon Patrick, Casey Whitelaw, Robert Munro. SLINERC: the Sydney Language-Independent Named Entity Recogniser and Classifier. International Conference on Computational Linguistics, pp. 1-4 (2002). DOI: 10.3115/1118853.1118875
Jochen L. Leidner, Gail Sinclair, Bonnie Webber. Grounding spatial named entities for information extraction and question answering. North American Chapter of the Association for Computational Linguistics, pp. 31-38 (2003). DOI: 10.3115/1119394.1119399
Huifeng Li, Rohini K. Srihari, Cheng Niu, Wei Li. Location normalization for information extraction. Proceedings of the 19th International Conference on Computational Linguistics, pp. 1-7 (2002). DOI: 10.3115/1072228.1072355
Silviu Cucerzan, David Yarowsky. Language independent NER using a unified model of internal and contextual evidence. International Conference on Computational Linguistics, pp. 1-4 (2002). DOI: 10.3115/1118853.1118860
John D. Burger, John C. Henderson, William T. Morgan. Statistical named entity recognizer adaptation. International Conference on Computational Linguistics, pp. 1-4 (2002). DOI: 10.3115/1118853.1118856