Authors: Einat Amitay, Nadav Har'El, Ron Sivan, Aya Soffer
Keywords:
Abstract: We describe Web-a-Where, a system for associating geography with Web pages. Web-a-Where locates mentions of places and determines the place each name refers to. In addition, it assigns to each page a geographic focus: a locality that the page discusses as a whole. The tagging process is simple and fast, aimed to be applied to large collections of Web pages and to facilitate a variety of location-based applications and data analyses. Geotagging involves arbitrating two types of ambiguities: geo/non-geo and geo/geo. A geo/non-geo ambiguity occurs when a place name also has a non-geographic meaning, such as a person name (e.g., Berlin) or a common word (Turkey). Geo/geo ambiguity arises when distinct places have the same name, as in London, England vs. London, Ontario. An implementation of the tagger within the framework of the WebFountain data mining system is described, and evaluated on several corpora of real Web pages. Precision of up to 82% on individual geotags is achieved. We also evaluate the relative contribution of various heuristics the tagger employs, and evaluate the focus-finding algorithm using a corpus pretagged with localities, showing that as many as 91% of the foci reported are correct up to the country level.
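
The two ambiguity types and the notion of a page focus can be illustrated with a minimal sketch. This is not the Web-a-Where implementation: the gazetteer entries, the rough population figures, the `COMMON_WORDS` list, and the function names `tag_places` and `page_focus` are all hypothetical, and the heuristics (drop names that are usually ordinary words, prefer a candidate whose region or country is also mentioned on the page, otherwise fall back to the most populous candidate, then take the most frequent country as the focus) are simplified assumptions chosen only to make the ideas concrete.

```python
from collections import Counter

# Hypothetical toy gazetteer: name -> candidates (place, region, country, population).
# Population figures are rough illustrative values, not authoritative data.
GAZETTEER = {
    "London": [
        ("London", "England", "United Kingdom", 8_800_000),
        ("London", "Ontario", "Canada", 420_000),
    ],
    "Toronto": [("Toronto", "Ontario", "Canada", 2_800_000)],
    "Turkey": [("Turkey", None, "Turkey", 85_000_000)],
}

# Hypothetical list of names assumed to usually carry a non-geographic meaning.
COMMON_WORDS = {"Turkey"}


def tag_places(text):
    """Return one resolved gazetteer entry per recognised place mention."""
    tokens = [t.strip(".,;:()") for t in text.split()]
    present = set(tokens)
    tags = []
    for tok in tokens:
        candidates = GAZETTEER.get(tok)
        if not candidates:
            continue
        # geo/non-geo: skip names that are usually ordinary words.
        if tok in COMMON_WORDS:
            continue
        # geo/geo: prefer candidates whose region or country also appears on the page.
        supported = [c for c in candidates if c[1] in present or c[2] in present]
        pool = supported or candidates
        # Fall back to the most populous candidate in the remaining pool.
        tags.append(max(pool, key=lambda c: c[3]))
    return tags


def page_focus(tags):
    """Return the most frequently tagged country, or None if nothing was tagged."""
    if not tags:
        return None
    countries = Counter(c[2] for c in tags)
    return countries.most_common(1)[0][0]
```

For example, `tag_places("From Toronto to London, Ontario.")` resolves "London" to the Ontario entry because "Ontario" appears on the page, and `page_focus` of those tags yields "Canada" at the country level.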