摘要: The increasing use of large open-domain document sources is exacerbating the problem ambiguity in named entities. This paper explores a range syntactic and semantic features unsupervised clustering documents that result from ad hoc queries containing names. From these experiments, we find robust can significantly improve state art for disambiguation performance personal names both Chinese English.