作者: Allen Yilun Lin , Joshua Ford , Eytan Adar , Brent Hecht
关键词:
摘要: Data visualizations in news articles (e.g., maps, line graphs, bar charts) greatly enrich the content of and result well-established improvements to reader comprehension. However, existing systems that generate data visualiza-tions either require substantial manual effort or are limited very specific types visualizations, thereby re-stricting number can be enhanced. To address this issue, we define a new problem: given ar-ticle, retrieve relevant already exist on web. We show problem is tractable through system, VizByWiki, mines contextually from Wikimedia Commons, central file reposi-tory for Wikipedia. Using novel ground truth dataset, VizByWiki successfully augment as many 48% popular online with visualizations. also demonstrate automatically rank according their usefulness reasonable accuracy (nDCG@5 0.82). facilitate further advances our "news visualization retrieval problem", release dataset make system its source code publicly available.