On methods and tools of table detection, extraction and annotation in PDF documents

作者: Shah Khusro , Asima Latif , Irfan Ullah

DOI: 10.1177/0165551514551903

关键词: Table (database)Information retrievalComputer scienceState (computer science)AnnotationImportant researchInformation extraction

摘要: Table detection, extraction and annotation have been an important research problem for years. To handle this issue, different approaches have been designed for different types of …

参考文章(118)
Micheline Hancock-Beaulieu, Stephen E. Robertson, Steve Walker, Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive. text retrieval conference. pp. 199- 210 ,(1998)
Prasenjit Mitra, C. Lee Giles, Kun Bai, Ying Liu, Tablerank: a ranking algorithm for table search and retrieval national conference on artificial intelligence. pp. 317- 322 ,(2007)
Jingjing Wang, Haixun Wang, Zhongyuan Wang, Kenny Q. Zhu, Understanding tables on the web international conference on conceptual modeling. pp. 141- 155 ,(2012) , 10.1007/978-3-642-34002-4_11
Varish Mulwad, Tim Finin, Zareen Syed, Anupam Joshi, Exploiting a Web of Semantic Data for Interpreting Tables web science. ,(2010)
Jonathan J Hull, Suzanne L Taylor, Document Analysis Systems II World Scientific. ,(1998) , 10.1142/3446
Silvia Miksch, Burcu Yildiz, Katharina Kaiser, pdf2table: A Method to Extract Table Information from PDF Files. indian international conference on artificial intelligence. pp. 1773- 1785 ,(2005)