Difficult and urgent open problems in document image analysis for libraries

作者: H.S. Baird

DOI: 10.1109/DIAL.2004.1263234

关键词:

摘要: Open problems in document image analysis research and development relevant to digital libraries (DLs) are briefly described their difficulty urgency estimated. They grouped according the stage of processing, construction DLs, where they first apply: capture, early content extraction recognition, structure analysis, retrieval, & presentation; personal interactive DLs.

参考文章(18)
Thomas A. Nartker, Frank R. Jenkins, Stephen V. Rice, The Fourth Annual Test of OCR Accuracy Information Science Research Institute Technical Report. ,(1995)
Elisa Barney Smith, Xiaohui Qiu, Relating Statistical Image Differences and Degradation Features document analysis systems. pp. 1- 12 ,(2002) , 10.1007/3-540-45869-7_1
Abigail J. Sellen, Richard H.R. Harper, The Myth of the Paperless Office ,(2001)
Suzanne Thorin, Daniel Greenstein, The digital library: A biography ,(2002)
K. Popat, Decoding of text lines in grayscale document images international conference on acoustics, speech, and signal processing. ,vol. 3, pp. 1513- 1516 ,(2001) , 10.1109/ICASSP.2001.941219
Monica Chew, Henry S. Baird, BaffleText: a Human Interactive Proof document recognition and retrieval. ,vol. 5010, pp. 305- 316 ,(2003) , 10.1117/12.479682
Francine R Chen, Dan S Bloomberg, Summarization of Imaged Documents without OCR Computer Vision and Image Understanding. ,vol. 70, pp. 307- 320 ,(1998) , 10.1006/CVIU.1998.0688
Kristen M. Summers, Document image improvement for OCR as a classification problem Document Recognition and Retrieval X. ,vol. 5010, pp. 73- 83 ,(2003) , 10.1117/12.476023