Thick 2D relations for document understanding

作者: Marco Aiello , Arnold M.W. Smeulders

DOI: 10.1016/J.INS.2003.05.015

关键词:

摘要: We use a propositional language of qualitative rectangle relations to detect the reading order from document images. To this end, we define notion encoding rule and analyze possible formalisms express rules such as LaTeX SGML. Document expressed in rectangles are used build detector for In achieve robusmess avoid brittleness when applying system real life images, thick boundary interpretation relation is introduced. The framework tested on collection heterogeneous images showing recall rates up 89%.

参考文章(33)
Zhan Cui, Anthony G. Cohn, David A. Randell, A Spatial Logic based on Regions and Connection. principles of knowledge representation and reasoning. pp. 165- 176 ,(1992)
Stefan Klink, Thomas Kieninger, Andreas Dengel, Document Structure Analysis Based on Layout and Textual Features ,(2000)
Philippe Balbiani, Jean-François Condotta, Luis Fariñas del Cerro, A Model for Reasoning about Bidemsional Temporal Relations. principles of knowledge representation and reasoning. pp. 124- 130 ,(1998)
Frank Mittelbach, Alexander Samarin, Michel Goossens, The LaTeX companion ,(1993)
J. F. A. K. Van Benthem, The Logic of Time Springer Netherlands. ,(1983) , 10.1007/978-94-010-9868-7
Floriana Esposito, Donato Malerba, Francesca A. Lisi, Machine Learning for Intelligent Processing of Printed Documents intelligent information systems. ,vol. 14, pp. 175- 198 ,(2000) , 10.1023/A:1008735902918
Leon Todoran, Marco Aiello, Christof Monz, Marcel Worring, Logical structure detection for heterogeneous document classes document recognition and retrieval. ,vol. 4307, pp. 99- 110 ,(2000) , 10.1117/12.410827
r;ribeiro-neto bueza-yates (b), Modern Information Retrieval ,(1999)
H. Walischewski, Automatic knowledge acquisition for spatial document interpretation international conference on document analysis and recognition. ,vol. 1, pp. 243- 247 ,(1997) , 10.1109/ICDAR.1997.619849