Structured document classification by matching local salient features

作者: Satoshi Naoi , Yuan He , Siyuan Chen , Jun Sun

DOI:

关键词: Digital image processingInformation retrievalStructured documentImage textureArtificial intelligenceFeature detection (computer vision)Image processingContextual image classificationTemplate matchingFeature extractionStandard test imageDigital imageImage registrationPattern recognitionFeature (computer vision)Automatic image annotationComputer scienceBinary image

摘要: Following the recent trend in using low level image features classifying document images, this paper we present a novel approach for structured classification by matching salient feature points between query and reference images. Our method is robust to diverse training data size, formats qualities. Through points, registration available as well. Although aimed large domain of our already achieved zero error rates tests on benchmark NIST tax form databases.

参考文章(11)
Jian Liang, David Doermann, Logical Labeling of Document Images Using Layout Graph Matching with Adaptive Learning document analysis systems. pp. 224- 235 ,(2002) , 10.1007/3-540-45869-7_26
Eric Saund, Scientific challenges underlying production document processing Document Recognition and Retrieval XVIII. ,vol. 7874, pp. 787402- ,(2011) , 10.1117/12.876948
Marcal Rusinol, David Aldavert, Ricardo Toledo, Josep Llados, Browsing Heterogeneous Document Collections by a Segmentation-Free Word Spotting Method international conference on document analysis and recognition. pp. 63- 67 ,(2011) , 10.1109/ICDAR.2011.22
Sergey Usilin, Dmitry Nikolaev, Vassili Postnikov, Gerald Schaefer, Visual appearance based document image classification international conference on image processing. pp. 2133- 2136 ,(2010) , 10.1109/ICIP.2010.5652024
Eric Saund, A Graph Lattice Approach to Maintaining and Learning Dense Collections of Subgraphs as Image Features IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 35, pp. 2323- 2339 ,(2013) , 10.1109/TPAMI.2012.267
Andrea Vedaldi, Brian Fulkerson, Vlfeat Proceedings of the international conference on Multimedia - MM '10. pp. 1469- 1472 ,(2010) , 10.1145/1873951.1874249
Christian Shin, David Doermann, Azriel Rosenfeld, Classification of document pages using structure-based features International Journal on Document Analysis and Recognition. ,vol. 3, pp. 232- 247 ,(2001) , 10.1007/PL00013566
Nawei Chen, Dorothea Blostein, A survey of document image classification: problem statement, classifier architecture and performance evaluation International Journal on Document Analysis and Recognition. ,vol. 10, pp. 1- 16 ,(2007) , 10.1007/S10032-006-0020-2
S. Chen, S. Mao, G. Thoma, Simultaneous Layout Style and Logical Entity Recognition in a Heterogeneous Collection of Documents international conference on document analysis and recognition. ,vol. 1, pp. 118- 122 ,(2007) , 10.1109/ICDAR.2007.4378687
P. Sarkar, Image classification: Classifying distributions of visual features international conference on pattern recognition. ,vol. 2, pp. 472- 475 ,(2006) , 10.1109/ICPR.2006.683