Signature Segmentation from Machine Printed Documents Using Conditional Random Field

作者: Ranju Mandal , Partha Pratim Roy , Umapada Pal

DOI: 10.1109/ICDAR.2011.236

关键词:

摘要: Automatic separation of signatures from a document page involves difficult challenges due to the free-flow nature handwriting, overlapping/touching signature parts with printed text, noise, etc. In this paper, we have proposed novel approach for segmentation machine signed documents. The algorithm first locates block in using word level feature extraction. Next, strokes that touch or overlap texts are separated. A stroke classification is then performed skeleton analysis separate overlapping text signature. Gradient based features and Support Vector Machine (SVM) used our scheme. Finally, Conditional Random Field (CRF) model energy minimization concept on approximated labeling by graph cut applied label as "signature" "printed text" accurate signatures. Signature experiment "tobacco" dataset1 obtained encouraging results.

参考文章(14)
Umapada Pal, Nabin Sharma, Tetsushi Wakabayashi, Fumitaka Kimura, Handwritten Numeral Recognition of Six Popular Indian Scripts international conference on document analysis and recognition. ,vol. 2, pp. 749- 753 ,(2007) , 10.1109/ICDAR.2007.4377015
F. Farooq, K. Sridharan, V. Govindaraju, Identifying Handwritten Text in Mixed Documents international conference on pattern recognition. ,vol. 2, pp. 1142- 1145 ,(2006) , 10.1109/ICPR.2006.676
J.F. Vargas, M.A. Ferrer, C.M. Travieso, J.B. Alonso, Off-line signature verification based on grey level information using texture features Pattern Recognition. ,vol. 44, pp. 375- 385 ,(2011) , 10.1016/J.PATCOG.2010.07.028
Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, Ramachandrula Sitaram, Overlapped text segmentation using Markov random field and aggregation document analysis systems. pp. 129- 134 ,(2010) , 10.1145/1815330.1815347
Shravya Shetty, Harish Srinivasan, Matthew Beal, Sargur Srihari, Segmentation and labeling of documents using conditional random fields document recognition and retrieval. ,vol. 6500, ,(2007) , 10.1117/12.704410
Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, Ramachandrula Sitaram, Kiran Bhuvanagiri, Markov Random Field Based Text Identification from Annotated Machine Printed Documents international conference on document analysis and recognition. pp. 431- 435 ,(2009) , 10.1109/ICDAR.2009.237
M. Blumenstein, Miguel A. Ferrer, J. F. Vargas, The 4NSigComp2010 Off-line Signature Verification Competition: Scenario 2 international conference on frontiers in handwriting recognition. pp. 721- 726 ,(2010) , 10.1109/ICFHR.2010.117
J. P. Swanepoel, J. Coetzer, Off-line Signature Verification Using Flexible Grid Features and Classifier Fusion international conference on frontiers in handwriting recognition. pp. 297- 302 ,(2010) , 10.1109/ICFHR.2010.52
V. Kolmogorov, R. Zabih, What energy functions can be minimized via graph cuts IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 26, pp. 147- 159 ,(2004) , 10.1109/TPAMI.2004.1262177
J.K. Guo, M.Y. Ma, Separating handwritten material from machine printed text using hidden Markov models international conference on document analysis and recognition. pp. 439- 443 ,(2001) , 10.1109/ICDAR.2001.953828