Arbitrary Keyword Spotting in Handwritten Documents

作者: Mehdi Haji

DOI:

关键词:

摘要: Despite the existence of electronic media in today’s world, a considerable amount written communications is paper form such as books, bank cheques, contracts, etc. There an increasing demand for automation information extraction, classification, search, and retrieval documents. The goal this research to develop complete methodology spotting arbitrary keywords handwritten document images. We propose top-down approach Our composed two major steps: segmentation decision. In former, we generate word hypotheses. latter, decide whether generated hypothesis specific keyword or not. We carry out decision step through two-level classification where first, assign input image non-keyword class; then transcribe if it passed keyword. By reducing problem from domain text domain, do not only address search documents, but also retrieval, without need transcription whole image. The main contribution thesis development generalized minimum edit distance words, prove that equivalent Ergodic Hidden Markov Model (EHMM). To best our knowledge, work first present exact 2D model temporal handwriting while satisfying practical constraints. Some other contributions include: 1) removal page margins based on corner detection projection profiles; 2) noise patterns images using expectation maximization fuzzy inference systems; 3) extraction lines fast Fourier-based steerable filtering; 4) characters skeletal graphs; 5) merging broken graph partitioning. Our experiments with benchmark database English documents real-world collection French indicate that, even any word/document-level training, results are comparable state-of-the-art systems

参考文章(107)
Darrin L Dimmick, Jon Geist, Stanley A Janet, Gerald T Candela, Charles L Wilson, Patrick J Grother, NIST Form-Based Handprint Recognition System NIST Interagency/Internal Report (NISTIR) - 5959. ,(1994) , 10.1002/HTTPS://DX.DOI.ORG/10.6028/NIST.IR.5959
Rafael C. Gonzalez, Richard E. Woods, Digital Image Processing 3rd Edition ,(2014)
Fotis Daskas, Ergina Kavallieratou, Text Line Detection and Segmentation: Uneven Skew Angles and Hill-and-Dale Writing. Journal of Universal Computer Science. ,vol. 17, pp. 16- 29 ,(2011)
Joyeeta Gupta, A Theoretical Framework The Climate Change Convention and Developing Countries: From Conflict to Consensus?. pp. 21- 45 ,(1997) , 10.1007/978-94-015-8925-3_2
David G. Stork, Richard O. Duda, Peter E. Hart, Pattern Classification (2nd Edition) Wiley-Interscience. ,(2000)
M. Mehdi Haji, Tien D. Bui, Ching Y. Suen, Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles Image Analysis and Processing – ICIAP 2009. pp. 1025- 1034 ,(2009) , 10.1007/978-3-642-04146-4_109
Jian-xiong Dong, Adam Krzyżak, Ching Y. Suen, Dominique Ponson, Low-Level Cursive Word Representation Based on Geometric Decomposition Machine Learning and Data Mining in Pattern Recognition. pp. 590- 599 ,(2005) , 10.1007/11510888_58
Lambert Schomaker, Merijn van Erp, Variants of the Borda count method for combining ranked classifier hypotheses international conference on frontiers in handwriting recognition. pp. 443- 452 ,(2000)
Volkmar Frinken, Andreas Fischer, Horst Bunke, A novel word spotting algorithm using bidirectional long short-term memory neural networks artificial neural networks in pattern recognition. pp. 185- 196 ,(2010) , 10.1007/978-3-642-12159-3_17