Arbitrary Keyword Spotting in Handwritten Documents

作者： Mehdi Haji

DOI:

关键词:

摘要: Despite the existence of electronic media in today’s world, a considerable amount written communications is paper form such as books, bank cheques, contracts, etc. There an increasing demand for automation information extraction, classification, search, and retrieval documents. The goal this research to develop complete methodology spotting arbitrary keywords handwritten document images. We propose top-down approach Our composed two major steps: segmentation decision. In former, we generate word hypotheses. latter, decide whether generated hypothesis specific keyword or not. We carry out decision step through two-level classification where first, assign input image non-keyword class; then transcribe if it passed keyword. By reducing problem from domain text domain, do not only address search documents, but also retrieval, without need transcription whole image. The main contribution thesis development generalized minimum edit distance words, prove that equivalent Ergodic Hidden Markov Model (EHMM). To best our knowledge, work first present exact 2D model temporal handwriting while satisfying practical constraints. Some other contributions include: 1) removal page margins based on corner detection projection profiles; 2) noise patterns images using expectation maximization fuzzy inference systems; 3) extraction lines fast Fourier-based steerable filtering; 4) characters skeletal graphs; 5) merging broken graph partitioning. Our experiments with benchmark database English documents real-world collection French indicate that, even any word/document-level training, results are comparable state-of-the-art systems

concordia.ca 本地加速

暂无可下载资源，当前可以选择系统获取到有开放资源时通知我或者直接发起求助文献求助

参考文章(107)

Darrin L Dimmick, Jon Geist, Stanley A Janet, Gerald T Candela, Charles L Wilson, Patrick J Grother, NIST Form-Based Handprint Recognition System NIST Interagency/Internal Report (NISTIR) - 5959. ,(1994) , 10.1002/HTTPS://DX.DOI.ORG/10.6028/NIST.IR.5959

Rafael C. Gonzalez, Richard E. Woods, Digital Image Processing 3rd Edition ,(2014)

Fotis Daskas, Ergina Kavallieratou, Text Line Detection and Segmentation: Uneven Skew Angles and Hill-and-Dale Writing. Journal of Universal Computer Science. ,vol. 17, pp. 16- 29 ,(2011)

Joyeeta Gupta, A Theoretical Framework The Climate Change Convention and Developing Countries: From Conflict to Consensus?. pp. 21- 45 ,(1997) , 10.1007/978-94-015-8925-3_2

William H. Majoros, Methods for Computational Gene Prediction: Generalized hidden Markov models ,(2007) , 10.1017/CBO9780511811135.010

David G. Stork, Richard O. Duda, Peter E. Hart, Pattern Classification (2nd Edition) Wiley-Interscience. ,(2000)

M. Mehdi Haji, Tien D. Bui, Ching Y. Suen, Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles Image Analysis and Processing – ICIAP 2009. pp. 1025- 1034 ,(2009) , 10.1007/978-3-642-04146-4_109

Jian-xiong Dong, Adam Krzyżak, Ching Y. Suen, Dominique Ponson, Low-Level Cursive Word Representation Based on Geometric Decomposition Machine Learning and Data Mining in Pattern Recognition. pp. 590- 599 ,(2005) , 10.1007/11510888_58

Lambert Schomaker, Merijn van Erp, Variants of the Borda count method for combining ranked classifier hypotheses international conference on frontiers in handwriting recognition. pp. 443- 452 ,(2000)

10.

Volkmar Frinken, Andreas Fischer, Horst Bunke, A novel word spotting algorithm using bidirectional long short-term memory neural networks artificial neural networks in pattern recognition. pp. 185- 196 ,(2010) , 10.1007/978-3-642-12159-3_17

Arbitrary Keyword Spotting in Handwritten Documents

来源期刊

我的账户

Arbitrary Keyword Spotting in Handwritten Documents

来源期刊

相似文章 2

Method and system for the spotting of arbitrary words in handwritten documents

A Novel Word-Spotting Method for Handwritten Documents Using an Optimization-Based Classifier

我的账户