Recognising text in real scenes

作者: Paul Clark , Majid Mirmehdi

DOI: 10.1007/S10032-001-0072-2

关键词:

摘要: We present two different approaches to the location and recovery of text in images real scenes. The techniques we describe are invariant scale 3D orientation text, allow cluttered first approach uses page edges other rectangular boundaries around locate a surface containing recover fronto-parallel view. This is performed using line detection, perceptual grouping, comparison potential regions confidence measure. second low-level texture measures with neural network classifier an image. Then view each located paragraph by separating individual lines determining vanishing points plane. illustrate our results number images.

参考文章(16)
Martin A. Fischler, Robert C. Bolles, A RANSAC-based approach to model fitting and its application to finding cylinders in range data international joint conference on artificial intelligence. pp. 637- 643 ,(1981)
Robert Wilensky, Thomas A. Phelps, Multivalent Documents: A New Model for Digital Documents University of California at Berkeley. ,(1998)
Huiping Li, D. Doermann, Automatic identification of text in digital video key frames international conference on pattern recognition. ,vol. 1, pp. 129- 132 ,(1998) , 10.1109/ICPR.1998.711097
Raymond K.K Yip, A Hough transform technique for the detection of reflectional symmetry and skew-symmetry Pattern Recognition Letters. ,vol. 21, pp. 117- 130 ,(2000) , 10.1016/S0167-8655(99)00138-5
M. Petrou, J. Kittler, Optimal edge detectors for ramp edges IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 13, pp. 483- 491 ,(1991) , 10.1109/34.134047
Victor Wu, R. Manmatha, Edward M. Riseman, Finding text in images acm international conference on digital libraries. pp. 3- 12 ,(1997) , 10.1145/263690.263766
Anil K. Jain, Sushil Bhattacharjee, Text segmentation using Gabor filters for automatic document processing machine vision applications. ,vol. 5, pp. 169- 184 ,(1992) , 10.1007/BF02626996
C.L. Tan, P.O. Ng, Text extraction using pyramid Pattern Recognition. ,vol. 31, pp. 63- 72 ,(1998) , 10.1016/S0031-3203(97)00026-5
S. Messelodi, C.M. Modena, Automatic identification and skew estimation of text lines in real scene images Pattern Recognition. ,vol. 32, pp. 791- 810 ,(1999) , 10.1016/S0031-3203(98)00108-3