Touching text character localization in graphical documents using SIFT

作者: Partha Pratim Roy , Umapada Pal , Josep Lladós

DOI: 10.1007/978-3-642-13728-0_18

关键词:

摘要: Interpretation of graphical document images is a challenging task as it requires proper understanding text/graphics symbols present in such documents. Difficulties arise recognition when text and symbol overlapped/touched. Intersection with lines curves occur frequently documents hence separation very difficult. Several pattern classification techniques exist to recognize isolated text/symbol. But, the touching/overlapping has not yet been dealt successfully. An interesting technique, Scale Invariant Feature Transform (SIFT), originally devised for object can take care overlapping problems. Even if SIFT features have emerged powerful descriptors, their employment context investigated much. In this paper we adaptation approach character localization (spotting) We evaluate applicability technique discuss scope improvement by combining some state-of-the-art approaches.

参考文章(14)
D.H. Ballard, Generalizing the hough transform to detect arbitrary shapes Pattern Recognition. ,vol. 13, pp. 714- 725 ,(1987) , 10.1016/0031-3203(81)90009-1
John Skilling, S. F. Gull, Algorithms and Applications Springer, Dordrecht. pp. 83- 132 ,(1985) , 10.1007/978-94-017-2221-6_5
Jonathan J Hull, Suzanne L Taylor, Document Analysis Systems II World Scientific. ,(1998) , 10.1142/3446
Huizhu Luo, G. Agam, I. Dinstein, Directional mathematical morphology approach for line thinning and extraction of character strings from maps and line drawings international conference on document analysis and recognition. ,vol. 1, pp. 257- 260 ,(1995) , 10.1109/ICDAR.1995.598989
Karl Tombre, Salvatore Tabbone, Loïc Pélissier, Bart Lamiroy, Philippe Dosch, Text/Graphics Separation Revisited document analysis systems. ,vol. 2423, pp. 200- 211 ,(2002) , 10.1007/3-540-45869-7_24
C.L. Tan, P.O. Ng, Text extraction using pyramid Pattern Recognition. ,vol. 31, pp. 63- 72 ,(1998) , 10.1016/S0031-3203(97)00026-5
Marçal Rusiñol, Josep Lladós, Word and Symbol Spotting Using Spatial Organization of Local Descriptors document analysis systems. pp. 489- 496 ,(2008) , 10.1109/DAS.2008.24
L.A. Fletcher, R. Kasturi, A robust algorithm for text string separation from mixed text/graphics images IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 10, pp. 910- 918 ,(1988) , 10.1109/34.9112
Ruini Cao, Chew Lim Tan, Text/Graphics Separation in Maps graphics recognition. pp. 167- 177 ,(2001) , 10.1007/3-540-45868-9_14
K. Tombre, B. Lamiroy, Graphics recognition - from re-engineering to retrieval international conference on document analysis and recognition. pp. 148- 155 ,(2003) , 10.1109/ICDAR.2003.1227650