Detecting text in natural scenes with stroke width transform

Authors: Boris Epshtein, Eyal Ofek, Yonatan Wexler

DOI: 10.1109/CVPR.2010.5540041

Abstract: We present a novel image operator that seeks to find the value of stroke width for each image pixel, and demonstrate its use on the task of text detection in natural images. The suggested operator is local and data dependent, which makes it fast and robust enough to eliminate the need for multi-scale computation or scanning windows. Extensive testing shows that the suggested scheme outperforms the latest published algorithms. Its simplicity allows the algorithm to detect texts in many fonts and languages.
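
The abstract does not say how the operator is computed, so the following is only a rough illustration of what "a stroke width value for each image pixel" can mean. It is not the authors' method. It is a much simpler run-length stand-in that assumes the image has already been binarized (1 = stroke pixel, 0 = background). The names `run_length` and `stroke_width_map` are hypothetical and chosen for this sketch.

```python
def run_length(line, i):
    """Length of the run of non-zero values in `line` that contains index i."""
    left = i
    while left > 0 and line[left - 1]:
        left -= 1
    right = i
    while right < len(line) - 1 and line[right + 1]:
        right += 1
    return right - left + 1


def stroke_width_map(binary):
    """Per-pixel stroke width estimate for a binary image (list of lists of 0/1).

    Assumption (not from the paper): the width at a foreground pixel is the
    shorter of the horizontal and vertical foreground runs passing through it.
    Background pixels are assigned 0.
    """
    height = len(binary)
    width = len(binary[0]) if height else 0
    # Pre-extract columns so vertical runs can reuse run_length.
    columns = [[binary[y][x] for y in range(height)] for x in range(width)]
    out = [[0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            if binary[y][x]:
                out[y][x] = min(run_length(binary[y], x),
                                run_length(columns[x], y))
    return out


# A vertical bar two pixels wide and five pixels tall.
bar = [[0, 1, 1, 0] for _ in range(5)]
width_map = stroke_width_map(bar)
```

Like the operator described in the abstract, this estimate is local and depends only on the data around each pixel, so it needs no scanning window or image pyramid. Unlike it, the sketch only measures along the two image axes, so diagonal or curved strokes would be overestimated.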
