Detection of artificial and scene text in images and video frames

作者: Marios Anthimopoulos , Basilis Gatos , Ioannis Pratikakis

DOI: 10.1007/S10044-011-0237-7

关键词: Pattern recognition (psychology)Computer visionPattern recognitionArtificial intelligenceMultimedia information retrievalMachine translationLocal binary patternsNoisy text analyticsInformation extractionText graphGreek languageComputer science

摘要: Textual information in images and video frames constitutes a valuable source of high-level semantics for multimedia indexing retrieval systems. Text detection is the most crucial step text extraction system although it has been extensively studied past decade still, does not exist generic architecture that would work artificial scene content. In this paper we propose both frames. The based on machine learning stage which uses an Random Forest classifier highly discriminative feature set produced by using new texture operator called Multilevel Adaptive Color edge Local Binary Pattern (MACeLBP). MACeLBP describes spatial distribution color edges multiple adaptive levels contrast. Then, gradient-based algorithm applied to achieve distinction among lines as well refinement localization lines. whole situated multiresolution framework invariance scale Finally, optional connected-component segments into words distances between resulting components. experimental results are applying concise evaluation methodology prove superior performance achieved proposed

参考文章(33)
Basilios Gatos, Ioannis Pratikakis, Marios Anthimopoulos, MULTIRESOLUTION TEXT DETECTION IN VIDEO FRAMES international conference on computer vision theory and applications. pp. 161- 166 ,(2016)
Hideaki Goto, Redefining the DCT-based feature for scene text detection: Analysis and comparison of spatial frequency-based features International Journal on Document Analysis and Recognition. ,vol. 11, pp. 1- 8 ,(2008) , 10.1007/S10032-008-0061-9
U. Gargi, D. Crandall, S. Antani, T. Gandhi, R. Keener, R. Kasturi, A system for automatic text detection in video international conference on document analysis and recognition. pp. 29- 32 ,(1999) , 10.1109/ICDAR.1999.791717
Datong Chen, Jean-Marc Odobez, Jean-Philippe Thiran, A localization/verification scheme for finding text in images and video frames based on contrast independent features and machine learning methods Signal Processing-image Communication. ,vol. 19, pp. 205- 217 ,(2004) , 10.1016/S0923-5965(03)00075-4
Keechul Jung, Neural network-based text location in color images Pattern Recognition Letters. ,vol. 22, pp. 1503- 1515 ,(2001) , 10.1016/S0167-8655(01)00096-4
Kongqiao Wang, Jari A. Kangas, Character location in scene images from digital camera Pattern Recognition. ,vol. 36, pp. 2287- 2299 ,(2003) , 10.1016/S0031-3203(03)00082-7
Marios Anthimopoulos, Basilis Gatos, Ioannis Pratikakis, A two-stage scheme for text detection in video images Image and Vision Computing. ,vol. 28, pp. 1413- 1426 ,(2010) , 10.1016/J.IMAVIS.2010.03.004
Rainer Lienhart, Wolfgang Effelsberg, Automatic text segmentation and text recognition for video indexing Multimedia Systems. ,vol. 8, pp. 69- 81 ,(2000) , 10.1007/S005300050006
David Crandall, Sameer Antani, Rangachar Kasturi, Extraction of special effects caption text events from digital video international conference on document analysis and recognition. ,vol. 5, pp. 138- 157 ,(2003) , 10.1007/S10032-002-0091-7
Kwang In Kim, Keechul Jung, Se Hyun Park, Hang Joon Kim, Support vector machine-based text detection in digital video Pattern Recognition. ,vol. 34, pp. 527- 529 ,(2001) , 10.1016/S0031-3203(00)00095-9