Color and Gradient Features for Text Segmentation from Video Frames

作者: P. Shivakumara , D. S. Guru , H. T. Basavaraju

DOI: 10.1007/978-81-322-1143-3_22

关键词: k-means clusteringComputer scienceComputer visionText segmentationArtificial intelligencePattern recognitionFeature (computer vision)Cluster analysisFrame (networking)Image processingConnected-component labelingScale-space segmentation

摘要: Text segmentation in a video is drawing attention of researchers the field image processing, pattern recognition and document analysis because it helps annotating labeling events accurately. We propose novel idea generating an enhanced frame from R, G, B channels input by grouping high low values using Min–Max clustering criteria. also perform sliding window on to group neighboring pixel further enhance frame. Subsequently, we use k-means with k = 2 algorithm separate text non-text regions. The fully connected components will be identified skeleton obtained clustering. Concept component based gradient feature has been adapted for purpose symmetry verification. which satisfy symmetric verification are selected representatives regions they permitted grow cover their respective region containing text. method tested variety frames evaluate performance terms recall, precision f-measure. results show that promising encouraging.

参考文章(22)
A.K. Jain, Bin Yu, Automatic text location in images and video frames international conference on pattern recognition. ,vol. 2, pp. 1497- 1499 ,(1998) , 10.1109/ICPR.1998.711990
Datong Chen, Jean-Marc Odobez, Hervé Bourlard, Text detection and recognition in images and video frames Pattern Recognition. ,vol. 37, pp. 595- 608 ,(2004) , 10.1016/J.PATCOG.2003.06.001
Palaiahnakote Shivakumara, Rushi Padhuman Sreedhar, Trung Quy Phan, Shijian Lu, Chew Lim Tan, Multioriented Video Scene Text Detection Through Bayesian Classification and Boundary Growing IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 22, pp. 1227- 1235 ,(2012) , 10.1109/TCSVT.2012.2198129
Cong Yao, Xiang Bai, Wenyu Liu, Yi Ma, Zhuowen Tu, Detecting texts of arbitrary orientations in natural images computer vision and pattern recognition. pp. 1083- 1090 ,(2012) , 10.1109/CVPR.2012.6247787
Keechul Jung, Neural network-based text location in color images Pattern Recognition Letters. ,vol. 22, pp. 1503- 1515 ,(2001) , 10.1016/S0167-8655(01)00096-4
P Shivakumara, Trung Quy Phan, Chew Lim Tan, A Laplacian Approach to Multi-Oriented Text Detection in Video IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 33, pp. 412- 419 ,(2011) , 10.1109/TPAMI.2010.166
D. S. Guru, S. Manjunath, P. Shivakumara, C. L. Tan, An eigen value based approach for text detection in video Proceedings of the 8th IAPR International Workshop on Document Analysis Systems - DAS '10. pp. 501- 506 ,(2010) , 10.1145/1815330.1815395
Keechul Jung, Kwang In Kim, Anil K. Jain, Text information extraction in images and video: a survey Pattern Recognition. ,vol. 37, pp. 977- 997 ,(2004) , 10.1016/J.PATCOG.2003.10.012
Edward K. Wong, Minya Chen, A new robust algorithm for video text extraction Pattern Recognition. ,vol. 36, pp. 1397- 1406 ,(2003) , 10.1016/S0031-3203(02)00230-3
L. Neumann, J. Matas, Real-time scene text localization and recognition computer vision and pattern recognition. pp. 3538- 3545 ,(2012) , 10.1109/CVPR.2012.6248097