Color and Gradient Features for Text Segmentation from Video Frames

作者： P. Shivakumara , D. S. Guru , H. T. Basavaraju

关键词: k-means clustering 、 Computer science 、 Computer vision 、 Text segmentation 、 Artificial intelligence 、 Pattern recognition 、 Feature (computer vision) 、 Cluster analysis 、 Frame (networking) 、 Image processing 、 Connected-component labeling 、 Scale-space segmentation

摘要: Text segmentation in a video is drawing attention of researchers the field image processing, pattern recognition and document analysis because it helps annotating labeling events accurately. We propose novel idea generating an enhanced frame from R, G, B channels input by grouping high low values using Min–Max clustering criteria. also perform sliding window on to group neighboring pixel further enhance frame. Subsequently, we use k-means with k = 2 algorithm separate text non-text regions. The fully connected components will be identified skeleton obtained clustering. Concept component based gradient feature has been adapted for purpose symmetry verification. which satisfy symmetric verification are selected representatives regions they permitted grow cover their respective region containing text. method tested variety frames evaluate performance terms recall, precision f-measure. results show that promising encouraging.

参考文章(22)

A.K. Jain, Bin Yu, Automatic text location in images and video frames international conference on pattern recognition. ,vol. 2, pp. 1497- 1499 ,(1998) , 10.1109/ICPR.1998.711990

Datong Chen, Jean-Marc Odobez, Hervé Bourlard, Text detection and recognition in images and video frames Pattern Recognition. ,vol. 37, pp. 595- 608 ,(2004) , 10.1016/J.PATCOG.2003.06.001

Palaiahnakote Shivakumara, Rushi Padhuman Sreedhar, Trung Quy Phan, Shijian Lu, Chew Lim Tan, Multioriented Video Scene Text Detection Through Bayesian Classification and Boundary Growing IEEE Transactions on Circuits and Systems for Video Technology. ,vol. 22, pp. 1227- 1235 ,(2012) , 10.1109/TCSVT.2012.2198129

Cong Yao, Xiang Bai, Wenyu Liu, Yi Ma, Zhuowen Tu, Detecting texts of arbitrary orientations in natural images computer vision and pattern recognition. pp. 1083- 1090 ,(2012) , 10.1109/CVPR.2012.6247787

Keechul Jung, Neural network-based text location in color images Pattern Recognition Letters. ,vol. 22, pp. 1503- 1515 ,(2001) , 10.1016/S0167-8655(01)00096-4

P Shivakumara, Trung Quy Phan, Chew Lim Tan, A Laplacian Approach to Multi-Oriented Text Detection in Video IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 33, pp. 412- 419 ,(2011) , 10.1109/TPAMI.2010.166

D. S. Guru, S. Manjunath, P. Shivakumara, C. L. Tan, An eigen value based approach for text detection in video Proceedings of the 8th IAPR International Workshop on Document Analysis Systems - DAS '10. pp. 501- 506 ,(2010) , 10.1145/1815330.1815395

Keechul Jung, Kwang In Kim, Anil K. Jain, Text information extraction in images and video: a survey Pattern Recognition. ,vol. 37, pp. 977- 997 ,(2004) , 10.1016/J.PATCOG.2003.10.012

Edward K. Wong, Minya Chen, A new robust algorithm for video text extraction Pattern Recognition. ,vol. 36, pp. 1397- 1406 ,(2003) , 10.1016/S0031-3203(02)00230-3

10.

L. Neumann, J. Matas, Real-time scene text localization and recognition computer vision and pattern recognition. pp. 3538- 3545 ,(2012) , 10.1109/CVPR.2012.6248097

Color and Gradient Features for Text Segmentation from Video Frames

来源期刊

我的账户

Color and Gradient Features for Text Segmentation from Video Frames

来源期刊

相似文章 4

Full-Vector Gradient for Multi-Spectral or Multivariate Images

Text Detection Through Hidden Markov Random Field and EM-Algorithm

Gradient in spectral and color images: from the Di Zenzo initial construction to a generic proposition

Neighborhood Pixel-Based Approach for Arbitrary-Oriented Multilingual Text Localization

我的账户