Automatic performance evaluation for video text detection

Authors: Xian-Sheng Hua, Liu Wenyin, Hong-Jiang Zhang

DOI: 10.1109/ICDAR.2001.953848

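Abstract: We propose an objective, comprehensive, and difficulty-independent performance evaluation protocol for video text detection algorithms. The protocol includes a positive set and a negative set of indices at the textbox level, which evaluate detection quality in terms of both the location accuracy and the fragmentation of the detected textboxes. In the protocol, we assign a detection difficulty (DD) level to each ground truth textbox. The performance indices can then be normalized with respect to the DD level and are therefore independent of ground truth difficulty. We also assign a detection importance (DI) level to each ground truth textbox. The overall detection rate is the DI-weighted average of the detection qualities of all ground truth textboxes, which makes the detection rate more accurately reveal the real performance. The automatic performance evaluation scheme has been applied to a text detection approach to determine the parameters that yield the best detection results.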
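
As an illustration of the DI-weighted average mentioned in the abstract, the following is a minimal Python sketch. It is not the authors' implementation. The function name `di_weighted_detection_rate`, the quality scale of 0 to 1, and the sample values are all assumptions made for this example. The abstract does not say how per-textbox qualities are computed or how the DD normalization is carried out, so neither is modeled here.

```python
from typing import Sequence


def di_weighted_detection_rate(
    qualities: Sequence[float], importances: Sequence[float]
) -> float:
    """Return the importance-weighted mean of per-textbox detection qualities.

    Hypothetical helper: `qualities` holds one detection quality per ground
    truth textbox (assumed to lie in [0, 1]) and `importances` holds the
    matching DI weights.
    """
    if len(qualities) != len(importances):
        raise ValueError("qualities and importances must have the same length")
    total_weight = sum(importances)
    if total_weight <= 0:
        raise ValueError("importance weights must sum to a positive value")
    weighted_sum = sum(q * w for q, w in zip(qualities, importances))
    return weighted_sum / total_weight


# Made-up values for three ground truth textboxes (assumed, not from the paper).
qualities = [1.0, 0.5, 0.0]
importances = [3.0, 1.0, 1.0]
rate = di_weighted_detection_rate(qualities, importances)
unweighted = sum(qualities) / len(qualities)
```

With these made-up values the weighted rate is higher than the plain mean, because the fully detected textbox carries the largest importance weight. That is the effect the abstract attributes to DI weighting: important textboxes dominate the overall rate.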
