N-FTRN: Neighborhoods based fully convolutional network for Chinese text line recognition

作者: Hongzhu Li , Weiqiang Wang , Ke Lv

DOI: 10.1007/S11042-019-7410-1

关键词: Structure (mathematical logic)Computer sciencePattern recognitionFeature (machine learning)Artificial intelligenceCharacter (computing)Line (text file)Recurrent neural networkLayer (object-oriented design)

摘要: The convolutional recurrent neural network is one of the most popular text recognition methods. Recurrent structures can extract long-term dependencies, but they are time consuming in computation compared with structures. We argue that Chinese line be performed based on neighbor rather than entire contextual information, and information extracted from neighborhoods should only a supplement to character regions. Therefore, we propose novel fully (N-FTRN). It first extracts character-level feature sequences lines, then uses residual blocks instead structure utilize information. A reshape layer applied enable recognize both vertical horizontal lines. Extensive experiments have been conducted validate efficiency effectiveness proposed network. Compared state-of-the-art methods, achieve comparable performances scene competition dataset (TRW) ICDAR 2015 much more compact models.

参考文章(45)
Max Jaderberg, Andrea Vedaldi, Andrew Zisserman, Deep Features for Text Spotting european conference on computer vision. pp. 512- 528 ,(2014) , 10.1007/978-3-319-10593-2_34
Andrea Vedaldi, Max Jaderberg, Karen Simonyan, Andrew Zisserman, Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition arXiv: Computer Vision and Pattern Recognition. ,(2014)
Andrew Y. Ng, Adam Coates, David J. Wu, Tao Wang, End-to-end text recognition with convolutional neural networks international conference on pattern recognition. pp. 3304- 3308 ,(2012)
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
Qi Yin, Cong Yao, Zhimin Cao, Shuchang Zhou, Xinyu Zhou, ICDAR 2015 Text Reading in the Wild Competition arXiv: Computer Vision and Pattern Recognition. ,(2015)
Ilya Sutskever, Geoffrey E. Hinton, Alex Krizhevsky, Ruslan R. Salakhutdinov, Nitish Srivastava, Improving neural networks by preventing co-adaptation of feature detectors arXiv: Neural and Evolutionary Computing. ,(2012)
Chen Change Loy, Xiaoou Tang, Yu Qiao, Pan He, Weilin Huang, Reading Scene Text in Deep Convolutional Sequences arXiv: Computer Vision and Pattern Recognition. ,(2015)
Shao Huang, Weiqiang Wang, Hui Zhang, Retrieving images using saliency detection and graph matching 2014 IEEE International Conference on Image Processing (ICIP). pp. 3087- 3091 ,(2014) , 10.1109/ICIP.2014.7025624
Liang Xu, Fei Yin, Qiu-Feng Wang, Cheng-Lin Liu, An over-segmentation method for single-touching Chinese handwriting with learning-based filtering International Journal on Document Analysis and Recognition (IJDAR). ,vol. 17, pp. 91- 104 ,(2014) , 10.1007/S10032-013-0208-1
Tong-Hua Su, Tian-Wen Zhang, De-Jun Guan, Hu-Jie Huang, Off-line recognition of realistic Chinese handwriting using segmentation-free strategy Pattern Recognition. ,vol. 42, pp. 167- 182 ,(2009) , 10.1016/J.PATCOG.2008.05.012