Automatic Detection of Handwritten Texts from Video Frames of Lectures

Authors: Purnendu Banerjee, Ujjwal Bhattacharya, Bidyut B. Chaudhuri

DOI: 10.1109/ICFHR.2014.110

Abstract: Automatic recognition of handwritten texts in video lectures has important applications. In video lectures, the presenter usually writes on a white or colored board. The camera often captures the writing board along with certain other objects, possibly including the presenter. Recognition of handwritten texts from such a video frame requires prior detection of the text region in the frame. In this article, we present our recent study of handwritten text localization in video lecture frames. Here, we use Scale Invariant Feature Transform (SIFT) descriptors computed densely over the entire frame. The SIFT descriptors are located on a regular grid of 5 pixels, following the usual practice, and each is given a uniform patch size of 60 × 60 as its support on the basis of an empirical study. The SIFT descriptor at each location (grid point) is fed as a 128-dimensional input feature vector to a Multilayer Perceptron (MLP) network, which gives a response for the point as either text or non-text. Depending on the aggregate response at each pixel, we localize the text regions of the frame. Next, we employ K-means clustering to detect text components of the localized text regions. Finally, two simple rules are applied to decide which of the detected text components are possibly noise. We obtained encouraging simulation results of the approach on a variety of video lecture frames.
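The grid-and-aggregate step described in the abstract can be illustrated with a minimal pure-Python sketch. Only the 5-pixel grid spacing and the 60 × 60 support patch come from the abstract. Everything else is an assumption made for illustration: the function names `grid_points` and `aggregate_votes` are hypothetical, the `is_text` callback is a stand-in for the SIFT descriptor plus MLP classifier, and scoring each pixel by the fraction of covering patches labelled as text is one plausible reading of "aggregate response", not necessarily the rule used in the paper.

```python
from typing import Callable, List, Tuple

STEP = 5    # grid spacing in pixels (stated in the abstract)
PATCH = 60  # side of the square support patch in pixels (stated in the abstract)


def grid_points(height: int, width: int,
                step: int = STEP, patch: int = PATCH) -> List[Tuple[int, int]]:
    """Return (row, col) patch centres whose full patch fits inside the frame."""
    half = patch // 2
    return [(r, c)
            for r in range(half, height - half + 1, step)
            for c in range(half, width - half + 1, step)]


def aggregate_votes(height: int, width: int,
                    is_text: Callable[[int, int], bool],
                    step: int = STEP, patch: int = PATCH) -> List[List[float]]:
    """Per-pixel fraction of covering patches that were labelled as text.

    `is_text(row, col)` is a hypothetical stand-in for the classifier
    decision at a grid point (SIFT descriptor fed to an MLP in the paper).
    The averaging rule is an assumption, not taken from the paper.
    """
    half = patch // 2
    votes = [[0] * width for _ in range(height)]
    cover = [[0] * width for _ in range(height)]
    for r, c in grid_points(height, width, step, patch):
        label = 1 if is_text(r, c) else 0
        # The patch centred at (r, c) spans `patch` rows and columns.
        for y in range(r - half, r + half):
            for x in range(c - half, c + half):
                cover[y][x] += 1
                votes[y][x] += label
    return [[votes[y][x] / cover[y][x] if cover[y][x] else 0.0
             for x in range(width)]
            for y in range(height)]


if __name__ == "__main__":
    # Toy frame: pretend every grid point in the right part of the frame is text.
    scores = aggregate_votes(120, 160, lambda r, c: c >= 100)
    text_pixels = sum(s > 0.5 for row in scores for s in row)
    print(len(grid_points(120, 160)), "grid points;", text_pixels, "pixels above 0.5")
```

Thresholding the resulting score map gives a candidate text region. The later stages mentioned in the abstract (K-means clustering of the localized region and the two noise rules) are not sketched here because the abstract does not specify them.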
