A Tool for Extracting Text from Scanned Documents and Convert it into Editable Format

作者: S. Sukanya , S. Joseph Gladwin , C. Vinoth Kumar

DOI: 10.1109/VITECON.2019.8899428

关键词:

摘要: With the advent of Social media, now-a-days most data are stored in images. These if processed correctly can provide large information. Hence there is a need to convert these into A novel technique which available image an editable format proposed where be acquired either by camera, smart phone or directly from any source. The segmented characters using Connected Components (CC) and edge recombination stroke width. This then converted Optical Character Recognition (OCR) technology Maximally Stable Extremal Region (MSER) for segmentation. system also extract object images, done Artificial Neural Networks (ANN). complete solution developed MATLAB output variable. extracted word document required it edited. performance measured two parameters namely precision rate recall has about 88% 97% higher than earlier methods.

参考文章(15)
Pablo Arbelaez, Jordi Pont-Tuset, Jon Barron, Ferran Marques, Jitendra Malik, Multiscale Combinatorial Grouping computer vision and pattern recognition. pp. 328- 335 ,(2014) , 10.1109/CVPR.2014.49
Joao Carreira, Cristian Sminchisescu, Constrained parametric min-cuts for automatic object segmentation computer vision and pattern recognition. pp. 3241- 3248 ,(2010) , 10.1109/CVPR.2010.5540063
Thomas Deselaers, Daniel Keysers, Jan Hosang, Henry A. Rowley, GyroPen: Gyroscopes for Pen-Input With Mobile Phones IEEE Transactions on Human-Machine Systems. ,vol. 45, pp. 263- 271 ,(2015) , 10.1109/THMS.2014.2365723
Carlos Merino-Gracia, Majid Mirmehdi, José Sigut, José L González-Mora, None, Fast perspective recovery of text in natural scenes Image and Vision Computing. ,vol. 31, pp. 714- 724 ,(2013) , 10.1016/J.IMAVIS.2013.07.002
Fei Yin, Cheng-Lin Liu, Handwritten Chinese text line segmentation by clustering with distance metric learning Pattern Recognition. ,vol. 42, pp. 3146- 3157 ,(2009) , 10.1016/J.PATCOG.2008.12.013
Jimei Yang, Brian Price, Scott Cohen, Honglak Lee, Ming-Hsuan Yang, Object Contour Detection with a Fully Convolutional Encoder-Decoder Network computer vision and pattern recognition. pp. 193- 202 ,(2016) , 10.1109/CVPR.2016.28
Vladimir Riffo, Domingo Mery, Automated Detection of Threat Objects Using Adapted Implicit Shape Model systems man and cybernetics. ,vol. 46, pp. 472- 482 ,(2016) , 10.1109/TSMC.2015.2439233
Xu-Cheng Yin, Ze-Yu Zuo, Shu Tian, Cheng-Lin Liu, Text Detection, Tracking and Recognition in Video: A Comprehensive Survey IEEE Transactions on Image Processing. ,vol. 25, pp. 2752- 2773 ,(2016) , 10.1109/TIP.2016.2554321
Ismet Zeki Yalniz, R. Manmatha, Dependence Models for Searching Text in Document Images IEEE Transactions on Pattern Analysis and Machine Intelligence. ,vol. 41, pp. 49- 63 ,(2019) , 10.1109/TPAMI.2017.2780108
Yanwei Wang, Yuefang Sun, Changsong Liu, Layout and Perspective Distortion Independent Recognition of Captured Chinese Document Image international conference on document analysis and recognition. pp. 591- 596 ,(2017) , 10.1109/ICDAR.2017.102