Urdu Optical Character Recognition Systems: Present Contributions and Future Directions

作者: Naila Habib Khan , Awais Adnan

DOI: 10.1109/ACCESS.2018.2865532

关键词:

摘要: This paper gives an across-the-board comprehensive review and survey of the most prominent studies in field Urdu optical character recognition (OCR). introduces OCR technology presents a historical systems, providing comparisons between English, Arabic, systems. Detailed background literature have also been provided for script, discussing script’s past, categories, phases. further reports all state-of-the-art different phases, namely, image acquisition, pre-processing, segmentation, feature extraction, classification/recognition, post-processing system. In segmentation section, analytical holistic approaches text emphasized. extraction comparison has learning engineering approaches. Deep traditional machine discussed. The numeral systems deliberated concisely. research concludes by identifying some open problems suggesting future directions.

参考文章(88)
Asma Sajjad, S. Afaq Husain, Fareeha Anwar, Online Urdu Character Recognition System. Journal of Machine Vision and Applications. pp. 98- 101 ,(2007)
Munish Kumar, M. K. Jindal, R. K. Sharma, Review on OCR for Handwritten Indian Scripts Character Recognition international conference on digital image processing. pp. 268- 276 ,(2011) , 10.1007/978-3-642-24055-3_28
Tony McEnery, Andrew Hardie, Hamish Cunningham, Paul Baker, EMILLE, A 67-million word corpus of indic languages:Data collection, mark-up and harmonisation language resources and evaluation. ,(2002)
Patrick Rebentrost, Masoud Mohseni, Seth Lloyd, Quantum algorithms for supervised and unsupervised machine learning arXiv: Quantum Physics. ,(2013)
Sobia Tariq Javed, Sarmad Hussain, Segmentation Based Urdu Nastalique OCR iberoamerican congress on pattern recognition. pp. 41- 49 ,(2013) , 10.1007/978-3-642-41827-3_6
Sarmad Hussain, Salman Ali, Qurat ul Ain Akram, Nastalique segmentation-based approach for Urdu OCR International Journal on Document Analysis and Recognition (IJDAR). ,vol. 18, pp. 357- 374 ,(2015) , 10.1007/S10032-015-0250-2
Saeeda Naz, Arif I. Umar, Riaz Ahmad, Saad B. Ahmed, Syed H. Shirazi, Muhammad I. Razzak, Urdu Nasta'liq text recognition system based on multi-dimensional recurrent neural network and statistical features Neural Computing and Applications. ,vol. 28, pp. 219- 231 ,(2017) , 10.1007/S00521-015-2051-4
Tayyaba Altaf, Aisha Latif, Faiza Iqbal, Nazia Kanwal, Conversion of urdu nastaliq to roman urdu using OCR international conference on information systems. pp. 19- 22 ,(2011)
Quara-Tul-Ain Safdar, Kamran Ullah Khan, Online Urdu Handwritten Character Recognition: Initial Half Form Single Stroke Characters frontiers of information technology. pp. 292- 297 ,(2014) , 10.1109/FIT.2014.61
Omar Mukhtar, Srirangaraj Setlur, Venu Govindaraju, Experiments on Urdu Text Recognition Advances in Pattern Recognition. pp. 163- 171 ,(2009) , 10.1007/978-1-84800-330-9_8