作者: Adnan Ul-Hasan , Saad Bin Ahmed , Faisal Rashid , Faisal Shafait , Thomas M. Breuel
关键词:
摘要: Recurrent neural networks (RNN) have been successfully applied for recognition of cursive handwritten documents, both in English and Arabic scripts. Ability RNNs to model context sequence data like speech text makes them a suitable candidate develop OCR systems printed Nabataean scripts (including Nastaleeq which no system is available date). In this work, we presented the results applying RNN Urdu script. Bidirectional Long Short Term Memory (BLSTM) architecture with Connectionist Temporal Classification (CTC) output layer was employed recognize text. We evaluated BLSTM two cases: one ignoring character's shape variations second considering them. The error rate at character level first case 5.15% 13.6%. These were obtained on synthetically generated UPTI dataset containing artificially degraded images reflect some real-world scanning artifacts along clean images. Comparison shape-matching based method also presented.