KHATT: Arabic Offline Handwritten Text Database

作者: Sabri A. Mahmoud , Irfan Ahmad , Mohammad Alshayeb , Wasfi G. Al-Khatib , Mohammad Tanvir Parvez

DOI: 10.1109/ICFHR.2012.224

关键词:

摘要: In this paper, we report our comprehensive Arabic offline Handwritten Text database (KHATT) after completion of the collection 1000 handwritten forms written by writers from different countries. It is composed an image containing images text at 200, 300, and 600 dpi resolutions, a manually verified ground truth that contains meta-data describing page, paragraph, line levels. A formal verification procedure implemented to align with its form, paragraph Tools extract paragraphs pages segment into lines are developed. Preliminary experiments on recognition conducted using sample data results reported. The will be made freely available researchers world-wide for research in various handwritten-related problems such as recognition, writer identification verification, etc.

参考文章(20)
Sherif Abdelazeem, Ezzat Ali El-Sherif, A Two-Stage System for Arabic Handwritten Digit Recognition Tested on a New Large Database. artificial intelligence and pattern recognition. pp. 237- 242 ,(2007)
Lambert Schomaker, Louis Vuurpijl, Lambertus Schomaker, Forensic writer identification: a benchmark data set and a comparison of two systems NICI (NIjmegen Institute of Cognitive Information), Katholieke Universiteit Nijmegen. ,(2000)
Petr Motlicek, Georg Stemmer, Ondrej Glembek, Karel Vesely, Lukas Burget, Gilles Boulianne, Yanmin Qian, Mirko Hannemann, Nagendra Goel, Petr Schwarz, Arnab Ghoshal, Jan Silovsky, Daniel Povey, The Kaldi Speech Recognition Toolkit ieee automatic speech recognition and understanding workshop. ,(2011)
Sabri A. Mahmoud, Irfan Ahmad, Mohammed Alshayeb, Wasfi G. Al-Khatib, A database for offline arabic handwritten text recognition international conference on image analysis and recognition. pp. 397- 406 ,(2011) , 10.1007/978-3-642-21596-4_40
Mohammad Tanvir Parvez, Sabri A. Mahmoud, Offline arabic handwritten text recognition: A Survey ACM Computing Surveys. ,vol. 45, pp. 23- ,(2013) , 10.1145/2431211.2431222
Badr Al-Badr, Sabri A. Mahmoud, Survey and bibliography of Arabic optical text recognition Signal Processing. ,vol. 41, pp. 49- 77 ,(1995) , 10.1016/0165-1684(94)00090-M
Husni A. Al-Muhtaseb, Sabri A. Mahmoud, Rami S. Qahwaji, Recognition of off-line printed Arabic text using Hidden Markov Models Signal Processing. ,vol. 88, pp. 2902- 2912 ,(2008) , 10.1016/J.SIGPRO.2008.06.013
Haikal El Abed, Volker Margner, The IFN/ENIT-database - a tool to develop Arabic handwriting recognition systems information sciences, signal processing and their applications. pp. 1- 4 ,(2007) , 10.1109/ISSPA.2007.4555529
M. Wienecke, G.A. Fink, G. Sagerer, Towards automatic video-based whiteboard reading international conference on document analysis and recognition. pp. 87- 91 ,(2003) , 10.1109/ICDAR.2003.1227633
N. Kharma, M. Ahmed, R. Ward, A new comprehensive database of handwritten Arabic words, numbers, and signatures used for OCR testing canadian conference on electrical and computer engineering. ,vol. 2, pp. 766- 768 ,(1999) , 10.1109/CCECE.1999.808042