作者: Mahdi Hamdani , Amr El-Desoky Mousa , Hermann Ney
关键词:
摘要: The use of Language Models (LMs) is a very important component in large and open vocabulary recognition systems. This paper presents an open-vocabulary approach for Arabic handwriting recognition. proposed makes word decomposition based on morphological analysis. combination words sub-words obtained by the process. Out Of Vocabulary (OOV) can be recognized combining different elements from lexicon. system Hidden Markov (HMMs) with position context dependent character models. An n-gram LM trained decomposed text used along HMMs during search. evaluated using two datasets. leads to significant improvement performance. Two types experiments tasks are conducted this work. allows have absolute up 1% Word Error Rate (WER) constrained task keep same performance baseline unconstrained one.