Authors: Hossein Ziaei Nafchi, Seyed Morteza Ayatollahi, Reza Farrahi Moghaddam, Mohamed Cheriet
DOI:
Keywords:
Abstract: To facilitate benchmark contributions for binarization methods, a new fast ground truthing approach, called PhaseGT, is proposed. This approach is used to build the first ground-truthed Persian Heritage Image Binarization Dataset (PHIBD 2012). PhaseGT is a semiautomatic approach to ground truthing images in any language, designed especially for historical document images. Its main goal is to accelerate the ground truthing process and reduce the manual effort involved. It uses phase congruency features to preprocess the input image and to provide a more accurate initial binarization to the human expert who performs the manual part. This preprocessing is in turn based on a priori knowledge provided by the human user. The PHIBD 2012 dataset contains 15 historical document images with their corresponding ground truth binary images. The …