Automatic separation of text from background in scanned images of complex documents

作者: Oscar A. Zuniga

DOI:

关键词: GeographyHistogramArtificial intelligenceImage (mathematics)GrayscaleBackground informationFrequency of occurrencePixelComputer visionImage histogramBlock (data storage)

摘要: A system that converts a scanned image of complex document into an where text has been preserved and separated from the background. The first subdivides blocks then examines each block pixel by to construct histogram gray scale values pixels. is partitioned first, middle last regions. If one or more peaks occur in regions, single peak occurs within region, pixels are reexamined determine frequency occurrence having level nearby which have region peak. this high, assumed be background information. After determining threshold, rescans applying threshold separate information block.

参考文章(4)
Sidney J. Fox, William J. Zimmermann, Filip J. Yeskel, Universal thresholder/discriminator ,(1982)
C.K. Chow, T. Kaneko, Automatic boundary detection of the left ventricle from cineangiograms Computers and Biomedical Research. ,vol. 5, pp. 388- 410 ,(1972) , 10.1016/0010-4809(72)90070-5
James Charles Stoffel, Image scanning apparatus and method ,(1982)