Author: Dennis G. Nicholson
DOI:
Keywords:
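Abstract: A system creates an electronic version of a document from page-images of the document, wherein the electronic version replicates both the logical content and the physical appearance of the original document. During operation, the system receives page-images for the document. Next, the system extracts character images from the page-images and generates a synthetic font for the document from the extracted character images. Finally, the system constructs the electronic version of the document by using the synthetic font to represent text regions of the document and by using image-segments from the page-images to represent non-text regions of the document.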
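
The pipeline in the abstract can be illustrated with a minimal Python sketch. Everything named here is hypothetical and not taken from the source: the `SyntheticFont` class, the `build_document` function, and the toy input format (pages pre-segmented into `"text"` and `"image"` regions, with character images given as tuples of strings). The sketch also assumes that identical character bitmaps map to one glyph; a real system would more plausibly group similar shapes rather than require exact matches.

```python
from dataclasses import dataclass, field

# Hypothetical toy representation: a character image is a tuple of rows,
# e.g. ("#.", "##"). Real page-images would be raster data.
Bitmap = tuple


@dataclass
class SyntheticFont:
    """Font built from character images extracted from the page-images."""
    glyphs: list = field(default_factory=list)   # glyph id -> bitmap
    _index: dict = field(default_factory=dict)   # bitmap -> glyph id

    def glyph_id(self, bitmap: Bitmap) -> int:
        # Assumption: exact-match grouping. Each distinct bitmap
        # becomes one glyph in the synthetic font.
        if bitmap not in self._index:
            self._index[bitmap] = len(self.glyphs)
            self.glyphs.append(bitmap)
        return self._index[bitmap]


def build_document(pages):
    """Build an electronic version from pre-segmented page regions.

    `pages` is a list of pages; each page is a list of (kind, payload):
      ("text",  [(x, y, bitmap), ...])  character images with positions
      ("image", segment)                a non-text image-segment
    """
    font = SyntheticFont()
    out_pages = []
    for regions in pages:
        out = []
        for kind, payload in regions:
            if kind == "text":
                # Text regions are represented with synthetic-font glyphs.
                placed = [(x, y, font.glyph_id(bmp)) for x, y, bmp in payload]
                out.append(("text", placed))
            else:
                # Non-text regions keep the original image-segment.
                out.append(("image", payload))
        out_pages.append(out)
    return font, out_pages
```

Because text regions keep each glyph's position and non-text regions keep the original image-segment, the output retains the page's appearance while storing repeated characters only once in the font.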