Reconstructing high-fidelity electronic documents from images via generation of synthetic fonts

Author: Dennis G. Nicholson

DOI:

Keywords:

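Abstract: A system creates an electronic version of a document from page-images of the document, wherein the electronic version replicates both the logical content and the physical appearance of the original document. During operation, the system receives page-images for the document. Next, the system extracts character images from the page-images and generates a synthetic font from the extracted character images. Finally, the system constructs the electronic version of the document by using the synthetic font to represent text regions of the document and by using image-segments from the page-images to represent non-text regions of the document.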
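
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the method described in the document, only a toy under stated assumptions: page segmentation has already produced a list of regions (the `regions` structure, `SyntheticFont`, and `reconstruct` are hypothetical names), character images are tiny 0/1 bitmaps, and exact bitmap equality stands in for whatever glyph grouping a real system would use.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# Assumption: a character image is a small bitmap given as rows of 0/1 pixels.
Bitmap = Tuple[Tuple[int, ...], ...]


@dataclass
class SyntheticFont:
    """Hypothetical font: one glyph per distinct extracted character image."""
    glyphs: List[Bitmap] = field(default_factory=list)
    index: Dict[Bitmap, int] = field(default_factory=dict)

    def glyph_id(self, bitmap: Bitmap) -> int:
        # Exact-match lookup is a stand-in for real glyph clustering.
        if bitmap not in self.index:
            self.index[bitmap] = len(self.glyphs)
            self.glyphs.append(bitmap)
        return self.index[bitmap]


def reconstruct(regions):
    """Build (font, document) from pre-segmented page regions.

    Each region is ("text", [bitmap, ...]) or ("image", segment).
    Text regions become glyph-id sequences in the synthetic font;
    non-text regions keep their original image segment unchanged.
    """
    font = SyntheticFont()
    document = []
    for kind, payload in regions:
        if kind == "text":
            document.append(("text", [font.glyph_id(b) for b in payload]))
        else:
            document.append(("image", payload))
    return font, document


# Usage: two distinct character shapes, one repeated, plus one picture segment.
A = ((1, 0), (0, 1))
B = ((1, 1), (1, 1))
font, document = reconstruct([("text", [A, B, A]), ("image", "segment-0")])
```

Because the repeated character image maps to the same glyph, the text region is stored as a short sequence of glyph ids while its appearance is preserved by the glyph bitmaps. The non-text region is carried over as an image segment.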