Comparing contents of electronic documents

作者: Richard Lee Sites

DOI:

关键词:

摘要: A method is described which compares contents-rich documents page by and creates a difference document of paired pages. The pages are compared, in that order, based on their marking operators, bitmaps rendered from the still unpaired pages, subset bitmap, e.g. smaller areas. Pages visually identical paired. Blank inserted if cannot be to deal with insertions deletions. Differences between can visible printed document, marked used contain embedded graphical contents as well plain text files.

参考文章(17)
Takahiro Ushiro, Yoshinobu Aiba, Hideto Kohtani, Image retrieving apparatus ,(1994)
Deborah Kurata, Irving S. Rappaport, Kevin G. Rivette, Adam Jackson, Michael P. Florio, Don Ahn, System, method, and computer program product for generating documents using pagination information ,(1998)
Jeanette L. Blomberg, Christian K. Shin, Randall H. Trigg, James V. Mahoney, System for searching a corpus of document images by user specified document layout components ,(1997)
Frederick A. Hayes-Roth, Neil A. Jacobstein, Yen-whei Chow, James E. Manley, Christopher B. McMahan, Automatic retrieval of changed files by a network software agent ,(1996)
Andrei Z. Broder, Mark S. Manasse, Steven C. Glassman, Charles G. Nelson, Geoffrey G. Zweig, Method for clustering closely resembling data objects ,(1998)