System for extracting text from a plurality of captured images of a document

Authors: Peter O. Stubler, Andrew C. Blose

Keywords: Data processing system, Artificial intelligence, Textual information, Process (computing), Digital image, Computer science, Network interface, Optical character recognition, Information retrieval, Computer vision

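Abstract: A system including a data processing system, a network interface for communicating over a network, and a program memory storing instructions configured to cause the data processing system to implement a method for extracting textual information from images of a document containing text characters. The method includes receiving a plurality of captured digital images of the document over the network. Each captured digital image is automatically analyzed using an optical character recognition process to determine extracted textual data. The extracted textual data for the captured digital images are merged to determine the textual information for the document, wherein differences between the extracted textual data for the captured digital images are analyzed to determine the textual information for the document.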
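The merging step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how differences between the OCR results are analyzed, so the word-level majority vote below is an assumption chosen for illustration, not the patented method. The function name `merge_ocr_texts` is hypothetical, and the inputs are assumed to be plain-text OCR outputs already obtained for each captured image.

```python
from collections import Counter
from difflib import SequenceMatcher


def merge_ocr_texts(texts):
    """Merge several OCR readings of one document by word-level voting.

    Assumption: the first reading is used as the alignment reference, and
    only one-to-one word substitutions are resolved; inserted or dropped
    words in the other readings are ignored in this sketch.
    """
    if not texts:
        return ""
    token_lists = [text.split() for text in texts]
    reference = token_lists[0]
    # One vote counter per reference word; the reference votes for itself.
    votes = [Counter({word: 1}) for word in reference]
    for tokens in token_lists[1:]:
        matcher = SequenceMatcher(a=reference, b=tokens, autojunk=False)
        for tag, i1, i2, j1, j2 in matcher.get_opcodes():
            # Count only spans that line up word for word.
            if tag in ("equal", "replace") and (i2 - i1) == (j2 - j1):
                for offset in range(i2 - i1):
                    votes[i1 + offset][tokens[j1 + offset]] += 1
    # Pick the most frequent word at each position; ties keep the reference word.
    return " ".join(counter.most_common(1)[0][0] for counter in votes)


if __name__ == "__main__":
    readings = [
        "The qu1ck brown fox",
        "The quick brown f0x",
        "The quick brown fox",
    ]
    print(merge_ocr_texts(readings))
```

Voting across readings relies on OCR errors in different captures of the same page tending to fall on different words, so a word misread in one image is usually outvoted by the others. A fuller implementation would also need to handle inserted or missing words and could weight votes by OCR confidence.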

References (11)
Patrick Denis Lincoln, Lawrence R. Toll, Peter D. Karp, Kemal Sonmez, Data relationship model, (2001)
Chiranjib Bhattacharyya, Sriraghavendra Ramaswamy, System and method for searching handwritten texts, (2009)
Alexander Bronstein, Michael Bronstein, Shlomo Selim Rakib, Methods and systems for representation and matching of video content, (2009)
Alistair Willis, David Morse, David King, Dave Roberts, Chris Lyal, Anton Dil, Improving search in scanned documents: Looking for OCR mismatches, (2009)
Shaolei Feng, R. Manmatha, A hierarchical, HMM-based automatic evaluation of OCR accuracy for a digital library of books, ACM/IEEE Joint Conference on Digital Libraries, pp. 109-118, (2006), 10.1145/1141753.1141776