Method and apparatus for analysis of electronic communications containing imagery

作者: Prasanna Mulgaonkar , Gregory K. Myers , John P. Marcotullio , Hrishikesh B. Aradhye

DOI:

关键词:

摘要: A method and apparatus are provided for analyzing an electronic communication containing imagery, e.g., to determine whether or not the is a spam communication. In one embodiment, inventive includes detecting more regions of imagery in received applying pre-processing techniques locate (e.g., blocks lines) text that may be distorted. The then analyzes content indicates spam. specialized extraction rectification embedded followed by optical character recognition processing applied extract their therefrom. another keyword shape-matching detect presence absence spam-indicative words from text. other attributes extracted regions, such as size, location, color complexity used build evidence against

参考文章(4)
Dan S. Bloomberg, Francine R. Chen, Lynn D. Wilcox, Word spotting in bitmap images using word bounding boxes and hidden Markov models ,(1992)
Amin El-Gazzar, Adrian Puente, Alexander Nolasco, Wolf Hartmut, Spam fax filter ,(2004)