Detection of numbered captions

Authors: Jean-Luc Meunier, Herve Dejean

DOI:

Keywords: Sequence (medicine); Numbering scheme; Imitation (music); Term (logic); Fragment (computer graphics); Information retrieval; Computer science

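Abstract: A method of detecting numbered captions in a document includes receiving a document including a sequence of document pages and identifying illustrations on pages of the document. For each identified illustration, associated text is identified. An imitation page is generated for each of the identified illustrations, each imitation page comprising a single illustration and its associated text. For a sequence of the imitation pages, a sequence of terms is identified. Each term is derived from a text fragment of the associated text of a respective imitation page. The terms of a sequence comply with at least one predefined numbering scheme, which defines a form and an incremental state of the terms in a sequence. The terms of the identified sequence are construed as being at least part of a numbered caption for a respective illustration in the document.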
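The term-sequence step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes each imitation page is already available as a list of text fragments, and it uses one hypothetical numbering scheme: a word form followed by an Arabic numeral that increases by one (for example "Figure 1", "Figure 2"). The names `TERM`, `extract_terms`, and `longest_numbered_sequence` are illustrative only.

```python
import re
from typing import Dict, List, Tuple

# Hypothetical numbering scheme: a leading word (the "form") followed by
# an Arabic numeral (the "incremental state"), e.g. "Figure 3" or "Fig. 3".
TERM = re.compile(r"^\s*([A-Za-z]+\.?)\s*(\d+)")

Term = Tuple[str, int]            # (form, number)
Chain = List[Tuple[int, Term]]    # [(imitation page index, term), ...]


def extract_terms(fragments: List[str]) -> List[Term]:
    """Derive candidate terms from the text fragments of one imitation page."""
    terms = []
    for fragment in fragments:
        match = TERM.match(fragment)
        if match:
            terms.append((match.group(1).lower(), int(match.group(2))))
    return terms


def longest_numbered_sequence(pages: List[List[str]]) -> Chain:
    """Find the longest run of terms, at most one per imitation page,
    that share a form and whose numbers increase by exactly one."""
    best: Dict[Term, Chain] = {}
    for page_index, fragments in enumerate(pages):
        updates: Dict[Term, Chain] = {}
        for form, number in extract_terms(fragments):
            key = (form, number)
            # Extend the best chain that ended on the previous number.
            chain = best.get((form, number - 1), []) + [(page_index, key)]
            if len(chain) > len(updates.get(key, best.get(key, []))):
                updates[key] = chain
        # Apply after the page is processed so that terms on the same
        # imitation page never chain with one another.
        best.update(updates)
    return max(best.values(), key=len, default=[])


# Each inner list holds the text associated with one illustration.
pages = [
    ["Figure 1: System overview", "12"],
    ["See page 4", "Figure 2: Page segmentation"],
    ["Figure 3: Results", "Table 1"],
]
print(longest_numbered_sequence(pages))
```

In this sketch the fragments that yield the terms of the longest sequence would be treated as caption candidates for their illustrations. Supporting other schemes (Roman numerals, lettered sub-figures) would need additional patterns, which are left out here.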