作者: Pallavi Pyreddy , W. Bruce Croft
DOI:
关键词: Column (database) 、 Information retrieval 、 Heuristic 、 Key (cryptography) 、 Table (information) 、 Data element 、 Representation (mathematics) 、 Information needs 、 Computer science 、 Component (UML)
摘要: Tables form an important kind of data element in text retrieval. Often, the gist entire news article or other exposition can be concisely captured tabular form. Information than key words a digital document exploited to provide users with more flexible and powerful query capabilities. More specifically, structural information is identify tables their component fields let based on these fields. Component include table lines, caption row headings, column components. Empirical results have demonstrated that heuristic method extraction tagging performed effectively efficiently. Moreover, experiments retrieval using system present invention strongly indicate such decomposition facilitate better representation user's needs hence effective tables.