作者: Qianjin Hu , Yonghong Yang , Yuanhua Tang
DOI:
关键词: Internet content 、 Newspaper 、 Phrase 、 Ranking 、 Computer science 、 Information retrieval 、 Web page 、 Text query 、 Scientific literature 、 Sentence
摘要: The invention is a method for textual searching of text-based databases including compiled internet content, scientific literature, abstracts books and articles, newspapers, journals, the like. Specifically, algorithm supports searches using full-text or webpage as query keyword allowing multiple entries an information-content based ranking system (Shannon Information score) that uses p-values to represent likelihood hit due random matches. Additionally, users can specify parameters determine hits their with scoring on phrase matches sentence similarities.