The Anatomy of Mitos Web Search Engine

Authors: Yannis Tzitzikas, Yannis Theoharis, Nikos Armenatzoglou, Georgia Troullinou, Yannis Marketakis

Abstract: Engineering a Web search engine that offers effective and efficient information retrieval is a challenging task. This document presents our experiences from designing and developing a Web search engine with a wide spectrum of functionalities, and reports some interesting experimental results. A rather peculiar design choice is that its index is based on a DBMS, while the distinctive functionalities offered include advanced Greek language stemming, real-time result clustering, and link analysis techniques (also used for spam page detection).
