A system for keyword search on textual streams

Authors: Vagelis Hristidis, Oscar Valdivia, Philip S. Yu, Michail Vlachos

Abstract: An increasing amount of data is produced in the form of text streams, such as RSS news feeds, TV closed captions, and emails. We study the problem of answering keyword queries on multiple textual streams. We define the result of a keyword query inspired by previous work on keyword search on static databases. A result to a query is a combination of streams "sufficiently correlated" to each other that collectively contain all query keywords within a specified time span. On the algorithmic side, in this paper we focus on the component that continuously monitors the streams and outputs results as soon as they are available.
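
The result definition in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: it is a minimal Python toy that only checks the "all keywords within a specified time span" condition across several streams and ignores the "sufficiently correlated" requirement entirely. The class name `KeywordWindowMonitor`, the method `on_item`, the stream identifiers, and the numeric timestamps are all hypothetical names chosen for illustration.

```python
import re
from typing import Dict, Iterable, Optional, Tuple


class KeywordWindowMonitor:
    """Toy monitor (an assumption, not the paper's method): reports when every
    query keyword has been seen, on any stream, within the last `span` time
    units. Correlation between streams is deliberately ignored."""

    def __init__(self, keywords: Iterable[str], span: float) -> None:
        self.keywords = {k.lower() for k in keywords}
        self.span = span
        # keyword -> (timestamp, stream_id) of its most recent occurrence
        self.latest: Dict[str, Tuple[float, str]] = {}

    def on_item(self, stream_id: str, timestamp: float,
                text: str) -> Optional[Dict[str, str]]:
        """Process one stream item; timestamps are assumed nondecreasing.

        Returns a keyword -> stream_id mapping if this item completes a
        result, otherwise None."""
        tokens = set(re.findall(r"\w+", text.lower()))
        matched = self.keywords & tokens
        if not matched:
            return None  # item contributes nothing to the query
        for kw in matched:
            self.latest[kw] = (timestamp, stream_id)
        cutoff = timestamp - self.span
        # A result exists if every keyword was seen at or after the cutoff.
        if all(kw in self.latest and self.latest[kw][0] >= cutoff
               for kw in self.keywords):
            return {kw: self.latest[kw][1] for kw in self.keywords}
        return None


monitor = KeywordWindowMonitor(["merger", "oil"], span=60)
first = monitor.on_item("rss", 0, "Oil prices rise")           # "merger" not seen yet
second = monitor.on_item("tv", 30, "Merger talks continue")    # both keywords within 60
third = monitor.on_item("email", 200, "Merger approved")       # "oil" is now too old
```

Here `second` maps each keyword to the stream that most recently supplied it, while `first` and `third` are `None`. Keeping only the latest occurrence per keyword is enough for this simplified check; a version that also scored stream correlation would need more state.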
