Location-Based Top-k Term Querying over Sliding Window

作者: Ying Xu , Lisi Chen , Bin Yao , Shuo Shang , Shunzhi Zhu

DOI: 10.1007/978-3-319-68783-4_21

关键词:

摘要: In part due to the proliferation of GPS-equipped mobile devices, massive svolumes geo-tagged streaming text messages are becoming available on social media. It is great interest discover most frequent nearby terms from such tremendous stream data. this paper, we present novel indexing, updating, and query processing techniques that capable discovering top-k locally popular over a sliding window. Specifically, given location set within window, study problem searching for by considering both term frequency proximities between containing location. We develop efficient mechanism solve problem, including quad-tree based indexing structure, update technique, best-first algorithm. An empirical conducted show our proposed fit users’ requirements through varying number parameters.

参考文章(48)
Makbule Gulcin Ozsoy, Kezban Dilek Onal, Ismail Sengor Altingovde, Result Diversification for Tweet Search Web Information Systems Engineering – WISE 2014. pp. 78- 89 ,(2014) , 10.1007/978-3-319-11746-1_6
Gurmeet Singh Manku, Rajeev Motwani, Chapter 31 – Approximate Frequency Counts over Data Streams very large data bases. pp. 346- 357 ,(2002) , 10.1016/B978-155860869-6/50038-X
Nick Koudas, Nilesh Bansal, BlogScope: a system for online analysis of high volume text streams very large data bases. pp. 1410- 1413 ,(2007)
Moses Charikar, Kevin Chen, Martin Farach-Colton, Finding Frequent Items in Data Streams international colloquium on automata languages and programming. ,vol. 312, pp. 693- 703 ,(2002) , 10.1016/S0304-3975(03)00400-6
Ahmed Metwally, Divyakant Agrawal, Amr El Abbadi, Efficient Computation of Frequent and Top-k Elements in Data Streams Database Theory - ICDT 2005. pp. 398- 412 ,(2004) , 10.1007/978-3-540-30570-5_27
Erik D. Demaine, Alejandro López-Ortiz, J. Ian Munro, Frequency Estimation of Internet Packet Streams with Limited Space european symposium on algorithms. pp. 348- 360 ,(2002) , 10.1007/3-540-45749-6_33
João B. Rocha-Junior, Orestis Gkorgkas, Simon Jonassen, Kjetil Nørvåg, Efficient processing of top-k spatial keyword queries symposium on large spatial databases. pp. 205- 222 ,(2011) , 10.1007/978-3-642-22922-0_13
Ahmed Metwally, Divyakant Agrawal, Amr El Abbadi, An integrated efficient solution for computing frequent and top- k elements in data streams ACM Transactions on Database Systems. ,vol. 31, pp. 1095- 1133 ,(2006) , 10.1145/1166074.1166084
Yang Li, Feifei Li, Ke Yi, Bin Yao, Min Wang, Flexible aggregate similarity search international conference on management of data. pp. 1009- 1020 ,(2011) , 10.1145/1989323.1989429
Shuo Shang, Ruogu Ding, Kai Zheng, Christian S. Jensen, Panos Kalnis, Xiaofang Zhou, Personalized trajectory matching in spatial networks very large data bases. ,vol. 23, pp. 449- 468 ,(2014) , 10.1007/S00778-013-0331-0