Real datasets for file-sharing peer-to-peer systems

Authors: Shen Tat Goh, Panos Kalnis, Spiridon Bakiras, Kian-Lee Tan

DOI: 10.1007/11408079_19

Abstract: The fundamental drawback of unstructured peer-to-peer (P2P) networks is the flooding-based query processing protocol, which seriously limits their scalability. As a result, a significant amount of research work has focused on designing efficient search protocols that reduce the overall communication cost. What is lacking, however, is the availability of real data regarding the exact content of users' libraries and the queries that these users ask. Using trace-driven simulations will clearly generate more meaningful results and further illustrate the efficiency of a generic query processing protocol under a real-life scenario. Motivated by this fact, we developed a Gnutella-style probe and collected detailed data over a period of two months. They involve around 4,500 users and contain the files shared by each user, together with any available metadata (e.g., artist for songs) and information about the nodes (e.g., connection speed). We also collected the queries initiated by these users. After filtering, the data were organized in XML format and are available to researchers. Here, we analyze this dataset and present its statistical characteristics. Additionally, as a case study, we employ it to evaluate recently proposed P2P searching techniques.

References (14)
Hector Garcia-Molina, Beverly Yang, Venkata Gopal K Addada. Efficient search in peer to peer networks (2004).
S. Bakiras, P. Kalnis, T. Loukopoulos, Wee Siong Ng. A general framework for searching in distributed data repositories. International Parallel and Distributed Processing Symposium, pp. 34- (2003). DOI: 10.1109/IPDPS.2003.1213117
Michalis Faloutsos, Petros Faloutsos, Christos Faloutsos. On power-law relationships of the Internet topology. ACM Special Interest Group on Data Communication, vol. 29, pp. 251-262 (1999). DOI: 10.1145/316188.316229
Stefan Saroiu, P. Krishna Gummadi, Steven D. Gribble. Measurement study of peer-to-peer file sharing systems. Multimedia Computing and Networking 2002, vol. 4673, pp. 156-170 (2001). DOI: 10.1117/12.449977
K.L. Calvert, M.B. Doar, E.W. Zegura. Modeling Internet topology. IEEE Communications Magazine, vol. 35, pp. 160-163 (1997). DOI: 10.1109/35.587723
B. Beverly Yang, H. Garcia-Molina. Designing a super-peer network. International Conference on Data Engineering, pp. 49-60 (2003). DOI: 10.1109/ICDE.2003.1260781
B. Yang, H. Garcia-Molina. Improving search in peer-to-peer networks. International Conference on Distributed Computing Systems, pp. 5-14 (2002). DOI: 10.1109/ICDCS.2002.1022237
S. Sen, J. Wang. Analyzing peer-to-peer traffic across large networks. IEEE/ACM Transactions on Networking, vol. 12, pp. 219-232 (2004). DOI: 10.1109/TNET.2004.826277
Krishna P. Gummadi, Richard J. Dunn, Stefan Saroiu, Steven D. Gribble, Henry M. Levy, John Zahorjan. Measurement, modeling, and analysis of a peer-to-peer file-sharing workload. Symposium on Operating Systems Principles, vol. 37, pp. 314-329 (2003). DOI: 10.1145/1165389.945475
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan. Chord. Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications - SIGCOMM '01, vol. 31, pp. 149-160 (2001). DOI: 10.1145/383059.383071