Constructing collaborative desktop storage caches for large scientific datasets

Authors: Sudharshan S. Vazhkudai, Xiaosong Ma, Vincent W. Freeh, Jonathan W. Strickland, Nandan Tammineedi

DOI: 10.1145/1168910.1168911

Abstract: High-end computing is suffering a data deluge from experiments, simulations, and apparatus that creates overwhelming application dataset sizes. This has led to the proliferation of high-end mass storage systems, storage area clusters, and data centers. These storage facilities offer a large range of choices in terms of capacity and access rate, as well as strong data availability and consistency support. However, for most end-users, the "last mile" in their analysis pipeline often requires data processing and visualization at local computers, typically local desktop workstations. End-user workstations, despite having more processing power than ever before, are ill-equipped to cope with such data demands due to insufficient secondary storage space and I/O rates. Meanwhile, a large portion of desktop storage is unused. We propose the FreeLoader framework, which aggregates unused desktop storage space and I/O bandwidth into a shared cache/scratch space, for hosting large, immutable datasets and exploiting data access locality. This article presents the FreeLoader architecture, component design, and performance results based on our proof-of-concept prototype. Its architecture comprises contributing benefactor nodes, steered by a management layer, providing services such as data integrity, high performance, load balancing, and impact control. Our experiments show that FreeLoader is an appealing low-cost solution to storing massive datasets, delivering higher data access rates than traditional storage facilities, namely, local or remote shared file systems, storage systems, and Internet data repositories. In particular, we present novel data striping techniques that allow FreeLoader to efficiently aggregate a workstation's network communication bandwidth and local I/O bandwidth. In addition, the performance impact on the native workload of donor machines is small and can be effectively controlled. Further, we show that security features such as data encryption and integrity checks can be easily added as filters for interested clients. Finally, we demonstrate how legacy applications can use the FreeLoader API to store and retrieve datasets.
