Mutant: Balancing Storage Cost and Latency in LSM-Tree Data Stores

作者: Hobin Yoon , Juncheng Yang , Sveinn Fannar Kristjansson , Steinn E. Sigurdarson , Ymir Vigfusson

DOI: 10.1145/3267809.3267846

关键词: Merge (version control)Computer scienceMutantCloud computingOperating systemCloud databaseStorage typeLatency (engineering)Data migrationWorkload

摘要: Today's cloud database systems are not designed for seamless cost-performance trade-offs changing SLOs. Database engineers have a limited number of due to the storage types offered by vendors, and switching different type requires time-consuming data migration new database. We propose Mutant, layer log-structured merge tree (LSM-tree) stores that dynamically balances cost performance organizing SSTables (files store subset records) into based on SSTable access frequencies. implemented Mutant extending RocksDB found in our evaluation delivers with YCSB workload real-world trace. Moreover, through additional optimizations, lowers user-perceived latency significantly compared unmodified

参考文章(19)
Viswanath Sivakumar, Wyatt Lloyd, Linpeng Tang, Sanjeev Kumar, Sabyasachi Roy, Satadru Pan, Ernest Lin, Subramanian Muralidhar, Cory Hill, Shiva Shankar, Weiwen Liu, f4: Facebook's warm BLOB storage system operating systems design and implementation. pp. 383- 398 ,(2014) , 10.5555/2685048.2685078
Frank P. Incropera, Fundamentals of Heat and Mass Transfer ,(1985)
Mario Blaum, David A. Pease, Mingyuan Xia, Mohit Saxena, A tale of two erasure codes in HDFS file and storage technologies. pp. 213- 226 ,(2015) , 10.5555/2750482.2750499
Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber, Bigtable ACM Transactions on Computer Systems. ,vol. 26, pp. 1- 26 ,(2008) , 10.1145/1365815.1365816
Brian F. Cooper, Adam Silberstein, Erwin Tam, Raghu Ramakrishnan, Russell Sears, Benchmarking cloud serving systems with YCSB Proceedings of the 1st ACM symposium on Cloud computing - SoCC '10. pp. 143- 154 ,(2010) , 10.1145/1807128.1807152
Christopher Jermaine, Edward Omiecinski, Wai Gen Yee, The partitioned exponential file for database storage management very large data bases. ,vol. 16, pp. 417- 437 ,(2007) , 10.1007/S00778-005-0171-7
Russell Sears, Raghu Ramakrishnan, bLSM Proceedings of the 2012 international conference on Management of Data - SIGMOD '12. pp. 217- 228 ,(2012) , 10.1145/2213836.2213862
Dennis Colarelli, Dirk Grunwald, Massive Arrays of Idle Disks For Storage Archives conference on high performance computing (supercomputing). pp. 1- 11 ,(2002) , 10.5555/762761.762819
Patrick O’Neil, Edward Cheng, Dieter Gawlick, Elizabeth O’Neil, The log-structured merge-tree (LSM-tree) Acta Informatica. ,vol. 33, pp. 351- 385 ,(1996) , 10.1007/S002360050048
Anders Brodersen, Salvatore Scellato, Mirjam Wattenhofer, YouTube around the world: geographic popularity of videos the web conference. pp. 241- 250 ,(2012) , 10.1145/2187836.2187870