System and method for creating deduplicated copies of data by tracking temporal relationships among copies using higher-level hash structures

作者: Mark A. Roman , Christopher A. Provenzano

DOI:

关键词: Signature (logic)Hash functionData miningHash treeStructure (mathematical logic)Information retrievalContent (measure theory)Tracking (particle physics)Computer scienceCacheTemporal logic

摘要: Systems and methods are disclosed for forming deduplicated images of a data object that changes over time using difference information between temporal states the object. The method includes organizing content first state as plurality segments storing in store; creating an organized arrangement hash structures to represent its state; receiving object; at least one signature changed content; is unique store segments. also determining, subsequent deduplicating store, whether should be stored by searching higher-level structure global cache store.

参考文章(130)
Robert Fair, James Lentini, Daniel Ellard, Andy Kahn, John K. Edwards, Keith A. Smith, Arkady Kanevsky, Craig Everhart, Edward Zayas, Ashish Prakash, Eric Hamilton, FlexVol: flexible, efficient file volume virtualization in WAFL usenix annual technical conference. pp. 129- 142 ,(2008)
Petros Efstathopoulos, Fanglu Guo, Building a high-performance deduplication system usenix annual technical conference. pp. 25- 25 ,(2011)
Kai Li, Hugo Patterson, Benjamin Zhu, Avoiding the disk bottleneck in the data domain deduplication file system file and storage technologies. pp. 18- ,(2008)
Russell R. Stringham, Client side data deduplication ,(2008)
Hiroshi Ogasawara, Hitoshi Kamei, Takahiro Nakano, Storage network system and its control method ,(2010)