Data deduplication with adaptive erasure code redundancy
MetadataShow full item record
CitationArslan, S.S., Wideman, R., Lee, J., & Goker, T. (January 14, 2016). (Patent). Data Deduplication with Adaptive Erasure Code Redundancy. Pub. No: US20160013815A1. Quantum Corporation, San Jose, CA (USA). pp. 1-20.
Example apparatus and methods combine erasure coding with data deduplication to simultaneously reduce the overall redundancy in data while increasing the redundancy of unique data. In one embodiment, an efficient representation of a data set is produced by deduplication. The efficient rep resentation reduces duplicate data in the data set. Redundancy is then added back into the data set using erasure coding. The redundancy that is added back in adds protection to the unique data associated with the efficient representation. How much redundancy is added back in and what type of redundancy is added back in may be controlled based on an attribute (e.g., value, reference count, symbol size, number of symbols) of the unique data. Decisions concerning how much and what type of redundancy to add back in may be adapted over time based, for example, on observations of the efficiency of the overall system.