Data deduplication

Known as: Deduplication, De-duplication, Duplication

In computing, data deduplication is a specialized data compression technique for eliminating duplicate copies of repeating data. Related and somewhat…

Wikipedia

Papers overview

Semantic Scholar uses AI to extract papers important to this topic.

2016

Code Randomization: Haven’t We Solved This Problem Yet?

Two decades since the idea of using software diversity for security was put forward, ASLR is the only technique to see widespread…

2016

A New Duplication Task Scheduling Algorithm in Heterogeneous Distributed Computing Systems

Aida A. NasrNirmeen A. El-BahnasawyA. El-Sayed
2016
Corpus ID: 64506012

The efficient scheduling algorithm is critical to achieve high performance in parallel and distributed systems. The main…

Review

2015

Review

2015

A Practioner's Guide to Evaluating Entity Resolution Results

Matt Barnes
arXiv.org
2015
Corpus ID: 1975403

Entity resolution (ER) is the task of identifying records belonging to the same entity (e.g. individual, group) across one or…

Review

2013

Review

2013

Exploratory Patent Search with Faceted Search and Configurable Entity Mining

P. FafaliosM. SalampasisYannis Tzitzikas
2013
Corpus ID: 15907819

Searching for patents is usually a recall-oriented problem and depending on the patent search type, quite often a problem which…

2012

Minimizing remote storage usage and synchronization time using deduplication and multichunking: Syncany as an example

P. Heckel
2012
Corpus ID: 177517181

School of Business Informatics and Mathematics Laboratory for Dependable Distributed Systems

2009

An Efficient Indexing Mechanism for Data Deduplication

Tin Thein ThwelN. Thein
International Conference on the Current Trends in…
2009
Corpus ID: 16871976

At present, there is a vast amount of duplicated data or redundant data in storage systems. Data de-duplication can eliminate…

Review

2006

Review

2006

Backup & Recovery

W. C. Preston
2006
Corpus ID: 86629619

Packed with practical, freely available backup and recovery solutions for Unix, Linux, Windows, and Mac OS X systems -- as well…

Review

2005

Review

2005

Assessing Deduplication and Data Linkage Quality: What to Measure?

P. ChristenKarl Goiser
2005
Corpus ID: 10880077

Deduplicating one data set or linking several data sets are increasingly important tasks in the data preparation steps of many…

1993

VLSI concurrent error correcting adders and multipliers

Y. HsuE. Swartzlander
Proceedings of IEEE International Workshop on…
1993
Corpus ID: 424952

Time redundancy is an approach to achieve fault-tolerance without introducing excessive hardware that can be used in applications…

1953

Duplication of the entire colon, bladder, and urethra.

Ravitch MmScott Ww
1953
Corpus ID: 77424014

Data deduplication

Related topics

Papers overview