Data deduplication
Known as: Deduplication, De-duplication, Duplication
In computing, data deduplication is a specialized data compression technique for eliminating duplicate copies of repeating data. Related and somewhat…
Wikipedia
Related topics
49 relations
Amazon Elastic Compute Cloud (EC2)
Backup
BackupPC
Cloud storage
Papers overview
Semantic Scholar uses AI to extract papers important to this topic.
2016
Code Randomization: Haven’t We Solved This Problem Yet?
Stephen Crane, Andrei Homescu, Per Larsen
IEEE Cybersecurity Development
2016
Corpus ID: 16757491
Two decades since the idea of using software diversity for security was put forward, ASLR is the only technique to see widespread…
2016
A New Duplication Task Scheduling Algorithm in Heterogeneous Distributed Computing Systems
Aida A. Nasr, Nirmeen A. El-Bahnasawy, A. El-Sayed
2016
Corpus ID: 64506012
The efficient scheduling algorithm is critical to achieve high performance in parallel and distributed systems. The main…
Review
2015
A Practioner's Guide to Evaluating Entity Resolution Results
Matt Barnes
arXiv.org
2015
Corpus ID: 1975403
Entity resolution (ER) is the task of identifying records belonging to the same entity (e.g. individual, group) across one or…
Review
2013
Exploratory Patent Search with Faceted Search and Configurable Entity Mining
P. Fafalios, M. Salampasis, Yannis Tzitzikas
2013
Corpus ID: 15907819
Searching for patents is usually a recall-oriented problem and depending on the patent search type, quite often a problem which…
2012
Minimizing remote storage usage and synchronization time using deduplication and multichunking: Syncany as an example
P. Heckel
2012
Corpus ID: 177517181
School of Business Informatics and Mathematics Laboratory for Dependable Distributed Systems
2009
An Efficient Indexing Mechanism for Data Deduplication
Tin Thein Thwel, N. Thein
International Conference on the Current Trends in…
2009
Corpus ID: 16871976
At present, there is a vast amount of duplicated data or redundant data in storage systems. Data de-duplication can eliminate…
Review
2006
Backup & Recovery
W. C. Preston
2006
Corpus ID: 86629619
Packed with practical, freely available backup and recovery solutions for Unix, Linux, Windows, and Mac OS X systems -- as well…
Review
2005
Assessing Deduplication and Data Linkage Quality: What to Measure?
P. Christen, Karl Goiser
2005
Corpus ID: 10880077
Deduplicating one data set or linking several data sets are increasingly important tasks in the data preparation steps of many…
1993
VLSI concurrent error correcting adders and multipliers
Y. Hsu, E. Swartzlander
Proceedings of IEEE International Workshop on…
1993
Corpus ID: 424952
Time redundancy is an approach to achieve fault-tolerance without introducing excessive hardware that can be used in applications…
1953
Duplication of the entire colon, bladder, and urethra.
Ravitch Mm, Scott Ww
1953
Corpus ID: 77424014