An Experimental Survey of MapReduce-Based Similarity Joins

Abstract

In recent years, Big Data systems and their main data processing framework MapReduce, have been introduced to efficiently process and analyze massive amounts of data. One of the key data processing and analysis operations is the Similarity Join (SJ), which finds similar pairs of objects between two datasets. The study of SJ techniques for Big Data systems… (More)
DOI: 10.1007/978-3-319-46759-7_14

Topics

6 Figures and Tables

Slides referencing similar topics