An Experimental Survey of MapReduce-Based Similarity Joins


In recent years, Big Data systems and their main data processing framework, MapReduce, have been introduced to efficiently process and analyze massive amounts of data. One of the key data processing and analysis operations is the Similarity Join (SJ), which finds similar pairs of objects between two datasets. The study of SJ techniques for Big Data systems…
DOI: 10.1007/978-3-319-46759-7_14


6 Figures and Tables
