A Grid-Based k-Nearest Neighbor Join for Large Scale Datasets on MapReduce

@article{Jang2015AGK,
  title={A Grid-Based k-Nearest Neighbor Join for Large Scale Datasets on MapReduce},
  author={Mi-Young Jang and Young-Sung Shin and Jae-Woo Chang},
  journal={2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems},
  year={2015},
  pages={888-891}
}
Because MapReduce supports efficient parallel data processing, MapReduce-based query processing algorithms have been widely studied. Among various query types, k-nearest neighbor join, which aims to produce the k nearest neighbors of each point of a dataset from another dataset, has been considered most important in data analysis. Existing k-NN join query… CONTINUE READING