Massively Parallel Feature Selection: An Approach Based on Variance Preservation

@inproceedings{Zhao2012MassivelyPF,
  title={Massively Parallel Feature Selection: An Approach Based on Variance Preservation},
  author={Zheng Zhao and James Cox and David Duling and Warren Sarle},
  booktitle={ECML/PKDD},
  year={2012}
}
Advances in computer technologies have enabled corporations to accumulate data at an unprecedented speed. Large-scale business data might contain billions of observations and thousands of features, which easily brings their scale to the level of terabytes. Most traditional feature selection algorithms are designed for a centralized computing architecture. Their usability significantly deteriorates when data size exceeds hundreds of gigabytes. High-performance distributed computing frameworks… CONTINUE READING

Similar Papers

Topics from this paper.

Citations

Publications citing this paper.
SHOWING 1-10 OF 36 CITATIONS

A distributed feature selection scheme with partial information sharing

  • Machine Learning
  • 2019
VIEW 4 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

A large-scale filter method for feature selection based on spark

  • 2017 IEEE 4th International Conference on Soft Computing & Machine Intelligence (ISCMI)
  • 2017
VIEW 5 EXCERPTS
CITES BACKGROUND
HIGHLY INFLUENCED

Bioinformatics and Biomedical Engineering

  • Lecture Notes in Computer Science
  • 2016
VIEW 10 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

New Trends in Databases and Information Systems

  • Communications in Computer and Information Science
  • 2015
VIEW 11 EXCERPTS
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

Innovations and Interdisciplinary Solutions for Underserved Areas

  • Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
  • 2018