Combination of PCA with SMOTE Resampling to Boost the Prediction Rate in Lung Cancer Dataset

@article{Naseriparsa2013CombinationOP,
  title={Combination of PCA with SMOTE Resampling to Boost the Prediction Rate in Lung Cancer Dataset},
  author={Mehdi Naseriparsa and Mohammad Mansour Riahi Kashani},
  journal={CoRR},
  year={2013},
  volume={abs/1403.1949}
}
Classification algorithms are unable to make reliable models on the datasets with huge sizes. These datasets contain many irrelevant and redundant features that mislead the classifiers. Furthermore, many huge datasets have imbalanced class distribution which leads to bias over majority class in the classification process. In this paper combination of unsupervised dimensionality reduction methods with resampling is proposed and the results are tested on LungCancer dataset. In the first step PCA… CONTINUE READING
Recent Discussions
This paper has been referenced on Twitter 1 time over the past 90 days. VIEW TWEETS
7 Citations
14 References
Similar Papers

References

Publications referenced by this paper.
Showing 1-10 of 14 references

UCI Repository of machine learning databases, http://www.ics.uci.edu/~mlearn/MLRepository.html, University of California

  • Mertz C.J, P. M. Murphy
  • 2013
1 Excerpt

Data Mining A Knowledge Discovery Approach

  • J. Krzysztof, P. Witold, W. Roman, A. Lukasz
  • 2007
1 Excerpt

Similar Papers

Loading similar papers…