Colon cancer survival prediction using ensemble data mining on SEER data

Reda Al-Bahrani, Ankit Agrawal, Alok N. Choudhary
2013 IEEE International Conference on Big Data
We analyze the colon cancer data available from the SEER program with the aim of developing accurate survival prediction models for colon cancer. Carefully designed preprocessing steps removed several attributes, after which we applied several supervised classification methods. We also adopt the synthetic minority over-sampling technique (SMOTE) to balance the survival and non-survival classes. In our experiments, ensemble voting of three of the top-performing classifiers was found…
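The two techniques named in the abstract, SMOTE and ensemble voting, can be illustrated with a minimal standard-library sketch. It is not the authors' implementation. The function names `smote` and `majority_vote`, the parameter `k`, and the toy data are hypothetical. The abstract does not say how the votes are combined, so hard majority voting is an assumption.

```python
import random
from collections import Counter


def smote(minority, n_new, k=3, seed=0):
    """Create n_new synthetic minority samples (needs at least two samples).

    Each synthetic point lies on the line segment between a randomly chosen
    minority sample and one of its k nearest minority neighbors.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        others = [p for p in minority if p is not base]
        # Rank the remaining minority points by squared Euclidean distance.
        neighbors = sorted(
            others,
            key=lambda p: sum((a - b) ** 2 for a, b in zip(base, p)),
        )[:k]
        neighbor = rng.choice(neighbors)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append(
            tuple(a + gap * (b - a) for a, b in zip(base, neighbor))
        )
    return synthetic


def majority_vote(predictions):
    """Hard voting: the most common label per sample across classifiers."""
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*predictions)]


# Toy minority-class points (hypothetical, not SEER data).
minority = [(1.0, 2.0), (1.5, 1.8), (2.0, 2.2), (1.2, 2.5)]
new_points = smote(minority, n_new=4)

# Label predictions from three hypothetical classifiers on three samples.
votes = majority_vote([[1, 0, 1], [1, 1, 0], [0, 0, 1]])
```

With three classifiers and two classes, a hard vote can never tie, which is one practical reason to combine an odd number of models.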


