Mining medical datasets is a challenging problem for data mining researchers because these datasets pose several hidden challenges that conventional datasets do not. From the collection of samples through field experiments and clinical trials to the final classification step, challenges arise at every stage of the mining process. The preprocessing phase is itself difficult when working with medical datasets. The main contribution of this research is a detailed survey that opens a discussion not yet initiated in research papers published in the fields of medical and health informatics. We have made a sincere effort to do so and aim to bring out the various research issues associated with disease prediction from the perspective of data mining. We also discuss the nature of medical disease datasets before turning our attention to prediction or classification.