• Corpus ID: 18745929

Diagnosis of Lung Cancer Prediction System Using Data Mining Classification Techniques

  title={Diagnosis of Lung Cancer Prediction System Using Data Mining Classification Techniques},
  author={V. V. Jayarama Krishnaiah and Gugulothu Narsimha and N. Subhash Chandra},
Cancer is the most important cause of death for both men and women. The early detection of cancer can be helpful in curing the disease completely. So the requirement of techniques to detect the occurrence of cancer nodule in early stage is increasing. A disease that is commonly misdiagnosed is lung cancer. Earlier diagnosis of Lung Cancer saves enormous lives, failing which may lead to other severe problems causing sudden fatal end. Its cure rate and prediction depends mainly on the early… 

Figures and Tables from this paper

Mining Techniques for Clinical Expert System and Predicting and Treating Lung Cancer with Big Data
The aim of the paper is to suggest a model for early detection and correct diagnosis of the disease using which will help the doctor in saving the life of the patient.
A Survey on Lung Cancer Diagnosis Using Novel Methods
This paper aims to study on the process of detecting and diagnosing lung diseases in the early stages so as to help doctors to save patient’s life.
Application of Machine Learning for Prediction of Lung Cancer using Omics Data
Different ways to detect and predict lung cancer using latest Machine learning algorithms combined with data mining has been analyzed.
Classification Based Analysis on Cancer Datasets Using Predictor Measures
This paper focuses on prediction of some such diseases like Leukemia and Breast cancers by using Naive Bayes and SVM prediction models built for the prediction and classification.
Early Detection and Prevention of Cancer using Data Mining Techniques
A novel multi layered method combining clustering and decision tree techniques to build a cancer risk prediction system is proposed here which predicts lung, breast, oral, cervix, stomach and blood cancers and is also user friendly, time and cost saving.
A Critical Study of Classification Algorithms for LungCancer Disease Detection and Diagnosis
The main aim of this paper is to provide the earlier warning to the users and to measure the performance analysis of the classification algorithms using WEKA Tool.
Data mining techniques for lung and breast cancer diagnosis: A review
  • B. Ahmed
  • Computer Science
    International Journal of Informatics and Communication Technology (IJ-ICT)
  • 2021
A comparison has been done among various reviewed papers in terms of accuracy that used various data-mining classification algorithms to propose the lung and breast cancer diagnosis system that showed that the Multilayer Perceptron (MLP) and Logistic Regression (LR) gave a higher accuracy.
Early Detection of Lung Cancer using Data Mining Techniques: A Survey
The present article surveys the role of different Data mining paradigms, like Decision Trees (DT), Artificial Neural Networks (ANN), Association Rule Mining (ARM) and Bayesian Classifier in lung cancer diagnosis.
An Analysis and Survey on Future Trends of Data Mining in Prediction and Recurrence of Lung , Breast and Liver Cancer
The past work done on the prediction of lung, breast and liver cancer, three of the most fatal diseases are reviewed to provide a succinct and concise overview of the work done in this field.
Detection of Lung Cancer Using Backpropagation Neural Networks and Genetic Algorithm
This system will help to draw an appropriate decision about a particular patient’s state by using Genetic Algorithm to extract as well as select features on the basis of the fitness of the features extracted.


" Intelligent Heart Disease Prediction System Using Data Mining Techniques "
There is need of developing a master's project which will help practitioners predict the heart disease before it occurs and how to turn data into useful information that can enable healthcare practitioners to make intelligent clinical decisions.
Empirical Study on Applications of Data Mining Techniques in Healthcare
The potential use of classification based data mining techniques such as Rule based, decision tree and Artificial Neural Network to massive volume of healthcare data is examined.
An expert system for detection of breast cancer based on association rules and neural network
Impact of prediagnosis smoking, alcohol, obesity, and insulin resistance on survival in male cancer patients: National Health Insurance Corporation Study.
Prediagnosis risk factors for cancer development (smoking, alcohol consumption, obesity, and insulin resistance) had a statistically significant effect on survival among male cancer patients.
Microarray data classified by artificial neural networks.
The current work reviews advantages as well as disadvantages of neural networks in the context of microarray analysis, and selected solutions for the effective combination of multiple ANNs are presented.
Cancer Research in ICMR Achievements in Nineties
  • Medicine
  • 2000
The focus of research in the field of cancer has been on the aetiology with identification of preventable risk factors, understand the mechanism of carcinogenesis and on operational research for control of tobacco use and common cancers through existing infrastructures.
Gene selection and classification of microarray data using random forest
It is shown that random forest has comparable performance to other classification methods, including DLDA, KNN, and SVM, and that the new gene selection procedure yields very small sets of genes (often smaller than alternative methods) while preserving predictive accuracy.
One Dependence Augmented Naive Bayes
The motivation is to develop a new algorithm to improve Naive Bayes' performance not only on classification measured by accuracy but also on ranking measured by AUC, and the experimental results show that the algorithm outperforms all the other algorithms significantly in yielding accurate ranking, yet at the same time outperform all theother algorithms slightly in terms of classification accuracy.
Random Forests
Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.