“Can NLP techniques be utilized as a reliable tool for medical science?” - Building a NLP Framework to Classify Medical Reports

  title={“Can NLP techniques be utilized as a reliable tool for medical science?” - Building a NLP Framework to Classify Medical Reports},
  author={Nafiz Sadman and Sumaiya Tasneem and Ariful Haque and Maminur Islam and Md. Manjurul Ahsan and Kishor Datta Gupta},
  journal={2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)},
Artificial intelligence persists on being a right-hand tool for many branches of biology. From preliminary advices and treatments, such as understanding if symptoms related to fever or cold, to critical detection of cancerous cell or classification of X-rays, traditional machine learning and deep learning techniques achieved remarkable feats. However, total dependency on machine-based prediction is yet a far fetched concept. In this paper, we provide a framework utilizing several Natural… 
Data Integration Challenges for Machine Learning in Precision Medicine
A number of challenges arise in the design of successful designs to medical data analytics under currently demanding conditions of performance in personalized medicine, while also subject to time, computational power, and bioethical constraints.
Recommend Speciality Doctor from Health Transcription: Ensemble Machine Learning Approach
A recommendation system that will recommend a doctor based on the result of ensemble machine learning (ML) and natural language processing (NLP) techniques and shows promising results, which indicate that this approach can be utilized in the health care industry.
Industry 4.0 in Healthcare: A systematic review
  • M. Ahsan, Z. Siddique
  • Computer Science
    International Journal of Information Management Data Insights
  • 2022


Machine learning applications in cancer prognosis and prediction
Natural Language Processing Based Instrument for Classification of Free Text Medical Records
The present work introduces the instrument for classification of medical records based on the Georgian language, the first attempt of such classification of the Georgia language based medical records, and the overall results of the study were successful.
CIDMP: Completely Interpretable Detection of Malaria Parasite in Red Blood Cells using Lower-dimensional Feature Space
An approach to extract a very small number of aggregated features that are easy to interpret and compute, and empirically show that it is possible to obtain high prediction accuracy even with a significantly reduced feature-space.
Automated identification of extreme-risk events in clinical incident reports
Text classification techniques can be applied effectively to automate the detection of extreme-risk events in clinical incident reports and higher prediction accuracy was attained on the specialized dataset, compared with the multi-type dataset.
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
Deep Learning and Its Applications in Biomedicine
Ensemble Learning
  • Gavin Brown
  • Computer Science
    Encyclopedia of Machine Learning and Data Mining
  • 2017
In this paper, we consider ensemble classifiers, that is, machine learning based classifiers that utilize a combination of scoring functions. We provide a framework for categorizing such classifiers,
Classification and Regression by randomForest
random forests are proposed, which add an additional layer of randomness to bagging and are robust against overfitting, and the randomForest package provides an R interface to the Fortran programs by Breiman and Cutler.
Naive Bayes for Text Classification with Unbalanced Classes
This paper proposes an appropriate correction by adjusting attribute priors that can significantly improve the area under the ROC curve and shows that the modified version of MNB is very closely related to the simple centroid-based classifier and compare the two methods empirically.
Universal Sentence Encoder
It is found that transfer learning using sentence embeddings tends to outperform word level transfer with surprisingly good performance with minimal amounts of supervised training data for a transfer task.