• Corpus ID: 102351907

Diabetes Mellitus Forecasting Using Population Health Data in Ontario, Canada

  title={Diabetes Mellitus Forecasting Using Population Health Data in Ontario, Canada},
  author={Mathieu Ravaut and Hamed Sadeghi and Kin Kwan Leung and Maksims Volkovs and Laura C. Rosella},
Leveraging health administrative data (HAD) datasets for predicting the risk of chronic diseases including diabetes has gained a lot of attention in the machine learning community recently. In this paper, we use the largest health records datasets of patients in Ontario,Canada. Provided by the Institute of Clinical Evaluative Sciences (ICES), this database is age, gender and ethnicity-diverse. The datasets include demographics, lab measurements,drug benefits, healthcare system interactions… 

Figures and Tables from this paper

Prediction of Type 2 Diabetes Based on Machine Learning Algorithm

The performance of the prediction model proved to be reasonably good at forecasting the occurrence of type 2 diabetes in the Korean population and can provide clinicians and patients with valuable predictive information on the likelihood of developing T2D.

Diagnosis of Diabetes Mellitus Using Gradient Boosting Machine (LightGBM)

The experimental results show that the prepared diabetes dataset is informative to predict the condition of diabetes mellitus and the LightGBM model outperformed KNN, SVM, NB, Bagging, RF, and XGBoost in the case of the ZMHDD dataset.

Use of Machine Learning Approaches in Clinical Epidemiological Research of Diabetes

Machine learning approaches have the potential to improve risk stratification and outcome prediction for clinical epidemiology applications, and would be facilitated by use of universal open-source datasets for fair comparisons.

Automated Prediction of Liver Disease using Machine Learning (ML) Algorithms

The data was separated into two categories: patients with liver disease and sicknesses the most accurate machine learning method was used to predict the final result, and the best one is identified.



Population-Level Prediction of Type 2 Diabetes From Claims Data and Analysis of Risk Factors

We present a new approach to population health, in which data-driven predictive models are learned for outcomes such as type 2 diabetes. Our approach enables risk assessment from readily available

Type-2 Diabetes Mellitus Diagnosis from Time Series Clinical Data Using Deep Learning Models

Long Short-Term Memory (LSTM) and Gated-Recurrent Unit (GRU) are trained on KAIMRCD and are demonstrated here to outperform classical machine learning approaches in the literature with over 97% accuracy.

Early Detection of Diabetes from Health Claims

Based on patient data from insurance claims, the results of initial experiments into identification of patients who will develop diabetes are presented and the need to develop machine learning algorithms that can effectively deal with the depth and the variety of the data is motivated.

Predicting Diabetes Mellitus With Machine Learning Techniques

The results showed that prediction with random forest could reach the highest accuracy (ACC = 0.8084) when all the attributes were used and principal component analysis (PCA) and minimum redundancy maximum relevance (mRMR) was used to reduce the dimensionality.

Machine Learning for the Prediction of New-Onset Diabetes Mellitus during 5-Year Follow-up in Non-Diabetic Patients with Cardiovascular Risks

A T2DM prediction system using machine learning and an EMR database is developed and verified and it predicted the 5-year occurrence of type 2 diabetes mellitus similarly to with a traditional prediction model.

Diabetes in Ontario: determination of prevalence and incidence using a validated administrative data algorithm.

Administrative data can be used to establish population-based incidence and prevalence of diabetes, which is increasing in Ontario and is considerably higher than self-reported rates.

Prediction of Type 2 Diabetes by Hemoglobin A1c in Two Community-Based Cohorts

HbA1c predicts type 2 diabetes (T2D) in different common scenarios and is useful for identifying individuals with elevated T2D risk in both the short and long-term.

Combining the Power of Artificial Intelligence with the Richness of Healthcare Claims Data: Opportunities and Challenges

The identification of complex patterns within high dimensional datasets may find new predictors for early onset of diseases or lead to a more proactive offering of personalized preventive services using AI methods in healthcare.

The Impact of Big Data on Chronic Disease Management

Initial applications of big data analytics have proved useful in various phases of chronic disease management and could help reduce the chronic disease burden.