Corpus ID: 31856582

Assessing Gradient Boosting in the Reduction of Misclassification Error in the Prediction of Success for Actuarial Majors

Alan Olinsky, Kristin Kennedy, Bonnie Kennedy
This paper provides a relatively new technique for predicting the retention of students in an actuarial mathematics program. The authors utilize data from a previous research study. In that study, logistic regression, classification trees, and neural networks were compared. The neural networks (with prior imputation of missing data) and classification trees (with no imputation required) were most accurate. However, in this paper, we examine the use of gradient boosting to improve the accuracy… 
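The comparison the abstract describes, a single classification tree versus gradient boosting judged by misclassification error, can be illustrated with a minimal sketch. This is not the authors' procedure or their data: the earlier study used SAS Enterprise Miner, whereas the sketch below uses plain Python, synthetic data with a diagonal class boundary, one-split trees (stumps), and least-squares boosting thresholded at 0.5. All function names, the learning rate, and the number of rounds are assumptions chosen for illustration.

```python
import random

def fit_stump(X, r):
    """Fit a one-split regression tree to targets r by minimizing squared error."""
    n, d = len(X), len(X[0])
    total = sum(r)
    best = None
    for j in range(d):
        order = sorted(range(n), key=lambda i: X[i][j])
        left_sum = 0.0
        for k in range(1, n):
            left_sum += r[order[k - 1]]
            lo, hi = X[order[k - 1]][j], X[order[k]][j]
            if lo == hi:
                continue  # cannot split between identical feature values
            right_sum = total - left_sum
            # Maximizing this quantity is equivalent to minimizing squared error.
            gain = left_sum ** 2 / k + right_sum ** 2 / (n - k)
            if best is None or gain > best[0]:
                best = (gain, j, (lo + hi) / 2, left_sum / k, right_sum / (n - k))
    _, j, thr, left_val, right_val = best
    return j, thr, left_val, right_val

def stump_predict(stump, x):
    j, thr, left_val, right_val = stump
    return left_val if x[j] <= thr else right_val

def fit_boosting(X, y, n_rounds=100, lr=0.3):
    """Least-squares gradient boosting: each stump is fit to the current residuals."""
    f0 = sum(y) / len(y)
    pred = [f0] * len(y)
    stumps = []
    for _ in range(n_rounds):
        resid = [yi - pi for yi, pi in zip(y, pred)]  # negative gradient of squared loss
        stump = fit_stump(X, resid)
        stumps.append(stump)
        pred = [p + lr * stump_predict(stump, x) for p, x in zip(pred, X)]
    return f0, lr, stumps

def boosting_predict(model, x):
    f0, lr, stumps = model
    return f0 + lr * sum(stump_predict(s, x) for s in stumps)

def error_rate(predict, X, y):
    """Misclassification rate when scores of at least 0.5 are labeled as class 1."""
    wrong = sum((predict(x) >= 0.5) != (yi == 1) for x, yi in zip(X, y))
    return wrong / len(y)

def make_data(n, rng):
    # Synthetic stand-in data: class 1 when the two features sum to more than 1.
    X = [[rng.random(), rng.random()] for _ in range(n)]
    y = [1 if x[0] + x[1] > 1.0 else 0 for x in X]
    return X, y

rng = random.Random(0)
X_train, y_train = make_data(300, rng)
X_test, y_test = make_data(300, rng)

single = fit_stump(X_train, y_train)
boosted = fit_boosting(X_train, y_train)

stump_err = error_rate(lambda x: stump_predict(single, x), X_test, y_test)
boost_err = error_rate(lambda x: boosting_predict(boosted, x), X_test, y_test)
print(f"single stump misclassification rate: {stump_err:.3f}")
print(f"boosted stumps misclassification rate: {boost_err:.3f}")
```

A single axis-aligned split cannot follow the diagonal boundary, while the sum of many small stumps can approximate it, which is why the boosted model misclassifies fewer held-out points in this toy setting. Results on real student-retention data would depend on the variables and on how missing values are handled.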

What variables are important in predicting bovine viral diarrhea virus? A random forest approach
The results suggest that the use of machine learning algorithms, especially RF, is a promising methodology for the analysis of cross-sectional studies, presenting a satisfactory predictive power and the ability to identify predictors that represent potential risk factors for BVDV investigation.
Performances of machine learning algorithms for mapping fractional cover of an invasive plant species in a dryland ecosystem
It is demonstrated that MLAs can be successfully used to develop fractional cover maps of plant species, particularly IAPS, so as to design targeted and spatially explicit management strategies.
Contrasting determinants for the introduction and establishment success of exotic birds in Taiwan using decision trees models
The results showed that future success for introduction and establishment of exotic birds may be gauged by simply looking at previous success in invading other countries, and found that species traits related to reproduction were more important in establishment models than in introduction models.
Determinants of establishment success: Comparing alien and native freshwater fishes in Taiwan
It is suggested that aquaculture use, measures of body size, and the number of previously invaded countries may alert researchers and conservation managers to species with a high establishment potential, especially for countries with similar conditions as those in Taiwan.
A novel hybrid approach to flood susceptibility assessment based on machine learning and land use change. Case study: a river watershed in Vietnam
This study aims to develop a comprehensive approach including an analysis of the relationships between flood susceptibility and land-use change, based on the relevance vector machine (RVM)


A Comparison of Logistic Regression, Neural Networks, and Classification Trees Predicting Success of Actuarial Students
The authors extended previous research by re-examining the data using neural networks and classification trees, from Enterprise Miner, the SAS data mining package, which can provide a prediction of the dependent variable for all cases in the data set including those with missing values.
Greedy function approximation: A gradient boosting machine.
A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.
Applied Analytics Using SAS Enterprise Miner 5.3: Course Notes
SAS Institute, 2009