• Corpus ID: 19175060

Data-Driven Exploration of Factors Affecting Federal Student Loan Repayment

  title={Data-Driven Exploration of Factors Affecting Federal Student Loan Repayment},
  author={Bin Luo and Qi Zhang and Somya D. Mohanty},
Student loans occupy a significant portion of the federal budget, as well as, the largest financial burden in terms of debt for graduates. This paper explores data-driven approaches towards understanding the repayment of such loans. Using statistical and machine learning models on the College Scorecard Data, this research focuses on extracting and identifying key factors affecting the repayment of a student loan. The specific factors can be used to develop models which provide predictive… 

Figures and Tables from this paper

Modified Random Forest based Graduates Earning of Higher Education Mining

With the significant trend of students and families towards higher education and the great change in the labor market, great attention is paid to the issue of job opportunities and the earnings of

International Journal of Recent Technology and Engineering (IJRTE)

VMS-AES is a novel AES-like algorithm which uses Linear Feedback Shift Register (LFSR) to generate the required parameters used to remap the values of substitution box randomly to another location depending on a secret key and keeps the same good linear and differential properties of the AES S-box.



A Trillion Dollar Question: What Predicts Student Loan Delinquencies?

The recent significant increase in student loan delinquencies has generated interest in understanding the key factors predicting the non-performance of these loans. However, despite the large size of

Institutional Accountability: A Comparison of the Predictors of Student Loan Repayment and Default Rates

The federal government holds colleges accountable if too many of their students default on loan repayment, but the default measure traditionally used captures only a fraction of students who are

College on Credit: A Multilevel Analysis of Student Loan Default

This study updates and expands the literature on student loan default. By applying multilevel regression to the Beginning Postsecondary Students survey, four key findings emerge. First, attending

Understanding the Determinants of Debt Burden among College Graduates

This article examines debt burden among college graduates and contributes to previous research by incorporating institutional and state characteristics. Utilizing a combination of national datasets

A Crisis in Student Loans?: How Changes in the Characteristics of Borrowers and in the Institutions They Attended Contributed to Rising Loan Defaults

This paper examines the rise in student loan default and delinquency. It draws on a unique set of administrative data on federal student borrowing matched to earnings records from de-identified tax

Regularization and variable selection via the elastic net

It is shown that the elastic net often outperforms the lasso, while enjoying a similar sparsity of representation, and an algorithm called LARS‐EN is proposed for computing elastic net regularization paths efficiently, much like algorithm LARS does for the lamba.

Classification and Regression by randomForest

random forests are proposed, which add an additional layer of randomness to bagging and are robust against overfitting, and the randomForest package provides an R interface to the Fortran programs by Breiman and Cutler.

Regression Shrinkage and Selection via the Lasso

A new method for estimation in linear models called the lasso, which minimizes the residual sum of squares subject to the sum of the absolute value of the coefficients being less than a constant, is proposed.

Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties

In this article, penalized likelihood approaches are proposed to handle variable selection problems, and it is shown that the newly proposed estimators perform as well as the oracle procedure in variable selection; namely, they work as well if the correct submodel were known.

The Dantzig selector: Statistical estimation when P is much larger than n

The main point of the paper is accurate statistical estimation in high dimensions, which includes theoretical, practical and computational issues, and the Dantzig Selector compares with the Lasso.