Fast Estimation of Multinomial Logit Models: R Package mnlogit

  title={Fast Estimation of Multinomial Logit Models: R Package mnlogit},
  author={A. Hasan and Wang Zhiyu and Alireza S. Mahani},
  journal={arXiv: Computation},
We present R package mnlogit for training multinomial logistic regression models, particularly those involving a large number of classes and features. Compared to existing software, mnlogit offers speedups of 10x-50x for modestly sized problems and more than 100x for larger problems. Running mnlogit in parallel mode on a multicore machine gives an additional 2x-4x speedup on up to 8 processor cores. Computational efficiency is achieved by drastically speeding up calculation of the log… Expand
mixl: An open-source R package for estimating complex choice models on large datasets
It is shown that mixl is fast, easy to use, and scales to very large datasets, and some results using real world data and models are presented. Expand
Efficient Bayesian Modeling of Binary and Categorical Data in R: The UPG Package
We introduce the UPG package for highly efficient Bayesian inference in probit, logit, multinomial logit and binomial logit models. UPG offers a convenient estimation framework for balanced andExpand
Estimation of Random Utility Models in R: The mlogit Package
  • Y. Croissant
  • Computer Science, Mathematics
  • J. Stat. Softw.
  • 2020
mlogit is a package for R which enables the estimation of random utility models with choice situation and/or alternative specific variables. The main extensions of the basic multinomial modelExpand
Stochastic Newton Sampler: R Package sns
The R package sns implements Stochastic Newton Sampler (SNS), a Metropolis-Hastings Monte Carlo Markov Chain algorithm where the proposal density function is a multivariate Gaussian based on a local,Expand
Parameter estimation of multinomial logistic regression model using least absolute shrinkage and selection operator (LASSO)
Regression modeling was used to describe the relationship between the response variable and one or more predictor variables. For a categorical response variable, the logistic regression would be moreExpand
Constrained Statistical Inference for Categorical Data
Using real-world data from the Canadian Community Health Survey, the methodology of using constraints showed significant improvement on methodology that does not, which substantiates the added value of the work presented here. Expand
Stochastic gradient descent methods for estimation with large data sets
We develop methods for parameter estimation in settings with large-scale data sets, where traditional methods are no longer tenable. Our methods rely on stochastic approximations, which areExpand
Insights from kernel conditional-probability estimates into female labour force participation decision in the UK
The female labour force participation decision in the UK is a well-researched topic in empirical economics and econometrics. In this paper, using data from the UK Labour Force Survey in 2007, weExpand
Information-Theoretic Scoring Rules to Learn Additive Bayesian Network Applied to Epidemiology
The purpose of this paper is to present an R package abn that has an implementation of multiple frequentist scores and some realistic simulations that show its usability and performance. Expand
Phase II monitoring of generalized linear profiles using weighted likelihood ratio charts
A new control chart is developed based on the weighted likelihood ratio test, and it can be readily extended to other generalized profiles or profiles with random predictors if the likelihood function can be obtained. Expand


Regularization Paths for Generalized Linear Models via Coordinate Descent.
In comparative timings, the new algorithms are considerably faster than competing methods and can handle large problems and can also deal efficiently with sparse features. Expand
Multinomial logistic regression algorithm
The lower bound principle (introduced in Böhning and Lindsay 1988, Ann. Inst. Statist. Math., 40, 641–663), Böhning (1989, Biometrika, 76, 375–383) consists of replacing the second derivative matrixExpand
The VGAM Package for Categorical Data Analysis
Classical categorical regression models such as the multinomial logit and proportional odds models are shown to be readily handled by the vector generalized linear and additive model (VGLM/VGAM)Expand
Extended Model Formulas in R : Multiple Parts and Multiple Responses
Model formulas are the standard approach for specifying the variables in statistical models in the S language. Although being eminently useful in an extremely wide class of applications, they haveExpand
Trust Region Newton Method for Logistic Regression
This paper applies a trust region Newton method to maximize the log-likelihood of the logistic regression model, and extends the proposed method to large-scale L2-loss linear support vector machines (SVM). Expand
maxent: An R Package for Low-memory Multinomial Logistic Regression with Support for Semi-automated Text Classification
The focus of this maximum entropy classifier is to minimize memory consumption on very large datasets, particularly sparse document-term matrices represented by the tm text mining package. Expand
Discrete Choice Methods with Simulation
Discrete Choice Methods with Simulation by Kenneth Train has been available in the second edition since 2009 and contains two additional chapters, one on endogenous regressors and one on the expectation–maximization (EM) algorithm. Expand
A Numerical Study of the Limited Memory BFGS Method and the Truncated-Newton Method for Large Scale Optimization
This paper examines the numerical performances of two methods for large-scale optimization: a limited memory quasi-Newton method (L-BFGS), and a discrete truncated-Newton method (TN). Various ways ofExpand
Diagnostic Checking in Regression Relationships
A rich variety of diagnostic tests for these situations have been developed in the econometrics community, a collection of which has been implemented in the packages lmtest and strucchange covering the problems mentioned above. Expand
On the limited memory BFGS method for large scale optimization
The numerical tests indicate that the L-BFGS method is faster than the method of Buckley and LeNir, and is better able to use additional storage to accelerate convergence, and the convergence properties are studied to prove global convergence on uniformly convex problems. Expand