# Fast Estimation of Multinomial Logit Models: R Package mnlogit

@article{Hasan2014FastEO, title={Fast Estimation of Multinomial Logit Models: R Package mnlogit}, author={A. Hasan and Wang Zhiyu and Alireza S. Mahani}, journal={arXiv: Computation}, year={2014} }

We present R package mnlogit for training multinomial logistic regression models, particularly those involving a large number of classes and features. Compared to existing software, mnlogit offers speedups of 10x-50x for modestly sized problems and more than 100x for larger problems. Running mnlogit in parallel mode on a multicore machine gives an additional 2x-4x speedup on up to 8 processor cores. Computational efficiency is achieved by drastically speeding up calculation of the log…

#### 32 Citations

mixl: An open-source R package for estimating complex choice models on large datasets

- Computer Science
- 2021

It is shown that mixl is fast, easy to use, and scales to very large datasets, and some results using real world data and models are presented.

Efficient Bayesian Modeling of Binary and Categorical Data in R: The UPG Package

- Mathematics
- 2021

We introduce the UPG package for highly efficient Bayesian inference in probit, logit, multinomial logit and binomial logit models. UPG offers a convenient estimation framework for balanced and…

Estimation of Random Utility Models in R: The mlogit Package

- Computer Science, Mathematics
- J. Stat. Softw.
- 2020

mlogit is a package for R which enables the estimation of random utility models with choice situation and/or alternative specific variables. The main extensions of the basic multinomial model…

Stochastic Newton Sampler: R Package sns

- Mathematics, Computer Science
- 2015

The R package sns implements Stochastic Newton Sampler (SNS), a Metropolis-Hastings Monte Carlo Markov Chain algorithm where the proposal density function is a multivariate Gaussian based on a local,…

Parameter estimation of multinomial logistic regression model using least absolute shrinkage and selection operator (LASSO)

- Mathematics
- 2018

Regression modeling was used to describe the relationship between the response variable and one or more predictor variables. For a categorical response variable, the logistic regression would be more…

Constrained Statistical Inference for Categorical Data

- Computer Science
- 2020

Using real-world data from the Canadian Community Health Survey, the methodology of using constraints showed significant improvement on methodology that does not, which substantiates the added value of the work presented here.

Stochastic gradient descent methods for estimation with large data sets

- Mathematics
- 2015

We develop methods for parameter estimation in settings with large-scale data sets, where traditional methods are no longer tenable. Our methods rely on stochastic approximations, which are…

Insights from kernel conditional-probability estimates into female labour force participation decision in the UK

- Mathematics
- 2020

The female labour force participation decision in the UK is a well-researched topic in empirical economics and econometrics. In this paper, using data from the UK Labour Force Survey in 2007, we…

Information-Theoretic Scoring Rules to Learn Additive Bayesian Network Applied to Epidemiology

- Computer Science, Mathematics
- ArXiv
- 2018

The purpose of this paper is to present an R package abn that has an implementation of multiple frequentist scores and some realistic simulations that show its usability and performance.

Phase II monitoring of generalized linear profiles using weighted likelihood ratio charts

- Mathematics, Computer Science
- Comput. Ind. Eng.
- 2016

A new control chart is developed based on the weighted likelihood ratio test, and it can be readily extended to other generalized profiles or profiles with random predictors if the likelihood function can be obtained.

#### References

Showing 1-10 of 52 references

Regularization Paths for Generalized Linear Models via Coordinate Descent.

- Computer Science, Medicine
- Journal of statistical software
- 2010

In comparative timings, the new algorithms are considerably faster than competing methods and can handle large problems and can also deal efficiently with sparse features.

Multinomial logistic regression algorithm

- Mathematics
- 1992

The lower bound principle (introduced in Böhning and Lindsay 1988, Ann. Inst. Statist. Math., 40, 641–663), Böhning (1989, Biometrika, 76, 375–383) consists of replacing the second derivative matrix…

The VGAM Package for Categorical Data Analysis

- Mathematics
- 2010

Classical categorical regression models such as the multinomial logit and proportional odds models are shown to be readily handled by the vector generalized linear and additive model (VGLM/VGAM)…

Extended Model Formulas in R: Multiple Parts and Multiple Responses

- Mathematics
- 2010

Model formulas are the standard approach for specifying the variables in statistical models in the S language. Although being eminently useful in an extremely wide class of applications, they have…

Trust Region Newton Method for Logistic Regression

- Mathematics, Computer Science
- J. Mach. Learn. Res.
- 2008

This paper applies a trust region Newton method to maximize the log-likelihood of the logistic regression model, and extends the proposed method to large-scale L2-loss linear support vector machines (SVM).

maxent: An R Package for Low-memory Multinomial Logistic Regression with Support for Semi-automated Text Classification

- Computer Science
- R J.
- 2012

The focus of this maximum entropy classifier is to minimize memory consumption on very large datasets, particularly sparse document-term matrices represented by the tm text mining package.

Discrete Choice Methods with Simulation

- Computer Science
- 2016

Discrete Choice Methods with Simulation by Kenneth Train has been available in the second edition since 2009 and contains two additional chapters, one on endogenous regressors and one on the expectation–maximization (EM) algorithm.

A Numerical Study of the Limited Memory BFGS Method and the Truncated-Newton Method for Large Scale Optimization

- Mathematics, Computer Science
- SIAM J. Optim.
- 1991

This paper examines the numerical performances of two methods for large-scale optimization: a limited memory quasi-Newton method (L-BFGS), and a discrete truncated-Newton method (TN). Various ways of…

Diagnostic Checking in Regression Relationships

- Computer Science
- 2015

A rich variety of diagnostic tests for these situations have been developed in the econometrics community, a collection of which has been implemented in the packages lmtest and strucchange covering the problems mentioned above.

On the limited memory BFGS method for large scale optimization

- Mathematics, Computer Science
- Math. Program.
- 1989

The numerical tests indicate that the L-BFGS method is faster than the method of Buckley and LeNir, and is better able to use additional storage to accelerate convergence, and the convergence properties are studied to prove global convergence on uniformly convex problems.