Stephan Mandt

Stochastic Gradient Descent (SGD) is an important algorithm in machine learning. With constant learning rates, it is a stochastic process that, after an initial phase of convergence, generates samples from a stationary distribution. We show that SGD with constant rates can be effectively used as an approximate posterior inference algorithm for probabilistic…
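The core observation lends itself to a small demonstration. Below is a minimal sketch of constant-rate SGD on a toy 1-D Gaussian posterior, where the post-burn-in iterates are collected as approximate posterior samples. The model, learning rate, and batch size are illustrative choices, not the paper's exact scheme, and the spread of the iterates only matches the posterior variance when the rate is tuned as the paper prescribes.

```python
import numpy as np

# Toy conjugate model: theta ~ N(0, 1) prior, data y_i ~ N(theta, 1).
# Constant-rate SGD on the negative log-posterior keeps fluctuating
# around the mode after convergence; those iterates serve as rough
# posterior samples (illustrative sketch, not the paper's exact
# preconditioned scheme).
rng = np.random.default_rng(0)
N = 1000
data = rng.normal(0.5, 1.0, size=N)

def grad_minibatch(theta, batch):
    # Stochastic gradient of the negative log-posterior, with the
    # likelihood term rescaled from the minibatch to full-data size.
    return theta + N * np.mean(theta - batch)

theta = 0.0
lr = 1e-3                      # constant learning rate, never decayed
burn_in, n_samples = 2000, 5000
samples = []
for t in range(burn_in + n_samples):
    batch = rng.choice(data, size=32)
    theta -= lr * grad_minibatch(theta, batch)
    if t >= burn_in:           # discard the initial convergence phase
        samples.append(theta)

# Exact posterior for comparison: N(N*ybar/(N+1), 1/(N+1)).
post_mean = N * data.mean() / (N + 1)
print(f"SGD iterate mean {np.mean(samples):.3f} vs posterior mean {post_mean:.3f}")
```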
Variational inference (VI) combined with data subsampling enables approximate posterior inference with large data sets for otherwise intractable models, but suffers from poor local optima. We first formulate a deterministic annealing approach for the generic class of conditionally conjugate exponential family models. This algorithm uses a temperature…
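To convey the annealing idea, here is a toy deterministic-annealing sketch for a two-component Gaussian mixture: log-likelihoods are divided by a temperature T, which flattens the objective at high T and is cooled toward T = 1, smoothing away shallow local optima along the way. The schedule, fixed variances, and equal mixture weights are illustrative assumptions; the paper's algorithm applies the temperature inside conditionally conjugate variational updates.

```python
import numpy as np

rng = np.random.default_rng(1)
# Two well-separated 1-D clusters; the means start from a poor,
# nearly symmetric initialization.
x = np.concatenate([rng.normal(-2, 0.5, 200), rng.normal(2, 0.5, 200)])

mu = np.array([-0.1, 0.1])         # deliberately bad initialization
for T in [8.0, 4.0, 2.0, 1.0]:     # temperature schedule, cooled to T = 1
    for _ in range(50):
        # Tempered responsibilities: dividing log-likelihoods by T
        # makes assignments near-uniform at high T, then gradually
        # sharpens them as the system is cooled.
        logp = -0.5 * (x[:, None] - mu[None, :]) ** 2 / 0.25
        logp = logp / T
        r = np.exp(logp - logp.max(axis=1, keepdims=True))
        r /= r.sum(axis=1, keepdims=True)
        mu = (r * x[:, None]).sum(axis=0) / r.sum(axis=0)
print("recovered means:", mu)   # close to the true means (-2, 2)
```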
Word embeddings are a powerful approach for capturing semantic similarity among terms in a vocabulary. In this paper, we develop exponential family embeddings, a class of methods that extends the idea of word embeddings to other types of high-dimensional data. As examples, we studied neural data with real-valued observations, count data from a market…
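As a rough illustration of the idea for count data, the sketch below fits a Poisson embedding to toy market-basket counts: each item gets an embedding vector and a context vector, the natural parameter of an item's count is the inner product of its embedding with the summed context vectors of the other items in the basket, and both sets of vectors are fit by gradient ascent on the Poisson log-likelihood. The shapes, initialization, and full-batch update are assumptions for the demo, not the paper's exact objective.

```python
import numpy as np

rng = np.random.default_rng(2)
n_baskets, n_items, K = 500, 20, 5
counts = rng.poisson(1.0, size=(n_baskets, n_items)).astype(float)

rho = 0.1 * rng.normal(size=(n_items, K))    # item embeddings
alpha = 0.1 * rng.normal(size=(n_items, K))  # item context vectors
lr = 1e-4

for epoch in range(300):
    ctx = counts @ alpha                     # (B, K) whole-basket context
    # Exclude item i's own contribution from its context.
    ctx_i = ctx[:, None, :] - counts[..., None] * alpha[None, :, :]  # (B, I, K)
    eta = np.einsum('bik,ik->bi', ctx_i, rho)   # natural parameters
    lam = np.exp(np.clip(eta, -8.0, 8.0))       # Poisson rates
    resid = counts - lam                        # d log p / d eta for Poisson
    # Chain rule through eta for both parameter sets.
    grad_rho = np.einsum('bi,bik->ik', resid, ctx_i)
    s = resid @ rho                             # (B, K)
    grad_alpha = counts.T @ s - (counts * resid).sum(0)[:, None] * rho
    rho += lr * grad_rho / n_baskets
    alpha += lr * grad_alpha / n_baskets
```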
Among the goals of statistical genetics is to find associations between genetic data and binary phenotypes, such as heritable diseases. Often, the data are obfuscated by confounders such as age, ethnicity, or population structure. Linear mixed models are linear regression models that correct for confounding by means of correlated label noise; they are…
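The correction mechanism can be illustrated with a Gaussian toy model. The sketch below draws labels with correlated noise governed by a kinship-style matrix K and recovers the regression weights by generalized least squares under the known covariance. In practice the variance components are estimated (e.g. by restricted maximum likelihood), and the paper's setting involves binary phenotypes, which this Gaussian example does not handle.

```python
import numpy as np

rng = np.random.default_rng(3)
n, p = 200, 3
X = rng.normal(size=(n, p))          # genotypes / covariates
# Kinship-style matrix encoding relatedness; a random PSD toy here.
A = rng.normal(size=(n, n))
K = A @ A.T / n
beta_true = np.array([1.0, 0.0, -0.5])
sg2, se2 = 0.5, 1.0                  # variance components (assumed known)
V = sg2 * K + se2 * np.eye(n)        # correlated label noise covariance
y = X @ beta_true + rng.multivariate_normal(np.zeros(n), V)

# Generalized least squares: beta_hat = (X' V^-1 X)^-1 X' V^-1 y,
# i.e. ordinary regression after whitening by the noise covariance.
Vinv_X = np.linalg.solve(V, X)
Vinv_y = np.linalg.solve(V, y)
beta_hat = np.linalg.solve(X.T @ Vinv_X, X.T @ Vinv_y)
print("GLS estimate:", beta_hat)     # close to beta_true
```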