Eigenvector method for umbrella sampling enables error analysis.

  title={Eigenvector method for umbrella sampling enables error analysis.},
  author={Erik H. Thiede and Brian Van Koten and Jonathan Weare and Aaron R. Dinner},
  journal={The Journal of chemical physics},
  volume={145 8},
Umbrella sampling efficiently yields equilibrium averages that depend on exploring rare states of a model by biasing simulations to windows of coordinate values and then combining the resulting data with physical weighting. Here, we introduce a mathematical framework that casts the step of combining the data as an eigenproblem. The advantage to this approach is that it facilitates error analysis. We discuss how the error scales with the number of windows. Then, we derive a central limit theorem… 

Figures from this paper

Umbrella sampling: a powerful method to sample tails of distributions

The umbrella sampling technique can be used to sample extremely low probability areas of the posterior distribution that may be required in statistical analyses of data and allows a considerably more robust sampling of multi-modal distributions compared to the standard sampling methods.

Understanding the Sources of Error in MBAR through Asymptotic Analysis

Multiple sampling strategies commonly used in molecular dynamics, such as umbrella sampling and alchemical free energy methods, involve sampling from multiple thermodynamic states. Commonly, the data

Stratification as a General Variance Reduction Method for Markov Chain Monte Carlo

It is shown that EMUS can be dramatically more efficient than direct MCMC when the target distribution is multimodal or when the goal is to compute tail probabilities.

Stratified UWHAM and Its Stochastic Approximation for Multicanonical Simulations Which Are Far from Equilibrium.

We describe a new analysis tool called Stratified unbinned Weighted Histogram Analysis Method (Stratified-UWHAM), which can be used to compute free energies and expectations from a multicanonical

Accurate Modeling of Grazing Transits Using Umbrella Sampling

Grazing transits present a special problem for statistical studies of exoplanets. Even though grazing planetary orbits are rare (due to geometric selection effects), for many low to moderate

Learning Optimal Flows for Non-Equilibrium Importance Sampling

This work shows how to use deep learning to represent the velocity field by a neural network and train it towards the zero variance optimum, and compares the performances of NEIS with those of Neal’s annealed importance sampling (AIS).

Forecasting using neural networks and short-trajectory data

This work develops an approach to solve Feynman-Kac equations by training neural networks on short-trajectory data using a low-dimensional model that facilitates visualization and motivates an adaptive sampling strategy that allows on-the-fly identification of and addition of data to regions important for predicting the statistics of interest.

Understanding and eliminating spurious modes in variational Monte Carlo using collective variables

It is demonstrated that a collective-variable-based penalization yields a substantially more robust training procedure, preventing the formation of spurious modes and improving the accuracy of energy estimates.

Active Importance Sampling for Variational Objectives Dominated by Rare Events: Consequences for Optimization and Generalization

This work introduces an approach that combines rare events sampling techniques with neural network optimization to optimize objective functions that are dominated by rare events, and shows that importance sampling reduces the asymptotic variance of the solution to a learning problem, suggesting benefits for generalization.

Long-Time-Scale Predictions from Short-Trajectory Data: A Benchmark Analysis of the Trp-Cage Miniprotein.

This work presents a new projection of the reactive current onto collective variables and provides improved estimators for rates and committors, and presents simple procedures for constructing suitable smoothly varying basis functions from arbitrary molecular features.



Calculation of free energy through successive umbrella sampling.

An implementation of umbrella sampling in which the pertinent range of states is subdivided into small windows that are sampled consecutively and linked together is considered, which is comparable to a multicanonical simulation with a very good weight function.

Optimal estimators and asymptotic variances for nonequilibrium path-ensemble averages.

A general minimal-variance estimator is derived that can combine nonequ equilibrium trajectory data sampled from multiple path-ensembles to estimate arbitrary functions of nonequilibrium expectations and develop asymptotic variance estimates pertaining to Jarzynski's equality for free energies and the Hummer-Szabo expressions for the potential of mean force.

Statistically optimal analysis of samples from multiple equilibrium states.

A new estimator for computing free energy differences and thermodynamic expectations as well as their uncertainties from samples obtained from multiple equilibrium states via either simulation or experiment is presented, which has significant advantages over multiple histogram reweighting methods for combining data from multiple states.

xTRAM: Estimating equilibrium expectations from time-correlated simulation data at multiple thermodynamic states

The expanded TRAM (xTRAM) estimator is formulated, shown to be asymptotically unbiased and a generalization of MBAR, and a random-swapping simulation protocol is introduced that can be used with xTRAM, gaining orders-of-magnitude advantages over simulation protocols that require the constraint of sampling from a global equilibrium.

On a Likelihood Approach for Monte Carlo Integration

The use of estimating equations has been a common approach for constructing Monte Carlo estimators. Recently, Kong et al. proposed a formulation of Monte Carlo integration as a statistical model,

Convergence and error estimation in free energy calculations using the weighted histogram analysis method

The challenges of obtaining fast and accurate solutions of the coupled nonlinear WHAM equations, quantifying the statistical errors of the resulting free energies, of diagnosing possible systematic errors, and of optimally allocating of the computational resources are addressed.

Self-Learning Adaptive Umbrella Sampling Method for the Determination of Free Energy Landscapes in Multiple Dimensions.

This work presents an efficient automatized umbrella sampling strategy for calculating multidimensional potential of mean force and demonstrates that a significant smaller number of umbrella windows needs to be employed to characterize the free energy landscape over the most relevant regions without any loss in accuracy.

Theory of binless multi-state free energy estimation with applications to protein-ligand binding.

It is shown that binless statistical analysis can accurately treat sparsely distributed interaction energy samples as obtained from unmodified interaction potentials that cannot be properly analyzed using standard binning methods, suggesting that bin less multi-state analysis of binding free energy simulations with unmodified potentials offers a straightforward alternative to the use of soft-core potentials for these alchemical transformations.

Free energies from dynamic weighted histogram analysis using unbiased Markov state model.

The dynamic histogram analysis method (DHAM) is developed, which finds that DHAM gives accurate free energies even in cases where WHAM fails, and may also prove useful in the construction of Markov state models from biased simulations in phase-space regions with otherwise low population.