Bayesball: A Bayesian Hierarchical Model for Evaluating Fielding in Major League Baseball

  title={Bayesball: A Bayesian Hierarchical Model for Evaluating Fielding in Major League Baseball},
  author={Shane T. Jensen and Kenneth E. Shirley and Abraham J. Wyner},
  journal={The Annals of Applied Statistics},
The use of statistical modeling in baseball has received substantial attention recently in both the media and academic community. We focus on a relatively under-explored topic: the use of statistical models for the analysis of fielding based on high-resolution data consisting of on-field location of batted balls. We combine spatial modeling with a hierarchical Bayesian structure in order to evaluate the performance of individual fielders while sharing information between fielders at each… 
Hierarchical Bayesian Modeling of Hitting Performance in Baseball
A sophisticated statistical model for predicting the hitting performance of Major League baseball players is developed using the Bayesian paradigm and shares information across time and across players by using mixture distributions to control shrinkage for improved accuracy.
Estimating Fielding Ability in Baseball Players Over Time
Abstract Quantitative evaluation of fielding ability in baseball has been an ongoing challenge for statisticians. Detailed recording of ball-in-play data in recent years has spurred the development
A Survey of Methods for the Statistical Evaluation of Defensive Ability in Major League Baseball
We review the advances in evaluating defensive ability among Major League Baseball players, from the early part of the previous century to the most recent developments. Most early attempts to measure
Quantifying the Effect of The Shift in Major League Baseball
Baseball is a very strategic and abstract game, but the baseball world is strangely obsessed with statistics. Modern mainstream statisticians often study offensive data, such as batting average or
Assessing the impact of fielding in Twenty20 cricket
This paper attempts to quantify the importance of fielding in Twenty20 cricket. We introduce the metric of expected runs saved due to fielding which is both interpretable and is directly relevant to
Stochastic Modeling and Optimization in Baseball
This article reviews operations research techniques that can be and are used to address common decision making problems in baseball, focusing on those geared toward improving a team's probability of
A hierarchical Bayesian model of pitch framing
A Bayesian hierarchical model is introduced to estimate each umpire’s probability of calling a strike, adjusting for the pitch participants, pitch location, and contextual information like the count, which is able to translate into average runs saved across a season.
openWAR: An open source system for evaluating overall player performance in major league baseball
Abstract Within sports analytics, there is substantial interest in comprehensive statistics intended to capture overall player performance. In baseball, one such measure is wins above replacement
A probabilistic approach to the analysis of a volleyball set performance
A probabilistic model for predicting the score of a volleyball set that considers the players’ initial positions on court, and the position rotations required by the rules of the game, can be used as a support-decision tool to assist coaches and managers in selecting the optimal team formation and game tactics, leading to better utilization of resources and an improved success rate.


A Spatial Analysis of Basketball Shot Chart Data
H hierarchical spatial models are developed for shot-chart data, which allow for spatially varying effects of covariates and permit differential smoothing of the fitted surface in two spatial directions, which naturally correspond to polar coordinates.
Baseball Errors
This paper examines the factors that determine the rate of errors called in major league baseball, and changes in the error rate over time. We find that (1) the rate of errors depends on the quality
A State-Space Model for National Football League Scores
A predictive model for National Football League game scores using data from the period 1988–1993 is developed to obtain plausible inferences concerning team strengths and other model parameters, and to predict future game outcomes.
Bayesian data analysis.
  • J. Kruschke
  • Political Science
    Wiley interdisciplinary reviews. Cognitive science
  • 2010
A fatal flaw of NHST is reviewed and some benefits of Bayesian data analysis are introduced and illustrative examples of multiple comparisons in Bayesian analysis of variance and Bayesian approaches to statistical power are presented.
Bayesian analysis of binary and polychotomous response data
Abstract A vast literature in statistics, biometrics, and econometrics is concerned with the analysis of binary and polychotomous response data. The classical approach fits a categorical response
Conservative prior distributions for variance parameters in hierarchical models
Bayesian hierarchical models typically involve specifying prior distributions for one or more variance components. This is rather removed from the observed data, so specification based on expert
Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images
  • S. Geman, D. Geman
  • Physics
    IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 1984
The analogy between images and statistical mechanics systems is made and the analogous operation under the posterior distribution yields the maximum a posteriori (MAP) estimate of the image given the degraded observations, creating a highly parallel ``relaxation'' algorithm for MAP estimation.
Ultimate zone rating
  • The Baseball Think Factory, March
  • 2003
Ultimate zone rating. The Baseball Think Factory
  • Ultimate zone rating. The Baseball Think Factory
  • 2003