Numerically stable, single-pass, parallel statistics algorithms
@article{Bennett2009NumericallySS,
title={Numerically stable, single-pass, parallel statistics algorithms},
author={J. Bennett and R. Grout and P. P{\'e}bay and D. Roe and D. Thompson},
journal={2009 IEEE International Conference on Cluster Computing and Workshops},
year={2009},
pages={1-8}
}Statistical analysis is widely used for countless scientific applications in order to analyze and infer meaning from data. A key challenge of any statistical analysis package aimed at large-scale, distributed data is to address the orthogonal issues of parallel scalability and numerical stability. In this paper we derive a series of formulas that allow for single-pass, yet numerically robust, pairwise parallel and incremental updates of both arbitrary-order centered statistical moments and co… CONTINUE READING
Figures, Tables, and Topics from this paper.
49 Citations
A Survey and Recommendations for Distributed, Parallel, Single Pass, Incremental Bayesian Classification Based on MapReduce for Big Data
- Computer Science
- 2017
Highly Influenced
Characterization of multiphase flows integrating X-ray imaging and virtual reality
- Computer Science
- 2017
Open Access
References
Publications referenced by this paper.
SHOWING 1-10 OF 13 REFERENCES
Note on a Method for Calculating Corrected Sums of Squares and Products
- Mathematics
- 1962
389
Open Access
Updating formulae and a pairwise algorithm for computing sample variances
- Mathematics
- 1979
133
Open Access
Design patterns: elements of reuseable object-oriented software
- Computer Science
- 1994
16,029
Open Access
Computing higher-order moments online
- 2008
Dns of a turbulent lifted ethylene/air jet flame in an auto-ignitive coflow -stabilization and flame structure
- 2008








