Numerically stable, single-pass, parallel statistics algorithms

@article{Bennett2009NumericallySS,
  title={Numerically stable, single-pass, parallel statistics algorithms},
  author={J. Bennett and R. Grout and P. P{\'e}bay and D. Roe and D. Thompson},
  journal={2009 IEEE International Conference on Cluster Computing and Workshops},
  year={2009},
  pages={1-8}
}
  • J. Bennett, R. Grout, +2 authors D. Thompson
  • Published 2009
  • Computer Science
  • 2009 IEEE International Conference on Cluster Computing and Workshops
  • Statistical analysis is widely used for countless scientific applications in order to analyze and infer meaning from data. A key challenge of any statistical analysis package aimed at large-scale, distributed data is to address the orthogonal issues of parallel scalability and numerical stability. In this paper we derive a series of formulas that allow for single-pass, yet numerically robust, pairwise parallel and incremental updates of both arbitrary-order centered statistical moments and co… CONTINUE READING
    MapReduce Based Classification for Fault Detection in Big Data Applications
    Novel Techniques for Efficient and Effective Subgroup Discovery
    Open Access
    Large Scale Data Analysis
    2D and 3D tracking and modelling.
    Open Access

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 13 REFERENCES
    Map-Reduce for Machine Learning on Multicore
    857
    Open Access
    Note on a Method for Calculating Corrected Sums of Squares and Products
    389
    Open Access
    Updating formulae and a pairwise algorithm for computing sample variances
    133
    Open Access
    TORQUE resource manager
    276
    Open Access
    Computing higher-order moments online
    • 2008
    Dns of a turbulent lifted ethylene/air jet flame in an auto-ignitive coflow -stabilization and flame structure
    • 2008