Preprocessing of centred logratio transformed density functions using smoothing splines

@article{Machalov2015PreprocessingOC,
  title={Preprocessing of centred logratio transformed density functions using smoothing splines},
  author={Jitka Machalov{\'a} and Karel Hron and Gianna Serafina Monti},
  journal={Journal of Applied Statistics},
  year={2015},
  volume={43},
  pages={1419 - 1435}
}
With large-scale database systems, statistical analysis of data, occurring in the form of probability distributions, becomes an important task in explorative data analysis. Nevertheless, due to specific properties of density functions, their proper statistical treatment of these data still represents a challenging task in functional data analysis. Namely, the usual metric does not fully accounts for the relative character of information, carried by density functions; instead, their geometrical… 
Compositional splines for representation of density functions
TLDR
A new class of spline functions, called compositional splines, are built which can approximate probability density functions in a consistent way and statistical processing of densities using the new approximation tool is demonstrated in case of simplicial functional principal component analysis with anthropometric data.
Simplicial splines for representation of density functions
TLDR
A new class of spline functions, called simplicial splines, are built, which can approximate probability density functions in a consistent way, using the Bayes space methodology of density functions.
Changing reference measure in Bayes spaces with applications to functional data analysis
Probability density functions (PDFs) can be understood as continuous compositions by the theory of Bayes spaces. The origin of a Bayes space is determined by a given reference measure. This can be
Weighting the domain of probability densities in functional data analysis
In functional data analysis, some regions of the domain of the functions can be of more interest than others owing to the quality of measurement, relative scale of the domain, or simply some external
Compositional regression with functional response
Nonparametric Forecasting of Multivariate Probability Density Functions
TLDR
A novel nonparametric framework for modelling a time series of copula probability density functions is proposed, which allows to forecast the entire function without the need of post-processing procedures to grant positiveness and unit integral.
Logratio Approach to Distributional Modeling
TLDR
A unifying framework for the discrete and the continuous case based on the theory of Bayes spaces is presented, and it turns out that the centered logratio transformation is a convenient tool for practical computations.
Bivariate Densities in Bayes Spaces: Orthogonal Decomposition and Spline Representation
A new orthogonal decomposition for bivariate probability densities embedded in Bayes Hilbert spaces is derived. It allows one to represent a density into independent and interactive parts, the former
M E ] 1 9 M ar 2 01 8 Nonparametric forecasting of multivariate probability density functions
TLDR
A novel nonparametric framework for modelling a time series of copula probability density functions, which allows to forecast the entire function without the need of post-processing procedures to grant positiveness and unit integral is proposed.
...
...

References

SHOWING 1-10 OF 41 REFERENCES
Dimensionality reduction when data are density functions
  • P. Delicado
  • Computer Science
    Comput. Stat. Data Anal.
  • 2011
Principal component analysis for compositional data with outliers
TLDR
It turns out that the procedure using ilr‐transformed data and robust PCA delivers superior results to all other approaches, demonstrating that due to the compositional nature of geochemical data PCA should be carried out without an appropriate transformation.
Density Estimation : New Spline Approaches and a Partial Review
Apart from kernel estimators, there have been quite a few different approaches of “generalized splines” for density estimation. In the present paper, Maximum Penalized Likelihood (mpl) approaches are
Density estimation via hybrid splines
TLDR
The Hybrid Spline method (H-spline) is a method of density estimation which involves regression splines and smoothing splines methods and simulations suggest that with more structured data (e.g., several modes) H-splines method estimates the modes as well as Logspline.
Density Estimation, Stochastic Processes and Prior Information
SUMMARY A method is proposed for the non-parametric estimation of a probability density, based upon a finite number of observations and prior information about the smoothness of the density. A
Bayes Hilbert Spaces
A Bayes linear space is a linear space of equivalence classes of proportional σ‐finite measures, including probability measures. Measures are identified with their density functions. Addition is
Hilbert Space of Probability Density Functions Based on Aitchison Geometry
The set of probability functions is a convex subset of L1 and it does not have a linear space structure when using ordinary sum and multiplication by real constants. Moreover, difficulties arise when
Functional Data Analysis
  • H. Müller
  • Computer Science
    International Encyclopedia of Statistical Science
  • 2011
TLDR
An overview of functional data analysis is provided, starting with simple statistical notions such as mean and covariance functions, then covering some core techniques, the most popular of which is functional principal component analysis (FPCA).
Estimating Distributions with a Fixed Number of Modes
A new approach for non- or semi-parametric density estimation allows to specify modes and antimodes. The new smoother is a Maximum Penalized Likelihood (MPL) estimate with a novel roughness penalty.
Bayesian-multiplicative treatment of count zeros in compositional data sets
Compositional count data are discrete vectors representing the numbers of outcomes falling into any of several mutually exclusive categories. Compositional techniques based on the log-ratio
...
...