Interpretation of Compositional Regression with Application to Time Budget Analysis

@article{Muller2016InterpretationOC,
  title={Interpretation of Compositional Regression with Application to Time Budget Analysis},
  author={Ivo Muller and Karel Hron and Eva Fi{\vs}erov{\'a} and Jan {\vS}mahaj and Panajotis Cakirpaloglu and Jana Van{\vc}{\'a}kov{\'a}},
  journal={arXiv: Statistics Theory},
  year={2016}
}
Regression with compositional response or covariates, or even regression between parts of a composition, is frequently employed in social sciences. Among other possible applications, it may help to reveal interesting features in time allocation analysis. As individual activities represent relative contributions to the total amount of time, statistical processing of raw data (frequently represented directly as proportions or percentages) using standard methods may lead to biased results… 

Figures and Tables from this paper

Regression analysis with compositional data using orthogonal log-ratio coordinates
TLDR
The log-ratio approach based on orthogonal log-Ratio coordinates is adopted to show how it can lead to considerable improvements in the interpretation of the results of regression modeling with compositional data, both as explanatory or response variables.
On interpretations of tests and effect sizes in regression models with a compositional predictor
Compositional data analysis is concerned with the relative importance of positive variables, expressed through their log-ratios. The literature has proposed a range of manners to compute log-ratios,
Impact of Covariates in Compositional Models and Simplicial Derivatives
In the framework of Compositional Data Analysis, vectors carrying relative information, also called compositional vectors, can appear in regression models either as dependent or as explanatory
Robust regression with compositional covariates including cellwise outliers
TLDR
Simulations show that the proposed procedure generally outperforms a traditional rowwise-only robust regression method (MM-estimator) and is preferable for interpretation through the use of appropriate coordinate systems for compositional data.
Cox regression survival analysis with compositional covariates: Application to modelling mortality risk from 24-h physical activity patterns
TLDR
This work introduces a formulation of the Cox regression model in terms of log-ratio coordinates which suitably deals with the constraints of compositional covariates, facilitates the use of common statistical inference methods, and allows for scientifically meaningful interpretations.
Analyzing the impacts of socio-economic factors on French departmental elections with CoDa methods
ABSTRACT The vote shares by party on a given subdivision of a territory form a vector called composition (mathematically, a vector belonging to a simplex). It is interesting to model these shares and
Analysing Pairwise Logratios Revisited
TLDR
Backward pivot coordinates is proposed, where each pairwise logratio is linked to one orthogonal coordinate system, and these systems are then used together to produce a concise output to discuss grain size control of the element composition of sediments.
Robust Compositional Analysis of Physical Activity and Sedentary Behaviour Data
TLDR
The findings suggested that replacing time spent in SB with vigorous PA may be a powerful tool against adolescents’ obesity.
Log-ratio transformations for dietary compositions: numerical and conceptual questions
TLDR
The log-ratio transformation of dietary data has both numerical and conceptual advantages, and overcomes the drawbacks of traditional substitution models.
“The Statistical Analysis of Compositional Data” by John Aitchison (1986): A Bibliometric Overview
This paper presents a complete bibliometric analysis of Aitchison’s 1986 seminal book “The Statistical Analysis of Compositional Data.” We have set three objectives. The first is to analyze the
...
...

References

SHOWING 1-10 OF 53 REFERENCES
Classical and robust orthogonal regression between parts of compositional data
ABSTRACT The different parts (variables) of a compositional data set cannot be considered independent from each other, since only the ratios between the parts constitute the relevant information to
Linear regression with compositional explanatory variables
TLDR
An approach based on the isometric logratio (ilr) transformation is used and it turns out that the resulting model is easy to handle, and that parameter estimation can be done in like in usual linear regression.
Regression analysis of compositional data when both the dependent variable and independent variable are components
It is well known that regression analyses involving compositional data need special attention because the data are not of full rank. For a regression analysis where both the dependent and independent
On the Interpretation of Orthonormal Coordinates for Compositional Data
The simplex with the Aitchison geometry is a natural sample space for compositional data, that is, observations carrying only relative information (especially proportions, percentages, etc., often
Principal component analysis for compositional data with outliers
TLDR
It turns out that the procedure using ilr‐transformed data and robust PCA delivers superior results to all other approaches, demonstrating that due to the compositional nature of geochemical data PCA should be carried out without an appropriate transformation.
Biplots of Compositional Data
Summary. The singular value decomposition and its interpretation as a linear biplot have proved to be a powerful tool for analysing many forms of multivariate data. Here we adapt biplot methodology
Isometric Logratio Transformations for Compositional Data Analysis
TLDR
An important result is the decomposition of the simplex, as a vector space, into orthogonal subspaces associated with nonoverlapping subcompositions, which gives the key to join compositions with different parts into a single composition by using a balancing element.
PLS‐DA for compositional data with application to metabolomics
When quantifying information in metabolomics, the results are often expressed as data carrying only relative information. Vectors of these data have positive components, and the only relevant
...
...