Partial Distance Correlation with Methods for Dissimilarities

  title={Partial Distance Correlation with Methods for Dissimilarities},
  author={G{\'a}bor J. Sz{\'e}kely and Maria L. Rizzo},
  journal={arXiv: Methodology},
Distance covariance and distance correlation are scalar coefficients that characterize independence of random vectors in arbitrary dimension. Properties, extensions, and applications of distance correlation have been discussed in the recent literature, but the problem of defining the partial distance correlation has remained an open question of considerable interest. The problem of partial distance correlation is more complex than partial correlation partly because the squared distance… Expand

Figures and Tables from this paper

Partial Distance Correlation
Partial distance correlation measures association between two random vectors with respect to a third random vector, analogous to, but more general than (linear) partial correlation. DistanceExpand
Fast Computing for Distance Covariance
It is shown that the computation of distance covariance and distance correlation of real-valued random variables can be implemented by an O(nlog n) algorithm and this is comparable to other computationally efficient algorithms. Expand
Energy distance
Energy distance is a metric that measures the distance between the distributions of random vectors. Energy distance is zero if and only if the distributions are identical, thus it characterizesExpand
Distance Metrics for Measuring Joint Dependence with Application to Causal Inference
Abstract Many statistical applications require the quantification of joint dependence among more than two random vectors. In this work, we generalize the notion of distance covariance to quantifyExpand
Distance-based and RKHS-based dependence metrics in high dimension
In this paper, we study distance covariance, Hilbert-Schmidt covariance (aka Hilbert-Schmidt independence criterion [Gretton et al. (2008)]) and related independence tests under the high dimensionalExpand
Estimating Feature-Label Dependence Using Gini Distance Statistics
This paper presents a framework for estimating dependence between numerical features and a categorical label using generalized Gini distance, an energy distance in reproducing kernel Hilbert spaces (RKHS). Expand
On Possibilistic Version of Distance Covariance and Correlation
Distance correlation is a relatively new measure of dependence in probability theory and statistics, which has the great advantage that it gives zero if and only if the variables are independent. InExpand
Distance correlation coefficients for Lancaster distributions
This work derives under mild convergence conditions a general series representation for the distance covariance and distance correlation coefficients for the bivariate normal distribution and its generalizations of Lancaster type, the multivariate normal distributions, and the b correlations between random vectors whose joint distributions belong to the class of Lancaster distributions. Expand
The Chi-Square Test of Distance Correlation
This work proposes a centered chi-square distribution, demonstrates it well-approximates the null distribution of unbiased distance correlation, and proves upper tail dominance and distribution bound between them, which is provably the most powerful test of distance correlation among all valid tests with known distribution. Expand
Asymptotic Distributions of High-Dimensional Nonparametric Inference with Distance Correlation
Understanding the nonlinear association between a pair of potentially high-dimensional random vectors is encountered frequently in many contemporary big data applications. Distance correlation hasExpand


Measuring and testing dependence by correlation of distances
Distance correlation is a new measure of dependence between random vectors. Distance covariance and distance correlation are analogous to product-moment covariance and correlation, but unlike theExpand
The distance correlation t-test of independence in high dimension
A modified distance correlation statistic is proposed, such that under independence the distribution of a transformation of the statistic converges to Student t, as dimension tends to infinity, and the resulting t-test is unbiased for every sample size greater than three and all significance levels. Expand
Brownian distance covariance
Distance correlation is a new class of multivariate dependence coefficients applicable to random vectors of arbitrary and not necessarily equal dimension. Distance covariance and distance correlationExpand
On the uniqueness of distance covariance
Distance covariance and distance correlation are non-negative real numbers that characterize the independence of random vectors in arbitrary dimensions. In this work we prove that distance covarianceExpand
The affinely invariant distance correlation
Szekely, Rizzo and Bakirov (Ann. Statist. 35 (2007) 2769–2794) and Szekely and Rizzo (Ann. Appl. Statist. 3 (2009) 1236–1265), in two seminal papers, introduced the powerful concept of distanceExpand
Equivalence of distance-based and RKHS-based statistics in hypothesis testing
It is shown that the energy distance most commonly employed in statistics is just one member of a parametric family of kernels, and that other choices from this family can yield more powerful tests. Expand
Measures of dependence between random vectors and tests of independence. Literature review
Simple correlation coefficients between two variables have been generalized to measure association between two matrices in many ways. Coefficients such as the RV coefficient, the distance covarianceExpand
Summary This paper investigates the roles of partial correlation and conditional correlation as measures of the conditional independence of two random variables. It first establishes a sufficientExpand
Some distance properties of latent root and vector methods used in multivariate analysis
SUMMARY This paper is concerned with the representation of a multivariate sample of size n as points P1, P2, ..., PI in a Euclidean space. The interpretation of the distance A(Pi, Pj) between the ithExpand
Multiple regression and correlation extensions of the mantel test of matrix correspondence
It is often necessary in population biology to compare two sets of distance measures. These measures can be based on genetic markers, morphological traits, geographic separation, ecologicalExpand