Standardization of Weighted Ranking Correlation Coefficients

arXiv:2504.08428v2 Announce Type: replace-cross
Abstract: A fundamental problem in statistics is measuring the correlation between two rankings of a set of items. Kendall’s $tau$ and Spearman’s $rho$ are well established correlation coefficients whose symmetric structure guarantees zero expected value between two rankings randomly chosen with uniform probability. In many modern applications, however, greater importance is assigned to top-ranked items, motivating weighted variants of these coefficients. Such weighting schemes generally break the symmetry of the original formulations, resulting in a non-zero expected value under independence and compromising the interpretation of zero correlation. We propose a general standardization function $g(cdot)$ that transforms a ranking correlation coefficient $Gamma$ into a standardized form $g(Gamma)$ with zero expected value under randomness. The transformation preserves the domain $[-1,1]$, satisfies the boundary conditions, is continuous and increasing, and reduces to the identity for coefficients that already satisfy the zero-expected-value property. The construction of $g(x)$ depends on three distributional parameters of $Gamma$: its mean, variance, and left variance; since their exact calculation becomes infeasible for large ranking lengths $n$, we develop accurate numerical estimates based on Monte Carlo sampling combined with polynomial regression to capture their dependence on $n$.

Liked Liked