Two-sample statistics based on anisotropic kernels

Cheng, Xiuyuan; Cloninger, Alexander; Coifman, Ronald R.

doi:10.1093/imaiai/iaz018

Citation Details

Two-sample statistics based on anisotropic kernels

Abstract

The paper introduces a new kernel-based Maximum Mean Discrepancy (MMD) statistic for measuring the distance between two distributions given finitely many multivariate samples. When the distributions are locally low-dimensional, the proposed test can be made more powerful to distinguish certain alternatives by incorporating local covariance matrices and constructing an anisotropic kernel. The kernel matrix is asymmetric; it computes the affinity between $n$ data points and a set of $n_R$ reference points, where $n_R$ can be drastically smaller than $n$. While the proposed statistic can be viewed as a special class of Reproducing Kernel Hilbert Space MMD, the consistency of the test is proved, under mild assumptions of the kernel, as long as $\|p-q\| \sqrt{n} \to \infty $, and a finite-sample lower bound of the testing power is obtained. Applications to flow cytometry and diffusion MRI datasets are demonstrated, which motivate the proposed approach to compare distributions.

Award ID(s):: 1818945 1819222

NSF-PAR ID:: 10126907

Author(s) / Creator(s):: Cheng, Xiuyuan ; Cloninger, Alexander ; Coifman, Ronald R.

Publisher / Repository:: Oxford University Press

Date Published:: 2019-12-10

Journal Name:: Information and Inference: A Journal of the IMA

ISSN:: 2049-8764

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1093/imaiai/iaz018

More Like this