Statistical Optimal Transport via Factored Couplings

Forrow, Aden; Huetter, Jan-Christian; Nitzan, Mor; Rigollet, Philippe; Schiebinger, Geoffrey; Weed, Jonathan

We propose a new method to estimate Wasserstein distances and optimal transport plans between two probability distributions from samples in high dimension. Unlike plug-in rules that simply replace the true distributions by their empirical counterparts, our method promotes couplings with low transport rank, a new structural assumption that is similar to the nonnegative rank of a matrix. Regularizing based on this assumption leads to drastic improvements on high-dimensional data for various tasks, including domain adaptation in single-cell RNA sequencing data. These findings are supported by a theoretical analysis that indicates that the transport rank is key in overcoming the curse of dimensionality inherent to data-driven optimal transport.
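To make the contrast in the abstract concrete, here is a minimal Python sketch under stated assumptions. It is not the authors' algorithm. The first function is the standard plug-in estimate for two equal-size, uniformly weighted samples, solved exactly as an assignment problem. The second builds a discrete coupling that routes mass through K intermediate "hubs", which is one assumed matrix analogue of a low-rank, factored coupling: the result has matrix rank at most K while keeping the prescribed marginals. The names `plugin_w2_squared`, `factored_coupling`, and the hub arrays `a` and `b` are hypothetical and chosen for illustration only.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def plugin_w2_squared(x, y):
    """Plug-in estimate of the squared 2-Wasserstein distance.

    Assumes x and y are (n, d) arrays of samples with uniform weights,
    so an optimal coupling can be taken to be a permutation.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.shape != y.shape:
        raise ValueError("this sketch assumes equal-size samples")
    # Pairwise squared Euclidean costs, shape (n, n).
    diff = x[:, None, :] - y[None, :, :]
    cost = np.sum(diff ** 2, axis=-1)
    # Optimal one-to-one matching between the two samples.
    rows, cols = linear_sum_assignment(cost)
    return cost[rows, cols].mean()


def factored_coupling(a, b):
    """Compose two couplings through K hubs (illustrative assumption).

    a: (n, K) nonnegative array coupling the source points to the hubs.
    b: (m, K) nonnegative array coupling the target points to the hubs.
    Both must give each hub the same positive mass lam_k.
    Returns gamma = sum_k a[:, k] b[:, k]^T / lam_k, shape (n, m).
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    lam = a.sum(axis=0)
    if not np.allclose(lam, b.sum(axis=0)):
        raise ValueError("hub masses of a and b must agree")
    if np.any(lam <= 0):
        raise ValueError("every hub needs positive mass")
    # Dividing column k by lam_k makes each term a weighted product coupling.
    return (a / lam) @ b.T
```

By construction, the row sums of the returned matrix equal the row sums of `a` and its column sums equal the row sums of `b`, so it is a valid coupling of those two marginals whose rank cannot exceed the number of hubs.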

Award ID(s): 1712596; 1740751; 1838071

PAR ID: 10100611

Author(s) / Creator(s): Forrow, Aden; Huetter, Jan-Christian; Nitzan, Mor; Rigollet, Philippe; Schiebinger, Geoffrey; Weed, Jonathan

Date Published: 2019-04-01

Journal Name: Proceedings of Machine Learning Research

Volume: 89

Page Range / eLocation ID: 2454-2465

Sponsoring Org: National Science Foundation

Full Text: Free, publicly accessible (Accepted Manuscript 1.0)

Document Type: Conference Paper

DOI: Not currently available
