Outlier-Robust Optimal Transport
Authors
Authors
- Debarghya Mukherjee
- Aritra Guha
- Yuekai Sun
- Mikhail Yurochkin
- Justin Solomon
Authors
- Debarghya Mukherjee
- Aritra Guha
- Yuekai Sun
- Mikhail Yurochkin
- Justin Solomon
Published on
12/14/2020
Categories
Optimal transport (OT) provides a way of measuring distances between distributions that depends on the geometry of the sample space. In light of recent advances in solving the OT problem, OT distances are widely used as loss functions in minimum distance estimation. Despite its prevalence and advantages, however, OT is extremely sensitive to outliers. A single adversarially-picked outlier can increase OT distance arbitrarily. To address this issue, in this work we propose an outlier-robust OT formulation. Our formulation is convex but challenging to scale at a first glance. We proceed by deriving an \emph{equivalent} formulation based on cost truncation that is easy to incorporate into modern stochastic algorithms for regularized OT. We demonstrate our model applied to mean estimation under the Huber contamination model in simulation as well as outlier detection on real data.
Please cite our work using the BibTeX below.
@misc{mukherjee2020outlierrobust,
title={Outlier-Robust Optimal Transport},
author={Debarghya Mukherjee and Aritra Guha and Justin Solomon and Yuekai Sun and Mikhail Yurochkin},
year={2020},
eprint={2012.07363},
archivePrefix={arXiv},
primaryClass={stat.ME}
}