Explanation of Variability and Removal of Confounding Factors from Data through Optimal Transport

Esteban G Tabak,Giulio Trigila

doi:10.1002/cpa.21706

Abstract

A methodology based on the theory of optimal transport is developed to attribute variability in data sets to known and unknown factors and to remove such attributable components of the variability from the data. Denoting by x the quantities of interest and by z the explanatory factors, the procedure transforms x into filtered variables y through a z‐dependent map, so that the conditional probability distributions ρ(x|z) are pushed forward into a target distribution μ(y), independent of z. Among all maps and target distributions that achieve this goal, the procedure selects the one that minimally distorts the original data: the barycenter of the ρ(x|z). Connections are found to unsupervised learning and to fundamental problems in statistics such as conditional density estimation and sampling. Particularly simple instances of the methodology are shown to be equivalent to k‐means and principal component analysis. An application is shown to a time series of ground temperature hourly data across the United States.© 2017 Wiley Periodicals, Inc.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Explanation of Variability and Removal of Confounding Factors from Data through Optimal Transport

Abstract

Talk to us

Similar Papers

More From: Communications on Pure and Applied Mathematics

Lead the way for us

Journal: Communications on Pure and Applied Mathematics	Publication Date: Jul 5, 2017
Citations: 28

Similar Papers

Density-aware decentralised multi-agent exploration with energy constraint based on optimal transport theory
Kooktae Lee ... Rabiul Hasan Kabir
International Journal of Systems Science | VOL. 53
Kooktae Lee, et. al.Kooktae Lee ... Rabiul Hasan Kabir
29 Sep 2021
International Journal of Systems Science | VOL. 53

Wasserstein distance-based full waveform inversion method for density reconstruction
Hongying Liu ... Sen Yang
Journal of Applied Geophysics | VOL. 223
Hongying Liu, et. al.Hongying Liu ... Sen Yang
01 Apr 2024
Journal of Applied Geophysics | VOL. 223

Optimal transport theory to simplify freeform design
Zexin Feng ... Dewen Cheng
-
Zexin Feng, et. al.Zexin Feng ... Dewen Cheng
01 Jan 2019
01 Jan 2019

Global Sensitivity Analysis via Optimal Transport
Emanuele Borgonovo ... Alessio Figalli
Management Science | VOL. -
Emanuele Borgonovo, et. al.Emanuele Borgonovo ... Alessio Figalli
21 Aug 2024
Management Science | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Explanation of Variability and Removal of Confounding Factors from Data through Optimal Transport

Abstract

Talk to us

Similar Papers

More From: Communications on Pure and Applied Mathematics