Geodesic Sinkhorn for fast and accurate optimal transport on manifolds.

Guillaume Huguet, Alexander Tong, María Ramos Zapatero, Christopher J Tape, Guy Wolf, Smita Krishnaswamy

arXiv (2023)

Abstract
Efficient computation of optimal transport distances between distributions is of growing importance in data science. Sinkhorn-based methods are currently the state of the art for such computations, but require O(n^2) computations. In addition, Sinkhorn-based methods commonly use a Euclidean ground distance between datapoints. However, with the prevalence of manifold-structured scientific data, it is often desirable to consider a geodesic ground distance. Here, we tackle both issues by proposing Geodesic Sinkhorn, based on diffusing a heat kernel on a manifold graph. Notably, Geodesic Sinkhorn requires only O(n log n) computation, as we approximate the heat kernel with Chebyshev polynomials based on the sparse graph Laplacian. We apply our method to the computation of barycenters of several distributions of high-dimensional single-cell data from patient samples undergoing chemotherapy. In particular, we define the barycentric distance as the distance between two such barycenters. Using this definition, we identify an optimal transport distance and path associated with the effect of treatment on cellular data.
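The idea of running Sinkhorn iterations with a Chebyshev-approximated heat kernel can be illustrated with a minimal sketch. This is not the authors' implementation; it rests on several assumptions made here for illustration: a toy cycle graph stands in for the manifold graph, the combinatorial Laplacian L = D - W is used, the spectral bound `lmax` is taken as twice the maximum degree, and the function names (`cycle_laplacian`, `heat_apply`, `geodesic_sinkhorn_sketch`) and the `floor` guard are hypothetical. The expansion uses the standard identity exp(-a y) = I_0(a) + 2 Σ_k (-1)^k I_k(a) T_k(y) for y in [-1, 1], so the heat kernel is only ever applied to vectors through sparse matrix-vector products and is never formed densely.

```python
import numpy as np
import scipy.sparse as sp
from scipy.special import ive  # exponentially scaled modified Bessel function I_k


def cycle_laplacian(n):
    """Sparse combinatorial Laplacian L = D - W of an n-node cycle graph (toy stand-in)."""
    eye = np.eye(n)
    W = sp.csr_matrix(np.roll(eye, 1, axis=1) + np.roll(eye, -1, axis=1))
    degrees = np.asarray(W.sum(axis=1)).ravel()
    return sp.diags(degrees) - W


def heat_apply(L, x, t, lmax, order=40):
    """Approximate exp(-t L) @ x with a truncated Chebyshev expansion.

    Assumes L is symmetric with eigenvalues in [0, lmax].
    """
    a = t * lmax / 2.0
    k = np.arange(order + 1)
    coeffs = 2.0 * (-1.0) ** k * ive(k, a)  # ive already includes the factor exp(-a)
    coeffs[0] = ive(0, a)

    def L_tilde(y):
        # Rescaled operator (2 / lmax) L - I, whose spectrum lies in [-1, 1].
        return (2.0 / lmax) * (L @ y) - y

    T_prev, T_curr = x, L_tilde(x)
    out = coeffs[0] * T_prev + coeffs[1] * T_curr
    for c in coeffs[2:]:
        # Chebyshev recurrence T_{k+1} = 2 y T_k - T_{k-1}.
        T_prev, T_curr = T_curr, 2.0 * L_tilde(T_curr) - T_prev
        out = out + c * T_curr
    return out


def geodesic_sinkhorn_sketch(L, a, b, t, lmax, n_iter=500, floor=1e-30):
    """Sinkhorn scaling with the heat kernel applied implicitly.

    Returns scaling vectors (u, v); the implied plan is diag(u) H_t diag(v).
    The floor is an illustrative guard against tiny negative values that
    the polynomial approximation can produce.
    """
    def K(y):
        return np.maximum(heat_apply(L, y, t, lmax), floor)

    u = np.ones_like(a)
    v = np.ones_like(b)
    for _ in range(n_iter):
        u = a / K(v)
        v = b / K(u)  # H_t is symmetric, so no transpose is needed
    return u, v


# Usage on a 12-node cycle with two arbitrary positive distributions.
n = 12
L = cycle_laplacian(n)
a = np.linspace(1.0, 2.0, n)
a /= a.sum()
b = np.linspace(2.0, 1.0, n)
b /= b.sum()
u, v = geodesic_sinkhorn_sketch(L, a, b, t=5.0, lmax=4.0)
```

Each call to `heat_apply` costs on the order of `order` sparse matrix-vector products, which is where the savings over a dense n × n kernel come from in this sketch. The truncation order needs to grow with `t * lmax`, and the values used here are tuned only for this toy graph.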