Robust and scalable Bayes via a median of subset posterior measures.
JOURNAL OF MACHINE LEARNING RESEARCH(2017)
摘要
We propose a novel approach to Bayesian analysis that is provably robust to outliers in the data and often has computational advantages over standard methods. Our technique is based on splitting the data into non-overlapping subgroups, evaluating the posterior distribution given each independent subgroup, and then combining the resulting measures. The main novelty of our approach is the proposed aggregation step, which is based on the evaluation of a median in the space of probability measures equipped with a suitable collection of distances that can be quickly and efficiently evaluated in practice. We present both theoretical and numerical evidence illustrating the improvements achieved by our method.
更多查看译文
关键词
Big data,geometric median,distributed computing,parallel MCMC,Wasser-stein distance
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络