...
首页> 外文期刊>Computational statistics & data analysis >Powered embarrassing parallel MCMC sampling in Bayesian inference, a weighted average intuition
【24h】

Powered embarrassing parallel MCMC sampling in Bayesian inference, a weighted average intuition

机译:Powered尴尬并行MCMC在贝叶斯推理中采样,加权平均直觉

获取原文
获取原文并翻译 | 示例
           

摘要

Abstract Although the Markov Chain Monte Carlo (MCMC) is very popular in parameter inference, the alleviation of the burden of calculation is crucial due to the limit of processors, memory, and disk bottleneck. This is especially true in terms of handling big data. In recent years, researchers have developed a parallel MCMC algorithm, in which full data are partitioned into subdatasets. Samples are drawn from the subdatasets independently at different machines without communication. In the extant literature, all machines are deemed to be identical. However, due to the heterogeneity of the data put into different machines, and the random nature of MCMC, the assumption of “identical machines” is questionable. Here we propose a Powered Embarrassing Parallel MCMC (PEPMCMC) algorithm, in which the full data posterior density is the product of the sub-posterior densities (posterior densities of different subdatasets) raised by some constraint powers. This is proven to be equivalent to a weighted averaging procedure. In our work, the powers are determined based on a maximum likelihood criterion, which leads to finding a maximum likelihood point within the convex hull of the estimates from different machines. We prove the asymptotic exactness and apply it to several cases to verify its strength in comparison with the unparallel and unpowered parallel algorithms. Furthermore, the connection between normal kernel density and parametric density estimations under certain conditions is investigated. ]]>
机译:<![cdata [ Abstract 虽然Markov链Monte Carlo(MCMC)在参数推断中非常流行,但由于限制,降低计算负担至关重要处理器,内存和磁盘瓶颈。在处理大数据方面尤其如此。近年来,研究人员已经开发了一种并行MCMC算法,其中将完整数据分区为子地图。在没有通信的不同机器上独立地从子地施抽取样本。在现存的文献中,所有机器都被认为是相同的。然而,由于数据的数据流入不同的机器,以及MCMC的随机性,“相同机器”的假设是可疑的。在这里,我们提出了一种动力的令人尴尬的并行MCMC(PEPMCMC)算法,其中全数据后密度是由某种约束力提高的子后密度(不同子地下的后密度)的产物。证明这相当于加权平均程序。在我们的工作中,权力是基于最大似然标准确定的,这导致在不同机器的估计的凸壳中找到最大似然点。我们证明了渐近精确性,并将其应用于几个案例,以验证其强度与无与伦比和无功的平行算法相比。此外,研究了在某些条件下正常核心密度和参数密度估计之间的连接。 ]]>

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号