首页> 外文期刊>Statistics and Its Interface >Aggregated estimating equation estimation
【24h】

Aggregated estimating equation estimation

机译:聚合估计方程估计

获取原文
           

摘要

Motivated by the recent active research on online analytical processing (OLAP), we develop a computation and storage efficient algorithm for estimating equation (EE) estimation in massive data sets using a “divide-and-conquer” strategy. In each partition of the data set, we compress the raw data into some low dimensional statistics and then discard the raw data. Then, we obtain an approximation to the EE estimator, the aggregated EE (AEE) estimator, by solving an equation aggregated from the saved low dimensional statistics in all partitions. Such low dimensional statistics are taken as the EE estimates and first-order derivatives of the estimating equations in each partition. We show that, under proper partitioning and some regularity conditions, the AEE estimator is strongly consistent and asymptotically equivalent to the EE estimator. A major application of the AEE technique is to support fast OLAP of EE estimations for data warehousing technologies such as data cubes and data streams. It can also be used to reduce the computation time and conquer the memory constraint problem posed by massive data sets. Simulation studies show that the AEE estimator provides efficient storage and remarkable deduction in computational time, especially in its applications to data cubes and data streams.
机译:受在线分析处理(OLAP)的近期积极研究的推动,我们开发了一种计算和存储有效的算法,用于使用“分而治之”策略对海量数据集中的方程(EE)进行估计。在数据集的每个分区中,我们将原始数据压缩为一些低维统计信息,然后丢弃原始数据。然后,通过求解从所有分区中保存的低维统计信息中聚合的方程,我们获得了EE估算器(聚合的EE(AEE)估算器)的近似值。这样的低维统计量被用作每个分区中EE估计和估计方程的一阶导数。我们证明,在适当的分区和某些规则性条件下,AEE估计量与EE估计量具有很强的一致性和渐近性。 AEE技术的主要应用是为数据仓库技术(例如数据立方体和数据流)支持EE估计的快速OLAP。它也可以用来减少计算时间并解决海量数据集带来的内存约束问题。仿真研究表明,AEE估计器可以高效存储并显着减少计算时间,尤其是在其应用于数据立方体和数据流时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号