...
首页> 外文期刊>Computational statistics & data analysis >Automatic aggregation of categories in multivariate contingency tables using information theory
【24h】

Automatic aggregation of categories in multivariate contingency tables using information theory

机译:使用信息论自动汇总多元列联表中的类别

获取原文
获取原文并翻译 | 示例
           

摘要

Low expected frequencies in tests associated to log-linear models building are treated with the aim of providing a methodology, useful for nonstatistician users, to analyze multivariate contingency tables. A procedure that reproduces the decisions of a statistical analyst studying a multivariate contingency table and confronted with low expected frequencies is provided, using the Bayesian information criterion to select a variable over which the aggregation should be done, and the entropy of Shannon to decide which categories should be aggregated. Prior opinions and knowledge about the feasibility of aggregation of categories within the context where the data have been collected are included in the system. The procedure has some user friendly techniques oriented to nonstatisticians, and it allowed greater efficiency when there are several multivariate tables to be analyzed using some variables that can be included in different log-linear models.
机译:处理与对数线性模型建立相关的测试中的低期望频率是为了提供一种对非统计量用户有用的分析多元列联表的方法。提供了一个过程,该过程可重现研究多元列联表的统计分析师的决策,并且面临较低的预期频率,该过程使用贝叶斯信息准则来选择应进行聚合的变量,并使用Shannon的熵来确定哪些类别应该汇总。系统中包含有关在已收集数据的上下文中汇总类别的可行性的事先意见和知识。该过程具有一些面向非统计人员的用户友好技术,并且当使用多个可包含在不同对数线性模型中的变量来分析多个变量表时,它可以提高效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号