【24h】

Statistical Models for Count Data

机译:计数数据的统计模型

获取原文
           

摘要

Statistical analyses involving count data may take several forms depending on the context of use, that is; simple counts such as the number of plants in a particular field and categorical data in which counts represent the number of items falling in each of the several categories. The mostly adapted model for analyzing count data is the Poisson model. Other models that can be considered for modeling count data are the negative binomial and the hurdle models. It is of great importance that these models are systematically considered and compared before choosing one at the expense of others to handle count data. In real world situations count data sets may have zero counts which have an importance attached to them. In this work, statistical simulation technique was used to compare the performance of these count data models. Count data sets with different proportions of zero were simulated. Akaike Information Criterion (AIC) was used in the simulation study to compare how well several count data models fit the simulated datasets. From the results of the study it was concluded that negative binomial model fits better to over-dispersed data which has below 0.3 proportion of zeros and that hurdle model performs better in data with 0.3 and above proportion of zero.
机译:涉及计数数据的统计分析可以根据使用情况采用多种形式,即:简单计数,例如特定字段中的植物数量和分类数据,其中计数代表属于几个类别中每个类别的物品数。用于分析计数数据的最适合的模型是泊松模型。可以考虑对计数数据进行建模的其他模型是负二项式和障碍模型。在选择一个模型以牺牲其他模型来处理计数数据之前,系统地考虑和比较这些模型非常重要。在现实世界中,计数数据集可能具有零计数,这对其具有重要意义。在这项工作中,使用统计模拟技术来比较这些计数数据模型的性能。模拟具有不同零比例的计数数据集。在模拟研究中使用了Akaike信息准则(AIC),以比较几种计数数据模型对模拟数据集的适应程度。从研究结果可以得出结论,负二项式模型更适合于零比例低于0.3的过度分散数据,而障碍模型在比例大于0.3且高于0.3的数据中表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号