...
首页> 外文期刊>Computers & geosciences >Statistical analysis of water-quality data containing multiple detection limits II: S-language software for nonparametric distribution modeling and hypothesis testing
【24h】

Statistical analysis of water-quality data containing multiple detection limits II: S-language software for nonparametric distribution modeling and hypothesis testing

机译:包含多个检测限的水质数据的统计分析II:用于非参数分布建模和假设检验的S语言软件

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Analysis of low concentrations of trace contaminants in environmental media often results in left-censored data that are below some limit of analytical precision. Interpretation of values becomes complicated when there are multiple detection limits in the data—perhaps as a result of changing analytical precision over time. Parametric and semi-parametric methods, such as maximum likelihood estimation and robust regression on order statistics, can be employed to model distributions of multiply censored data and provide estimates of summary statistics. However, these methods are based on assumptions about the underlying distribution of data. Nonparametric methods provide an alternative that does not require such assumptions. A standard nonparametric method for estimating summary statistics of multiply-censored data is the Kaplan-Meier (K-M) method. This method has seen widespread usage in the medical sciences within a general framework termed "survival analysis" where it is employed with right-censored time-to-failure data. However, K—M methods are equally valid for the left-censored data common in the geosciences. Our S-language software provides an analytical framework based on K-M methods that is tailored to the needs of the earth and environmental sciences community. This includes routines for the generation of empirical cumulative distribution functions, prediction or exceedance probabilities, and related confidence limits computation. Additionally, our software contains K-M-based routines for nonparametric hypothesis testing among an unlimited number of grouping variables. A primary characteristic of K-M methods is that they do not perform extrapolation and interpolation. Thus, these routines cannot be used to model statistics beyond the observed data range or when linear interpolation is desired. For such applications, the aforementioned parametric and semi-parametric methods must be used.
机译:对环境介质中痕量污染物的低浓度进行分析通常会导致左删失的数据低于分析精度的某些限制。当数据中存在多个检测限时,值的解释会变得很复杂,这可能是由于分析精度随时间而变化的结果。参数和半参数方法(例如最大似然估计和阶次统计量的稳健回归)可用于对多重审查数据的分布进行建模,并提供汇总统计量的估计值。但是,这些方法基于有关数据基础分布的假设。非参数方法提供了不需要这种假设的替代方法。 Kaplan-Meier(K-M)方法是一种用于估计多重删失数据的摘要统计量的标准非参数方法。这种方法已在医学科学中被称为“生存分析”的通用框架中得到广泛使用,在该框架中,将其与经过正确删减的失效时间数据一起使用。但是,KM方法对于地球科学中常见的左删失数据同样有效。我们的S语言软件提供了一个基于K-M方法的分析框架,该框架适合于地球和环境科学界的需求。这包括用于生成经验累积分布函数,预测或超出概率以及相关置信度限制计算的例程。此外,我们的软件还包含基于K-M的例程,用于在无数个分组变量中进行非参数假设检验。 K-M方法的主要特征是它们不执行外推和内插。因此,这些例程无法用于对超出观测数据范围或需要线性插值的统计模型进行建模。对于此类应用,必须使用前述的参数和半参数方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号