...
首页> 外文期刊>Journal of Hydrology >A tree-based statistical classification algorithm (CHAID) for identifying variables responsible for the occurrence of faecal indicator bacteria during waterworks operations
【24h】

A tree-based statistical classification algorithm (CHAID) for identifying variables responsible for the occurrence of faecal indicator bacteria during waterworks operations

机译:一种基于树的统计分类算法(CHAID),用于识别自来水厂运营期间造成粪便指示菌发生的变量

获取原文
获取原文并翻译 | 示例
           

摘要

Microbial contamination of groundwater used for drinking water can affect public health and is of major concern to local water authorities and water suppliers. Potential hazards need to be identified in order to protect raw water resources. We propose a non-parametric data mining technique for exploring the presence of total coliforms (TC) in a groundwater abstraction well and its relationship to readily available, continuous time series of hydrometric monitoring parameters (seven year records of precipitation, river water levels, and groundwater heads). The original monitoring parameters were used to create an extensive generic dataset of explanatory variables by considering different accumulation or averaging periods, as well as temporal offsets of the explanatory variables. A classification tree based on the Chi-Squared Automatic Interaction Detection (CHAID) recursive partitioning algorithm revealed statistically significant relationships between precipitation and the presence of TC in both a production well and a nearby monitoring well. Different secondary explanatory variables were identified for the two wells. Elevated water levels and short-term water table fluctuations in the nearby river were found to be associated with TC in the observation well. The presence of TC in the production well was found to relate to elevated groundwater heads and fluctuations in groundwater levels. The generic variables created proved useful for increasing significance levels. The tree-based model was used to predict the occurrence of TC on the basis of hydrometric variables. (C) 2014 Elsevier B.V. All rights reserved.
机译:用于饮用水的地下水的微生物污染会影响公众健康,这是当地水务部门和水供应商的主要关切。为了保护原水,需要确定潜在的危害。我们提出了一种非参数数据挖掘技术,用于探索地下水提取井中总大肠菌群(TC)的存在及其与随时可用的连续时间序列的水文监测参数(七年降水记录,河流水位和地下水头)。通过考虑不同的累积或平均周期以及解释变量的时间偏移,原始监视参数用于创建解释变量的广泛通用数据集。基于Chi-Squared自动交互检测(CHAID)递归分区算法的分类树显示,在生产井和附近的监测井中,降水与TC的存在之间存在统计学上的显着关系。确定了两个井的不同次要解释变量。在观测井中发现附近河流水位升高和短期水位波动与TC有关。发现生产井中TC的存在与地下水位升高和地下水位波动有关。事实证明,创建的通用变量对于提高重要性水平很有用。基于树的模型用于根据水文变量预测TC的发生。 (C)2014 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号