首页> 外文期刊>Aquatic Sciences >Can bottom-up procedures improve the performance of stream classifications?
【24h】

Can bottom-up procedures improve the performance of stream classifications?

机译:自下而上的过程可以提高流分类的性能吗?

获取原文
获取原文并翻译 | 示例
           

摘要

Top-down methods for defining stream classifications are based on a conceptual model or expert-defined rules, whereas bottom-up methods use biological training data and statistical modelling. We compared the performance of six classification methods for explaining the taxonomic composition of invertebrate and fish assemblages recorded at 327 and 511 sites, respectively, distributed throughout France. Classification 1 and 2 were top-down classifications; The European Water Framework System A (WFDa,) and the French Hydro-ecoregions (HER 2). Four bottom-up classification procedures of increasing complexity were defined based on 11 variables that included watershed characteristics describing climate, topography, and geology, and site characteristics including elevation, bed slope and temperature. Classification 3 was defined using matrix correlation (MC) to select a combination of variable categories that produced the best discrimination of the observed taxonomic composition. Classification 4 and 5 were defined by clustering the sites based on their taxonomic data and then using linear discriminant analysis (LDA) and Random forests (RF) to discriminate the clusters based on the environmental variables. Classification 6 was defined using generalized dissimilarity modelling (GDM). Our hypothesis was that the bottom-up classifications would perform better because they flexibly accommodate complex relationships between compositional and environmental variation. We tested the classifications using the classification strength statistic (CS). The RF-based classification fitted the taxonomic patterns better than GDM or LDA and these latter classifications generally fitted better than the MC, WFDa or HER classifications. Cross validation analysis showed that differences in predictive CS (i.e. the CS statistics produced from sites not used in defining the classifications) were often significant. However, these differences were generally small. Gains in predictive performance of classifications appear to be small relative to the increase in complexity in the manner in which environmental variables are combined to define classes.
机译:自上而下的用于定义流分类的方法基于概念模型或专家定义的规则,而自下而上的方法则使用生物训练数据和统计建模。我们比较了六种分类方法的性能,以解释分别分布在整个法国的327和511个地点记录的无脊椎动物和鱼类组合的分类学组成。分类1和2是自上而下的分类;欧洲水框架体系A(WFDa)和法国水生态区(HER 2)。根据11个变量定义了四个自下而上的分类程序,这些程序的复杂性不断增加,这些变量包括描述气候,地形和地质的流域特征以及包括海拔,床坡度和温度在内的站点特征。使用矩阵相关性(MC)定义分类3,以选择对所观察到的生物分类组成产生最佳区分的变量类别的组合。通过基于站点的分类数据对站点进行聚类,然后使用线性判别分析(LDA)和随机森林(RF)来基于环境变量对聚类进行区分,来定义4类和5类。使用广义差异模型(GDM)定义分类6。我们的假设是自下而上的分类会更好,因为它们可以灵活地适应成分和环境变化之间的复杂关系。我们使用分类强度统计(CS)测试了分类。基于RF的分类比GDM或LDA更好地适合分类学模式,而这些后者的分类通常比MC,WFDa或HER更好。交叉验证分析表明,预测CS的差异(即,未用于定义分类的网站产生的CS统计数据)通常很明显。但是,这些差异通常很小。与将环境变量组合以定义类的方式的复杂性增加相比,分类的预测性能的收益似乎很小。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号