首页> 外文会议>Conference on computational natural language learning >On Model Stability as a Function of Random Seed
【24h】

On Model Stability as a Function of Random Seed

机译:关于模型稳定性与随机种子的关系

获取原文

摘要

In this paper, we focus on quantifying model stability as a function of random seed by investigating the effects of the induced randomness on model performance and the robustness of the model in general. We specifically perform a controlled study on the effect of random seeds on the behaviour of attention, gradient-based and surrogate model based (LIME) interpretations. Our analysis suggests that random seeds can adversely affect the consistency of models resulting in counterfactual interpretations. We propose a technique called Aggressive Stochastic Weight Averaging (ASWA) and an extension called Norm-filtered Aggressive Stochastic Weight Averaging (NASWA) which improves the stability of models over random seeds. With our ASWA and NASWA based optimization, we are able to improve the robustness of the original model, on average reducing the standard deviation of the model's performance by 72%.
机译:在本文中,我们通过研究诱导随机性对模型性能和模型的鲁棒性的影响,着重于将模型稳定性作为随机种子的函数进行量化。我们专门针对随机种子对注意力行为,基于梯度和基于替代模型(LIME)的行为的影响进行了对照研究。我们的分析表明,随机种子可能会对模型的一致性产生不利影响,从而导致反事实解释。我们提出了一种称为“主动随机加权平均”(ASWA)的技术,以及一个称为“范数过滤的主动随机加权平均”(NASWA)的扩展,该扩展提高了模型对随机种子的稳定性。通过基于ASWA和NASWA的优化,我们能够提高原始模型的鲁棒性,平均将模型性能的标准偏差降低72%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号