【24h】

Generalization to Mitigate Synonym Substitution Attacks

机译:概括,以缓解同义词替换攻击

获取原文

摘要

Studies have shown that deep neural networks are vulnerable to adversarial examples - perturbed inputs that cause DNN-based models to produce incorrect results. One robust adversarial attack in the NLP domain is the synonym substitution. In attacks of this variety, the adversary substitutes words with synonyms. Since synonym substitution perturbations aim to satisfy all lexical, grammatical, and semantic constraints, they are difficult to detect with automatic syntax check as well as by humans. In this work, we propose the first defensive method to mitigate synonym substitution perturbations that can improve the robustness of DNNs with both clean and adversarial data. We improve the generalization of DNN-based classifiers by replacing the embed-dings of the important words in the input samples with the average of their synonyms' em-beddings. By doing so, we reduce model sensitivity to particular words in the input samples. Our algorithm is generic enough to be applied in any NLP domain and to any model trained on any natural language.
机译:研究表明,深度神经网络容易受到对抗的例子 - 扰动输入,导致基于DNN的模型来产生不正确的结果。 NLP域中的一个强大的对抗攻击是同义词替换。在这种品种的攻击中,对手用同义词替换单词。由于同义词替代扰动旨在满足所有词汇,语法和语义约束,因此它们难以使用自动语法检查以及人类来检测。在这项工作中,我们提出了第一种防御方法来减轻可以改善DNN的鲁棒性与清洁和对冲数据的同义词替换扰动。通过将输入样本中的重要单词的嵌入点替换为具有其同义词的EM-BEDDINGS的平均值来改善基于DNN的分类器的泛化。通过这样做,我们将模型敏感性降低到输入样本中的特定单词。我们的算法足够通用,可以应用于任何NLP域以及任何自然语言培训的模型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号