Generalization to Mitigate Synonym Substitution Attacks

Abstract

Studies have shown that deep neural networks (DNNs) are vulnerable to adversarial examples: perturbed inputs that cause DNN-based models to produce incorrect results. One robust adversarial attack in the NLP domain is synonym substitution, in which the adversary replaces words with their synonyms. Because synonym substitution perturbations aim to satisfy all lexical, grammatical, and semantic constraints, they are difficult to detect both with automatic syntax checks and by humans. In this work, we propose the first defensive method to mitigate synonym substitution perturbations that can improve the robustness of DNNs on both clean and adversarial data. We improve the generalization of DNN-based classifiers by replacing the embeddings of the important words in the input samples with the average of their synonyms' embeddings. By doing so, we reduce model sensitivity to particular words in the input samples. Our algorithm is generic enough to be applied in any NLP domain and to any model trained on any natural language.
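The embedding replacement described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `smooth_embeddings`, the toy vectors, the synonym table, and the set of "important" words are all hypothetical, and the abstract does not say how important words are selected or whether a word's own embedding enters the average. The sketch assumes the important words are given and averages over the synonyms only.

```python
import numpy as np

def smooth_embeddings(tokens, embeddings, synonyms, important):
    """Return one vector per token.

    Tokens listed in `important` are represented by the mean of their
    synonyms' embeddings; all other tokens keep their own embedding.
    (Assumption: the selection of important words happens elsewhere.)
    """
    out = []
    for tok in tokens:
        vec = embeddings[tok]
        if tok in important:
            # Collect embeddings of known synonyms; skip out-of-vocabulary ones.
            syn_vecs = [embeddings[s] for s in synonyms.get(tok, []) if s in embeddings]
            if syn_vecs:  # fall back to the original vector if no synonym is available
                vec = np.mean(syn_vecs, axis=0)
        out.append(vec)
    return np.stack(out)

# Toy 2-dimensional embeddings, purely illustrative.
embeddings = {
    "the": np.array([0.0, 0.0]),
    "movie": np.array([1.0, 0.0]),
    "was": np.array([0.0, 1.0]),
    "good": np.array([2.0, 2.0]),
    "great": np.array([4.0, 2.0]),
    "fine": np.array([2.0, 4.0]),
}
synonyms = {"good": ["great", "fine"]}   # hypothetical synonym table
important = {"good"}                     # hypothetical importance selection
tokens = ["the", "movie", "was", "good"]

smoothed = smooth_embeddings(tokens, embeddings, synonyms, important)
```

Here the vector for "good" becomes the mean of the vectors for "great" and "fine", while the other tokens are unchanged. The resulting matrix would then be fed to the classifier in place of the raw embeddings.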

Bibliographic record

Source
Workshop on Knowledge Extraction and Integration for Deep Learning Architectures | 2020 | pp. 20-28 | 9 pages
Authors
Basemah Alshemali; Jugal Kalita
Original format
PDF

