Measuring and Analyzing Search Engine Poisoning of Linguistic Collisions

机译：测量和分析语言碰撞的搜索引擎中毒

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Misspelled keywords have become an appealing target in search poisoning, since they are less competitive to promote than the correct queries and account for a considerable amount of search traffic. Search engines have adopted several countermeasure strategies, e.g., Google applies automated corrections on queried keywords and returns search results of the corrected versions directly. However, a sophisticated class of attack, which we term as linguistic-collision misspelling, can evade auto-correction and poison search results. Cybercriminals target special queries where the misspelled terms are existent words, even in other languages (e.g., "idobe", a misspelling of the English word "adobe", is a legitimate word in the Nigerian language). In this paper, we perform the first large-scale analysis on linguistic-collision search poisoning attacks. In particular, we check 1.77 million misspelled search terms on Google and Baidu and analyze both English and Chinese languages, which are the top two languages used by Internet users. We leverage edit distance operations and linguistic properties to generate misspelling candidates. To more efficiently identify linguistic-collision search terms, we design a deep learning model that can improve collection rate by 2.84x compared to random sampling. Our results show that the abuse is prevalent: around 1.19% of linguistic-collision search terms on Google and Baidu have results on the first page directing to malicious websites. We also find that cybercriminals mainly target categories of gambling, drugs, and adult content. Mobile-device users disproportionately search for misspelled keywords, presumably due to small screen for input. Our work highlights this new class of search engine poisoning and provides insights to help mitigate the threat.

机译：拼错的关键字已经成为搜索中毒有吸引力的目标，因为他们不太有竞争力，推动不是正确的查询，并占了相当数量的搜索流量。搜索引擎已经采取了若干治理对策，例如，谷歌自动适用于关键字查询更正并返回搜索直接修正版本的结果。然而，复杂的类的攻击，这是我们任期语言碰撞拼写错误，可以逃避自动校正和毒药的搜索结果。网络犯罪分子的目标，其中拼错的条款是存在的话，即使是在其他语言（例如，“idobe”，英文单词“土坯”的拼写错误，是在尼日利亚语言合法的字）的特殊查询。在本文中，我们执行的语言碰撞搜索中毒攻击的第一次大规模分析。特别是，我们检查谷歌和百度177万个拼写错误的搜索术语和分析英语和中国的语言，这是由互联网用户使用的前两种语言。我们充分利用编辑距离操作和语言特性产生拼错的候选人。为了更有效地识别语言碰撞搜索方面，我们设计了一个深刻的学习模式，可以通过2.84x相比，随机抽样提高收缴率。我们的研究结果表明，滥用非常普遍：各地对谷歌的语言碰撞搜索字词和百度1.19％的人引导到恶意网站的第一页的结果。我们还发现，犯罪分子主要针对赌，毒和成人内容的类别。移动设备用户不成比例的搜索拼错的关键字，这可能是由于小屏幕进行输入。我们的工作突出了这种新一类的搜索引擎中毒，并提供洞察力，以帮助减轻威胁。

著录项

来源
《IEEE Symposium on Security and Privacy》|2019年|xxiv 723 p. :|共15页
会议地点
作者
Matthew Joslin; Neng Li; Shuang Hao; Minhui Xue; Haojin Zhu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类安全保密;
关键词
Search engines; Google; Deep learning; Toxicology; Recurrent neural networks; Linguistics; Drugs;

机译：搜索引擎;谷歌;深层学习;毒理学;递归神经网络;语言学;药品;

相似文献

外文文献
中文文献
专利

1. Analyzing the emotional outcomes of the online search behavior with search engines [J] . Carlos Flavian-Bianco, Raquel Gurrea-Sarasa, Carlos Orus-Sanclemente Computers in Human Behavior . 2011,第1期

机译：使用搜索引擎分析在线搜索行为的情感结果
2. Linguistic Query Based Quality Evaluation of Selected Image Search Engines [J] . Artur Karczmarczyk, Jaros?aw Jankowski, Wojciech Sa?abun Procedia Computer Science . 2017,第1期

机译：基于语言查询的选定图像搜索引擎的质量评估
3. Building Better Search Engines by Measuring Search Quality [J] . Voorhees Ellen M., Over Paul, Soboroff Ian IT professional . 2014,第2期

机译：通过衡量搜索质量来构建更好的搜索引擎
4. Measuring and Analyzing Search Engine Poisoning of Linguistic Collisions [C] . Matthew Joslin, Neng Li, Shuang Hao, IEEE Symposium on Security and Privacy . 2019

机译：测量和分析搜索引擎语言冲突中毒
5. Analyzing Consumer Behavior on Product Search Engines: Interplay between Search and Social Media. [D] . Li, Beibei. 2012

机译：分析产品搜索引擎上的消费者行为：搜索和社交媒体之间的相互作用。
6. Can electronic search engines optimize screening of search results in systematic reviews: an empirical study [O] . Margaret Sampson, Nicholas J Barrowman, David Moher, 2006

机译：电子搜索引擎能否在系统评价中优化对搜索结果的筛选：一项实证研究
7. Search engine poisoning and its prevalence in modern search engines [O] . Blaauw Pieter 2013

机译：搜索引擎中毒及其在现代搜索引擎中的流行
8. A Technique for Measuring, Recording and Analyzing Torsional Oscillations of Crackshafts of Reciprocating Engines [R] . Hammell, T. H., Carvish, J. E. 1969

机译：往复式发动机裂纹轴扭振的测量，记录和分析技术

Measuring and Analyzing Search Engine Poisoning of Linguistic Collisions

摘要

著录项

相似文献

相关主题

期刊订阅