首页> 外文会议>IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining >iTrustSO: An Intelligent System for Automatic Detection of Insecure Code Snippets in Stack Overflow
【24h】

iTrustSO: An Intelligent System for Automatic Detection of Insecure Code Snippets in Stack Overflow

机译:iTrustSO:一种自动检测堆栈溢出中不安全代码段的智能系统

获取原文

摘要

Despite the apparent benefits of modern social coding paradigm such as Stack Overflow, its potential security risks have been largely overlooked (e.g., insecure codes could be easily embedded and distributed). To address this imminent issue, in this paper, we bring a significant insight to leverage both social coding properties and code content for automatic detection of insecure code snippets in Stack Overflow. To determine if the given code snippets are insecure, we not only analyze the code content, but also utilize various kinds of relations among users, badges, questions, answers and code snippets in Stack Overflow. To model the rich semantic relationships, we first introduce a structured heterogeneous information network (HIN) for representation and then use meta-path based approach to incorporate higher-level semantics to build up relatedness over code snippets. Later, we propose a novel hierarchical attention-based sequence learning model named CodeHin2Vec to seamlessly integrate node (i.e., code snippet) content with HIN-based relations for representation learning. After that, a classifier is built for insecure code snippet detection. Integrating our proposed method, an intelligent system named iTrustSO is accordingly developed to address the code security issues in modern software coding platforms. Comprehensive experiments on the data collections from Stack Overflow are conducted to validate the effectiveness of our developed system iTrustSO by comparisons with alternative methods.
机译:尽管诸如堆栈溢出之类的现代社交编码范例具有明显的好处,但其潜在的安全风险已被大大忽略(例如,不安全的代码可以轻松地嵌入和分发)。为了解决这一迫在眉睫的问题,在本文中,我们带来了重要的见解,可以利用社交编码属性和代码内容来自动检测Stack Overflow中不安全的代码段。为了确定给定的代码片段是否不安全,我们不仅分析了代码内容,还利用了Stack Overflow中用户,徽章,问题,答案和代码片段之间的各种关系。为了对丰富的语义关系进行建模,我们首先引入一个结构化的异构信息网络(HIN)进行表示,然后使用基于元路径的方法来合并更高级别的语义,以建立与代码段的相关性。后来,我们提出了一种新颖的基于层次的,基于注意力的序列学习模型CodeHin2Vec,以将节点(即代码片段)内容与基于HIN的关系无缝集成在一起,以进行表示学习。此后,将为不安全的代码段检测构建分类器。通过整合我们提出的方法,开发了一个名为iTrustSO的智能系统,以解决现代软件编码平台中的代码安全性问题。通过与替代方法进行比较,对Stack Overflow的数据收集进行了全面的实验,以验证我们开发的系统iTrustSO的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号