首页> 外文会议>IEEE International Conference on Software Analysis, Evolution and Reengineering >HashMTI: Scalable Mutation-based Taint Inference with Hash Records
【24h】

HashMTI: Scalable Mutation-based Taint Inference with Hash Records

机译:HashMTI:基于可扩展的突变的污点推论哈希记录

获取原文

摘要

Mutation-based taint inference (MTI) is a novel technique for taint analysis. Compared with traditional techniques that track propagations of taint tags, MTI infers a variable is tainted if its values change due to input mutations, which is lightweight and conceptually sound. However, there are 3 challenges to its efficiency and scalability: (1) it cannot efficiently record variable values to monitor their changes; (2) it consumes a large amount of memory monitoring variable values, especially on complex programs; and (3) its excessive memory overhead leads to a low hit ratio of CPU cache, which slows down the speed of taint inference. This paper presents an efficient and scalable solution named HashMTI. We first explain the above challenges based on 4 observations. Motivated by these challenges, we propose a hash record scheme to efficiently monitor changes in variable values and significantly reduce the memory overhead. The scheme is based on our specially selected and optimized hash functions that possess 3 crucial properties. Moreover, we propose the DoubleMutation strategy, which applies additional mutations to mitigate the limitation of the hash record and detect more taint information. We implemented a prototype of HashMTI and evaluated it on 18 real-world programs and 4 LAVA-M programs. Compared with the baseline OrigMTI, HashMTI significantly reduces the overhead while having similar accuracy. It achieves a speedup of 2.5X to 23.5X and consumes little memory which is on average 70.4 times less than that of OrigMTI.
机译:基于突变的污染推理(MTI)是一种用于污染分析的新技术。与追踪Taint标签的传播的传统技术相比,MTI Infers如果由于输入突变而导致的值,则污染变量,这是重量轻的,概念性的声音。但是,它的效率和可扩展性存在3个挑战:(1)它无法有效记录变量值以监控其变化; (2)它会消耗大量的内存监控变量值,尤其是在复杂程序上; (3)其过度的记忆开销导致CPU高速缓存的低击中率,从而减慢了污染推理的速度。本文介绍了名为Hashmti的高效且可扩展的解决方案。我们首先根据4个观察解释上述挑战。通过这些挑战,我们提出了一个哈希录制方案,以有效监测变量值的变化,并显着降低内存开销。该方案基于我们的特殊选择和优化的散列功能,具有3个至关重要的属性。此外,我们提出了辅助突变的曲标策略,该策略应用额外的突变来减轻散列记录的限制并检测更多的污染信息。我们实施了HashMTI的原型,并在18个现实世界计划和4个熔岩程序中进行了评估。与基线OrigMTI相比,HashMTI在具有相似准确度的同时显着降低了开销。它达到了2.5倍至23.5倍的加速,并消耗了小于OrigMTI的小70.4倍的小记忆。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号