A New Algorithm for Identifying Loops in Decompilation

机译：识别反编译循环的新算法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Loop identification is an essential step of control flow analysis in decompilation. The Classical algorithm for identifying loops is Tarjan's intervalfinding algorithm, which is restricted to reducible graphs. Havlak presents one extension of Tarjan's algorithm to deal with irreducible graphs, which constructs a loop-nesting forest for an arbitrary flow graph. There's evidence showing that the running time of this algorithm is quadratic in the worst-case, and not almost linear as claimed. Ramalingam presents an improved algorithm with low time complexity on arbitrary graphs, but it performs not quite well on "real" control flow graphs (CFG). We present a novel algorithm for identifying loops in arbitrary CFGs. Based on a more detailed exploration on properties of loops and depth-first search (DFS), this algorithm traverses a CFG only once based on DFS and collects all information needed on the fly. It runs in approximately linear time and does not use any complicated data structures such as Interval/Derived Sequence of Graphs (DSG) or UNION-FIND sets. To perform complexity analysis of the algorithm, we introduce a new concept called unstructuredness coefficient to describe the unstructuredness of CFGs, and we find that the unstructuredness coefficients of these executables are usually small (＜1.5). Such "low-unstructuredness" property distinguishes these CFGs from general single-root connected directed graphs, and it offers an explanation why those algorithms existed perform not quite well on real-world cases. The new algorithm has been applied to 11526 CFGs in 6 typical binary executables on both Linux and Window platforms. Experimental result has validated our theoretical analysis and it shows that our algorithm runs 2-5 times faster than the Havlak-Tarjan algorithm, and 2-8 times faster than the Ramalingam-Havlak-Tarjan algorithm.

机译：回路识别是反编译中控制流分析的重要步骤。识别循环的经典算法是Tarjan的间隔查找算法，该算法仅限于可归约图。 Havlak提出了Tarjan算法的一种扩展，用于处理不可约图，该算法为任意流程图构建了一个循环嵌套的森林。有证据表明，在最坏的情况下，该算法的运行时间是二次的，而不是所要求的几乎线性的。 Ramalingam提出了一种改进的算法，该算法在任意图上具有较低的时间复杂度，但是在“实际”控制流程图（CFG）上却表现不佳。我们提出了一种新颖的算法，用于识别任意CFG中的循环。基于对循环属性和深度优先搜索（DFS）的更详细研究，该算法基于DFS仅遍历一次CFG，并即时收集所有所需信息。它以大约线性时间运行，并且不使用任何复杂的数据结构，例如区间/衍生图序列（DSG）或UNION-FIND集。为了进行算法的复杂性分析，我们引入了一个非结构化系数的新概念来描述CFG的非结构化，发现这些可执行文件的非结构化系数通常很小（＜1.5）。这种“低非结构性”属性将这些CFG与一般的单根连接有向图区分开来，并提供了一个解释，说明存在这些算法的原因在现实情况下效果不佳。新算法已在Linux和Window平台上应用于6个典型二进制可执行文件中的11526 CFG。实验结果验证了我们的理论分析，结果表明我们的算法运行速度比Havlak-Tarjan算法快2-5倍，比Ramalingam-Havlak-Tarjan算法快2-8倍。

著录项

来源
《International Symposium on Static Analysis(SAS 2007); 20070822-24; Kongens Lyngby(DK)》|2007年|P.170-183|共14页
会议地点 Kongens Lyngby(DK)
作者
Tao Wei; Jian Mao; Wei Zou; Yu Chen;
展开▼
作者单位

Institute of Computer Science and Technology Peking University;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词
control flow analysis; decompilation; loop identifying; unstructuredness coefficient;

机译：控制流分析;反编译;回路识别;非结构化系数;

相似文献

外文文献
中文文献
专利

1. Day and night closed-loop control in adults with type 1 diabetes: A comparison of two closed-loop algorithms driving continuous subcutaneous insulin infusion versus patient self-management [J] . LuijfY.M., HansDeVriesJ., ZwindermanK., Diabetes care . 2013,第12期

机译：成人1型糖尿病患者的昼夜闭环控制：两种驱动连续皮下胰岛素输注与患者自我管理的闭环算法的比较
2. Loop-Star and Loop-Tree Decompositions: Analysis and Efficient Algorithms [J] . Andriulli F.P. Antennas and Propagation, IEEE Transactions on . 2012,第5期

机译：环星和环树分解：分析和高效算法
3. Novel simulation-based algorithms for optimal open-loop and closed-loop scheduling of deficit irrigation systems [J] . N. Schutze, M. de Paly, U. Shamir Journal of Hydroinformatics . 2012,第1期

机译：基于模拟的新型算法，用于亏缺灌溉系统的最佳开环和闭环调度
4. A New Algorithm for Identifying Loops in Decompilation [C] . Tao Wei, Jian Mao, Wei Zou, International Symposium on Static Analysis . 2007

机译：一种识别反编译循环的新算法
5. Developing Algorithms to Detect Incidents on Freeways from Loop Detector and Vehicle Re-Identification Data [D] . Adhikari, Biraj. 2019

机译：从循环检测器和车辆重新识别数据中检测高速公路上检测事件的算法
6. Bias optimal linear estimation and the differences between open-loop simulation and closed-loop performance of spiking-based brain-computer interface algorithms [O] . Steven M. Chase, Andrew B. Schwartz, Robert E. Kass -1

机译：偏振最优线性估计和尖锐型脑电脑接口算法开环仿真与闭环性能的差异
7. 3P271 An algorithm for identifying loop crossing in protein structures(20. Origin of life Evolution,Poster,The 52nd Annual Meeting of the Biophysical Society of Japan(BSJ2014)) [O] . Tatsuo Mukai, George Chikenji 2014

机译：3P271一种识别蛋白质结构中环路循环的算法（20.人生进化，海报，海报，日本生物物理学会的第52次年会（BSJ2014））

A New Algorithm for Identifying Loops in Decompilation

摘要

著录项

相似文献

相关主题

期刊订阅