Code relatives: detecting similarly behaving software

Su, Fang-Hsiang; Bell, Jonathan; Harvey, Kenneth; Sethumadhavan, Simha; Kaiser, Gail; Jebara, Tony

doi:10.1145/2950290.2950321

Detecting “similar code” is useful for many software engineering tasks. Current tools can help detect code with statically similar syntactic and/or semantic features (code clones) and with dynamically similar functional input/output (simions). Unfortunately, some code fragments that behave similarly at the finer granularity of their execution traces may be ignored. In this paper, we propose the term “code relatives” to refer to code with similar execution behavior. We define code relatives and then present DyCLINK, our approach to detecting code relatives within and across codebases. DyCLINK records instruction-level traces from sample executions, organizes the traces into instruction-level dynamic dependence graphs, and employs our specialized subgraph matching algorithm to efficiently compare the executions of candidate code relatives. In our experiments, DyCLINK analyzed 422+ million prospective subgraph matches in only 43 minutes. We compared DyCLINK to one static code clone detector from the community and to our implementation of a dynamic simion detector. The results show that DyCLINK effectively detects code relatives with a reasonable analysis time.

Award ID(s): 1302269; 1161079

PAR ID: 10112154

Author(s) / Creator(s): Su, Fang-Hsiang; Bell, Jonathan; Harvey, Kenneth; Sethumadhavan, Simha; Kaiser, Gail; Jebara, Tony

Date Published: 2016-11-13

Journal Name: 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering

Page Range / eLocation ID: 702 - 714

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper:
https://doi.org/10.1145/2950290.2950321
