Code Large Language Models (Code LLMs) have excelled at tasks like code completion but often miss deeper semantics such as execution effects and dynamic states. This paper aims to bridge the gap between Code LLMs' reliance on static text data and the semantic understanding needed for complex tasks like debugging and program repair. We introduce a novel strategy, monologue reasoning, to train Code LLMs to reason about comprehensive semantics, encompassing high-level functional descriptions, local execution effects of individual statements, and overall input/output behavior, thereby linking static code text with dynamic execution states. We begin by collecting PyX, a clean Python corpus of fully executable code samples with functional descriptions and test cases. We propose training Code LLMs not only to write code but also to understand code semantics by reasoning about key properties, constraints, and execution behaviors in natural language, mimicking human verbal debugging, i.e., rubber-duck debugging. This approach led to SemCoder, a Code LLM with only 6.7B parameters that is competitive with GPT-3.5-turbo on code generation and execution reasoning tasks. SemCoder achieves 79.3% on HumanEval (GPT-3.5-turbo: 76.8%), 63.6% on CRUXEval-I (GPT-3.5-turbo: 50.3%), and 63.9% on CRUXEval-O (GPT-3.5-turbo: 59.0%). We also compare SemCoder's monologue-style execution reasoning with concrete scratchpad reasoning, showing that our approach integrates semantics from multiple dimensions more smoothly. Finally, we demonstrate the potential of applying the learned semantics to improve Code LLMs' debugging and self-refining capabilities. Our data, code, and models are available at: https://github.com/ARiSE-Lab/SemCoder.
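To make the execution-reasoning benchmarks concrete, here is a minimal, hypothetical CRUXEval-style problem in Python (illustrative only, not drawn from the benchmark itself): CRUXEval-I asks a model to infer an input that produces a given output, while CRUXEval-O asks it to predict the output for a given input.

```python
# A minimal, hypothetical CRUXEval-style problem (illustrative only).

def f(xs):
    out = []
    for x in xs:
        if x % 2 == 0:         # keep even elements...
            out.append(x * x)  # ...and square them
    return out

# CRUXEval-O: given f and the input, the model predicts the right-hand side.
assert f([1, 2, 3, 4]) == [4, 16]

# CRUXEval-I: given f and the output, the model supplies an input that
# satisfies the assertion (any list whose even elements are 2 and 6 works).
assert f([2, 5, 6]) == [4, 36]
```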
Evaluation of Large Language Models on Code Obfuscation (Student Abstract)
Obfuscation aims to reduce the interpretability of code and hinder identification of its behavior. Large Language Models (LLMs) have been proposed for code synthesis and code analysis. This paper attempts to understand how well LLMs can analyze code and identify code behavior. Specifically, it systematically evaluates several LLMs' abilities to detect obfuscated code and identify its behavior across a variety of obfuscation techniques with varying levels of complexity. The LLMs proved better at detecting obfuscations that changed identifiers, even to misleading ones, than at detecting obfuscations involving code insertions (unused variables, as well as expressions that replace constants but evaluate to the same values). Hardest to detect were obfuscations that layered multiple simple transformations: for these, only 20-40% of the LLMs' responses were correct. Adding misleading documentation was also effective at deceiving the LLMs. We provide all our code to replicate our results at https://github.com/SwindleA/LLMCodeObfuscation. Overall, our results suggest a gap in LLMs' ability to understand code.
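The transformation categories described above are easy to illustrate. Below is a hedged Python sketch of the kinds of obfuscations the abstract mentions; the study's actual transformation set and target programs are in the linked repository, so these exact functions are illustrative reconstructions rather than the paper's artifacts.

```python
# Original: self-documenting code.
def average(values):
    total = sum(values)
    return total / len(values)

# Identifier renaming with deliberately misleading names
# (the category the LLMs detected most reliably).
def sort_strings(network_timeout):
    error_code = sum(network_timeout)
    return error_code / len(network_timeout)

# Code insertion: an unused variable, plus a constant replaced by an
# expression that evaluates to it (harder for the LLMs to detect).
def average_padded(values):
    _buffer = [0] * (2 ** 3)        # unused variable
    scale = (7 - 6) * (10 // 10)    # evaluates to the constant 1
    return (sum(values) * scale) / len(values)

# Layered obfuscation (the hardest case in the study): renaming,
# insertion, and constant rewriting applied together.
def parse_header(socket_fd):
    retry_limit = [0] * (2 ** 3)    # unused, misleadingly named
    checksum = sum(socket_fd) * ((7 - 6) * (10 // 10))
    return checksum / len(socket_fd)
```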
- Award ID(s): 2150145
- PAR ID: 10498498
- Publisher / Repository: AAAI
- Date Published:
- Journal Name: Proceedings of the AAAI Conference on Artificial Intelligence
- Volume: 38
- Issue: 21
- ISSN: 2159-5399
- Page Range / eLocation ID: 23664 to 23666
- Format(s): Medium: X
- Sponsoring Org: National Science Foundation
More Like this
This paper presents MalMax, a novel system to detect server-side malware that routinely employs sophisticated polymorphic and evasive runtime code-generation techniques. When MalMax encounters an execution point that presents multiple possible execution paths (e.g., via predicates and/or dynamic code), it explores these paths through counterfactual execution of code sandboxed within an isolated execution environment. Furthermore, a unique feature of MalMax is its cooperative isolated execution model, in which unresolved artifacts (e.g., variables, functions, and classes) within one execution context can be concretized using values from other execution contexts. Such cooperation dramatically amplifies the reach of counterfactual execution; for WordPress, for example, cooperation yields 63% additional code coverage. The combination of counterfactual execution and cooperative isolated execution enables MalMax to accurately and effectively identify malicious behavior. Using a large (1-terabyte) real-world dataset of PHP web applications collected from a commercial web hosting company, we performed an extensive evaluation of MalMax, comparing its ability to detect malware against VirusTotal, a malware detector that aggregates many diverse scanners. Our evaluation results show that MalMax is highly effective in exposing malicious behavior in complicated polymorphic malware. MalMax also identified 1,485 malware samples that are not detected by any existing state-of-the-art tool, even after 7 months in the wild.
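The core idea, counterfactual execution, can be sketched compactly. The toy below is written in Python (MalMax itself targets PHP), and every name in it is hypothetical rather than MalMax's API: when a branch predicate cannot be resolved (e.g., it depends on an attacker-supplied value), both paths are explored on isolated copies of the program state, so payloads hidden behind evasive checks still surface.

```python
import copy

class _Unresolved:
    """Sentinel for a value only known at attack time (e.g., C2 input)."""
    def __deepcopy__(self, memo):
        return self  # preserve identity across forked states

UNRESOLVED = _Unresolved()

def explore(stmts, state, trace, traces):
    """Run (kind, payload) statements, forking on unresolved predicates."""
    for i, (kind, payload) in enumerate(stmts):
        if kind == "assign":
            name, fn = payload
            state[name] = fn(state)
        elif kind == "branch":
            cond, then_branch, else_branch = payload
            rest = stmts[i + 1:]
            value = cond(state)
            if value is UNRESOLVED:
                # Counterfactually take BOTH paths in isolated states.
                explore(then_branch + rest, copy.deepcopy(state),
                        trace + ["then"], traces)
                explore(else_branch + rest, copy.deepcopy(state),
                        trace + ["else"], traces)
                return
            explore((then_branch if value else else_branch) + rest,
                    state, trace, traces)
            return
        elif kind == "emit":
            trace = trace + [payload(state)]
    traces.append(trace)

# A program whose malicious payload hides behind an unresolvable check.
program = [
    ("assign", ("key", lambda s: UNRESOLVED)),
    ("branch", (lambda s: s["key"],
                [("emit", lambda s: "decode-and-eval payload")],
                [("emit", lambda s: "benign banner")])),
]

traces = []
explore(program, {}, [], traces)
print(traces)  # both paths observed, so the malicious one is exposed
```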
Malware written in dynamic languages such as PHP routinely employs anti-analysis techniques, such as obfuscation schemes and evasive tricks, to avoid detection. On top of that, attackers use automated malware-creation tools to create numerous variants with little to no manual effort. This paper presents a system called Cubismo to solve this pressing problem. It processes potentially malicious files and decloaks their obfuscations, exposing the hidden malicious code across multiple files. The resulting files can be scanned by existing malware detection tools, leading to a much higher chance of detection. Cubismo achieves improved detection by counterfactually exploring all executable statements of a suspect program, seeing through complicated polymorphism, metamorphism, and obfuscation techniques to expose any malware. Our evaluation on a real-world dataset collected from a commercial web hosting company shows that Cubismo is highly effective in dissecting sophisticated metamorphic malware with multiple layers of obfuscation. In particular, it enables VirusTotal to detect 53 out of 56 zero-day malware samples in the wild that were previously undetectable.
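As a rough illustration of what "decloaking" layered obfuscation means, the sketch below peels nested `eval(base64_decode(...))` wrappers from a PHP-style payload and writes each recovered layer to its own file for scanning. This is a simplified, pattern-matching stand-in written in Python; Cubismo itself works by counterfactually executing the suspect PHP program rather than by regex matching, and the helper names here are hypothetical.

```python
import base64
import re

# Matches one eval(base64_decode("...")) wrapper around a base64 payload.
EVAL_B64 = re.compile(r'eval\(base64_decode\("([A-Za-z0-9+/=]+)"\)\)')

def decloak(source: str, prefix: str = "layer") -> list[str]:
    """Peel eval(base64_decode(...)) layers, saving each to its own file."""
    layers = []
    current = source
    while (match := EVAL_B64.search(current)):
        current = base64.b64decode(match.group(1)).decode("utf-8")
        path = f"{prefix}_{len(layers)}.php"
        with open(path, "w") as fh:
            fh.write(current)  # expose this layer to off-the-shelf scanners
        layers.append(path)
    return layers

# Build a doubly wrapped sample, then unwrap it layer by layer.
inner = 'system($_GET["cmd"]);'
layer1 = f'eval(base64_decode("{base64.b64encode(inner.encode()).decode()}"))'
layer2 = f'eval(base64_decode("{base64.b64encode(layer1.encode()).decode()}"))'
print(decloak(layer2))  # ['layer_0.php', 'layer_1.php']
```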
As large language models (LLMs) extend the reach of natural language processing to long inputs, rigorous and systematic analyses are necessary to understand their abilities and behavior. A salient application is summarization, due to its ubiquity and controversy (e.g., researchers have declared the death of summarization). In this paper, we use financial report summarization as a case study because financial reports are not only long but also use numbers and tables extensively. We propose a computational framework for characterizing multimodal long-form summarization and investigate the behavior of Claude 2.0/2.1, GPT-4/3.5, and Cohere. We find that GPT-3.5 and Cohere fail to perform this summarization task meaningfully. For Claude 2 and GPT-4, we analyze the extractiveness of the summaries and identify a position bias in LLMs. For Claude, this position bias disappears after shuffling the input, suggesting that it recognizes important information regardless of where it appears. We also conduct a comprehensive investigation of the use of numeric data in LLM-generated summaries and offer a taxonomy of numeric hallucination. We employ prompt engineering to improve GPT-4's use of numbers, with limited success. Overall, our analyses highlight the strong capability of Claude 2 in handling long multimodal inputs compared to GPT-4. The generated summaries and evaluation code are available at https://github.com/ChicagoHAI/characterizing-multimodal-long-form-summarization.
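One of the analyses mentioned, detecting position bias, can be approximated in a few lines. The sketch below (a hypothetical stand-in, not the paper's released evaluation code) maps each summary sentence to the most similar source sentence and records its relative position; a pile-up of positions near 0.0 would indicate a bias toward early content.

```python
from difflib import SequenceMatcher

def best_match_position(summary_sentence: str, source_sentences: list[str]) -> float:
    """Relative position (0.0 = start, 1.0 = end) of the source sentence
    most similar to the given summary sentence."""
    scores = [SequenceMatcher(None, summary_sentence.lower(), s.lower()).ratio()
              for s in source_sentences]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best / max(len(source_sentences) - 1, 1)

source = [
    "Revenue grew 12% year over year.",
    "Operating margin was flat at 31%.",
    "The board approved a $2B buyback.",
    "Headcount declined 3% in the fourth quarter.",
]
summary = ["Revenue grew 12% year over year.",
           "The board approved a $2B buyback."]
print([best_match_position(s, source) for s in summary])  # [0.0, ~0.67]
```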
Code Large Language Models (Code LLMs) are increasingly employed in real-life applications, so evaluating them is critical. While conventional accuracy evaluates the performance of Code LLMs on a set of individual tasks, their self-consistency across different tasks is overlooked. Intuitively, a trustworthy model should be self-consistent when generating natural language specifications for its own code and generating code for its own specifications. Failure to preserve self-consistency reveals a lack of understanding of the shared semantics underlying natural language and programming language, and therefore undermines the trustworthiness of a model. In this paper, we first formally define the self-consistency of Code LLMs and then design a framework, IdentityChain, which effectively and efficiently evaluates the self-consistency and conventional accuracy of a model at the same time. We study eleven Code LLMs and show that they fail to preserve self-consistency, which is indeed a distinct aspect from conventional accuracy. Furthermore, we show that IdentityChain can be used as a model-debugging tool to expose weaknesses of Code LLMs, demonstrating three major weaknesses that we identify in current models using IdentityChain. Our code is available at https://github.com/marcusm117/IdentityChain.
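The chain that gives IdentityChain its name can be sketched as a simple loop. In the hedged Python sketch below, `model.generate` is a hypothetical stand-in for a Code LLM API and each generated program is assumed to define a function `f`; the real framework at the linked repository is more careful (it also tracks conventional accuracy and uses proper test harnesses and sandboxing).

```python
def run(code: str, test_input):
    """Execute `code` (assumed to define a function f) on test_input."""
    namespace = {}
    exec(code, namespace)  # sandboxing omitted for brevity
    return namespace["f"](test_input)

def self_consistency_chain(model, code0: str, test_input, rounds: int = 3) -> bool:
    """Alternate code -> NL spec -> code; flag semantic drift against the
    original program's behavior on a fixed test input."""
    reference = run(code0, test_input)
    code = code0
    for _ in range(rounds):
        spec = model.generate(f"Describe this function:\n{code}")
        code = model.generate(f"Implement this specification:\n{spec}")
        if run(code, test_input) != reference:
            return False  # self-consistency violated
    return True
```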