libLISA: Instruction Discovery and Analysis on x86-64

Craaijo, Jos; Verbeek, Freek; Ravindran, Binoy

doi:10.1145/3689723

Even though heavily researched, a full formal model of the x86-64 instruction set is still not available. We present libLISA, a tool for automated discovery and analysis of the ISA of a CPU. This produces the most extensive formal x86-64 model to date, with over 118000 different instruction groups. The process requires as little human specification as possible: specifically, we do not rely on a human-written (dis)assembler to dictate which instructions are executable on a given CPU, or what their in- and outputs are. The generated model is CPU-specific: behavior that is undefined is synthesized for the current machine. Producing models for five different x86-64 machines, we mutually compare them, discover undocumented instructions, and generate instruction sequences that are CPU-specific. Experimental evaluation shows that we enumerate virtually all instructions within scope, that the instructions' semantics are correct w.r.t. existing work, and that we improve existing work by exposing bugs in their handwritten models.

Award ID(s): 2234257

PAR ID: 10587434

Author(s) / Creator(s): Craaijo, Jos; Verbeek, Freek; Ravindran, Binoy

Publisher / Repository: Association for Computing Machinery

Date Published: 2024-10-08

Journal Name: Proceedings of the ACM on Programming Languages

Volume: 8

Issue: OOPSLA2

ISSN: 2475-1421

Page Range / eLocation ID: 333 to 361

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Journal Article: https://doi.org/10.1145/3689723
