This content will become publicly available on March 30, 2026

Title: Verifying Semantic Equivalence of Large Models with Equality Saturation
Modern machine learning frameworks support very large models by incorporating parallelism and optimization techniques. Yet, these very techniques add new layers of complexity in ensuring the correctness of the computation. An incorrect implementation of these techniques might lead to compile-time or runtime errors that can easily be observed and fixed, but it might also lead to silent errors that will result in incorrect computations in training or inference, which do not exhibit any obvious symptom until the model is used later. These subtle errors not only waste computation resources, but involve significant developer effort to detect and diagnose. In this work, we propose Aerify, a framework to automatically expose silent errors by verifying semantic equivalence of models with equality saturation. Aerify constructs equivalence graphs (e-graphs) from intermediate representations of tensor programs, and incrementally applies rewriting rules---derived from generic templates and refined via domain-specific analysis---to prove or disprove equivalence at scale. When discrepancies remain unproven, Aerify pinpoints the corresponding graph segments and maps them back to source code, simplifying debugging and reducing developer overhead. Our preliminary results show strong potentials of Aerify in detecting real-world silent errors.  more » « less
Award ID(s):
2441284
PAR ID:
10659229
Author(s) / Creator(s):
Publisher / Repository:
ACM
Date Published:
Page Range / eLocation ID:
82 to 89
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
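
To make the equality-saturation approach concrete, the following is a minimal Python sketch of an e-graph with a single hand-coded rewrite rule (distributing matmul over add). It is a toy illustration of the mechanism the abstract describes, not Aerify's implementation; the operator names and the rule are assumptions chosen for the example.

```python
# Toy e-graph in the spirit of equality saturation (illustrative only).
# E-nodes are (op, child_e-class_ids); a union-find merges e-classes
# that rewrite rules prove equal.

class EGraph:
    def __init__(self):
        self.parent = []        # union-find over e-class ids
        self.hashcons = {}      # canonical e-node -> e-class id

    def find(self, c):
        while self.parent[c] != c:
            self.parent[c] = self.parent[self.parent[c]]
            c = self.parent[c]
        return c

    def canon(self, op, kids):
        return (op, tuple(self.find(k) for k in kids))

    def add(self, op, *kids):
        node = self.canon(op, kids)
        if node not in self.hashcons:
            cid = len(self.parent)
            self.parent.append(cid)
            self.hashcons[node] = cid
        return self.find(self.hashcons[node])

    def union(self, a, b):
        a, b = self.find(a), self.find(b)
        if a != b:
            self.parent[b] = a

    def rebuild(self):
        # Re-canonicalize e-nodes so congruent ones collapse together.
        old, self.hashcons = self.hashcons, {}
        for (op, kids), cid in old.items():
            node = self.canon(op, kids)
            if node in self.hashcons:
                self.union(self.hashcons[node], cid)
            else:
                self.hashcons[node] = self.find(cid)

def saturate(eg, steps=5):
    # One assumed rule: matmul(a, add(b, c)) == add(matmul(a, b), matmul(a, c))
    for _ in range(steps):
        for (op, kids), cid in list(eg.hashcons.items()):
            if op != "matmul":
                continue
            a, bc = kids
            for (op2, kids2), cid2 in list(eg.hashcons.items()):
                if op2 == "add" and eg.find(cid2) == eg.find(bc):
                    rhs = eg.add("add",
                                 eg.add("matmul", a, kids2[0]),
                                 eg.add("matmul", a, kids2[1]))
                    eg.union(cid, rhs)
        eg.rebuild()

eg = EGraph()
A, B, C = eg.add("A"), eg.add("B"), eg.add("C")
lhs = eg.add("matmul", A, eg.add("add", B, C))
rhs = eg.add("add", eg.add("matmul", A, B), eg.add("matmul", A, C))
saturate(eg)
print("equivalent:", eg.find(lhs) == eg.find(rhs))  # True
```

Once saturation places the distributed form in the same e-class as the original expression, a single union-find lookup answers the equivalence query; Aerify's contribution lies in scaling this idea to full tensor-program IRs and in mapping unproven segments back to source code.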
More Like this
  1. FPGAs are being used in large numbers within cloud computing to provide high-performance, low-power alternatives to more traditional computing structures. While FPGAs provide a number of important benefits to cloud computing environments, they are susceptible to radiation-induced soft errors, which can lead to silent data corruption or system instability. Although soft errors within a single FPGA occur infrequently, soft errors in large-scale FPGA systems can occur at a relatively high rate. This paper investigates the failure rate of several FPGA applications running within an FPGA cloud computing node by performing fault injection experiments to determine the susceptibility of these applications to soft errors. The results from these experiments suggest that silent data corruption will occur every few hours within a 100,000-node FPGA system and that such a system can only maintain high levels of reliability for short periods of operation. These results suggest that soft-error detection and mitigation techniques may be needed in large-scale FPGA systems.
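
The scaling argument in this abstract is simple enough to work through. A back-of-the-envelope sketch, using a made-up per-device rate rather than any figure from the paper:

```python
# Illustrative scaling of soft-error rates to a large FPGA fleet.
# The per-device rate below is a placeholder, not a number from the
# paper; the point is that rare per-device events become frequent
# at fleet scale.

per_device_sdc_per_hour = 1e-6      # assumed: ~1 SDC per 114 device-years
nodes = 100_000

fleet_rate = per_device_sdc_per_hour * nodes   # SDCs per hour, fleet-wide
mtbf_hours = 1 / fleet_rate                    # mean time between SDCs

print(f"fleet-wide SDC rate: {fleet_rate:.2f}/hour")      # 0.10/hour
print(f"mean time between SDCs: {mtbf_hours:.0f} hours")  # ~10 hours
```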
  2. Optimizing compilers, such as LLVM, generate debug information in machine code to aid debugging. This information is particularly important when debugging optimized code, as modern software is often compiled with optimization enabled. However, properly updating debug information to reflect code transformations during optimization is a complex task that often relies on manual effort. This complexity makes the process prone to errors, which can lead to incorrect or lost debug information. Finding and fixing potential debug information update errors is vital to maintaining the accuracy and reliability of the overall debugging process. To our knowledge, no existing techniques can rectify debug information update errors in LLVM. While black-box testing approaches can find such bugs, they can neither pinpoint the root causes nor suggest fixes. To fill the gap, we propose the first technique to robustify debug information updates in LLVM. In particular, our robustification approach can find and fix incorrect debug location updates. Central to our approach is the observation that the debug locations in the original and optimized programs must satisfy a conformance relation. The relation ensures that LLVM optimizations do not introduce extraneous debug location information on the control-flow paths of the optimized programs. We introduce control-flow conformance analysis, a novel approach that determines the reference updates ensuring the conformance relation by observing the execution of LLVM optimization passes and analyzing the debug locations in the control-flow graphs of programs under optimization. The determined reference updates are then used to check developer-written updates in LLVM. When discrepancies arise, the reference updates serve as the update skeletons to guide the fixing. We realized our approach as a tool named MetaLoc, which determines proper debug location updates for LLVM optimizations. More importantly, with MetaLoc, we have reported and patched 46 previously unknown update errors in LLVM. All the patches, along with 22 new regression tests, have been merged into the LLVM codebase, effectively improving the accuracy and reliability of debug information in all programs optimized by LLVM. Furthermore, our approach uncovered and led to corrections in two issues within LLVM’s official documentation on debug information updates.
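
As a rough illustration of the conformance idea (not MetaLoc's actual analysis, which operates on LLVM IR and real optimization passes), one can model a control-flow path as a sequence of (block, debug-line) pairs and flag lines that the optimized path reports but the original path never reaches:

```python
# Simplified model of the conformance check: an optimization must not
# introduce debug locations on a control-flow path that the original
# program's corresponding path never visited. Block names and line
# numbers are invented for the example.

def extraneous_locations(original_path, optimized_path):
    """Debug lines on the optimized path that never occur on the original."""
    allowed = {line for _block, line in original_path}
    return [(block, line) for block, line in optimized_path
            if line not in allowed]

orig = [("entry", 10), ("body", 11), ("exit", 12)]
# Suppose a pass moved code but wrongly attached a line-14 location:
opt  = [("entry", 10), ("body", 14), ("exit", 12)]

print(extraneous_locations(orig, opt))  # [('body', 14)] -> flag and fix
```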
  3. Continuous Integration (CI) allows developers to check whether their code can build successfully and pass tests across various system environments with every commit. To use a CI platform, a developer must provide configuration files within a code repository to specify build conditions. Incorrect configuration settings lead to CI build failures, which can take hours to run, wasting valuable developer time and delaying product release dates. Debugging CI configurations is a slow and error-prone process. The only way to check the correctness of CI configurations is to push a commit and wait for the build result. We present VeriCI, the first system for localizing CI configuration errors at the code level. VeriCI runs as a static analysis tool before the developer sends the build request to the CI server. Our key insight is that the commit history and the corresponding build histories available in CI environments can be used both for build error prediction and for build error localization. We leverage the build history as a labeled dataset to automatically derive customized rules describing correct CI configurations, using supervised machine learning techniques. To more accurately identify root causes, we train a neural network that filters out constraints that are less likely to be connected to the root cause of a build failure. We evaluate VeriCI on real-world data from GitHub, achieving 91% accuracy in predicting build failures and correctly identifying the root cause in 75% of cases. We also conducted a between-subjects user study with 20 software developers, showing that VeriCI significantly helps users identify and fix errors in CI.
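
The supervised-learning framing is the key idea here. A minimal sketch, assuming hypothetical per-commit configuration features and using an off-the-shelf decision tree in place of VeriCI's rule derivation and neural filtering:

```python
# Toy version of the idea: treat each commit's CI configuration as a
# feature vector, label it with its historical build outcome, and learn
# a predictor that flags likely failures before pushing. VeriCI derives
# richer, human-readable rules; this only shows the framing.

from sklearn.tree import DecisionTreeClassifier

# Hypothetical features per commit: [uses_cache, matrix_size, has_lang_key]
history_X = [[1, 2, 1], [0, 2, 1], [1, 6, 1], [0, 6, 0], [1, 2, 0]]
history_y = [0, 0, 1, 1, 1]          # 1 = build failed

clf = DecisionTreeClassifier(max_depth=2).fit(history_X, history_y)

new_commit = [[0, 6, 1]]             # a commit about to be pushed
print("predicted failure:", bool(clf.predict(new_commit)[0]))
```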
  4. The recent emergence of Medical Large Vision Language Models (Med-LVLMs) has enhanced medical diagnosis. However, current Med-LVLMs frequently encounter factual issues, often generating responses that do not align with established medical facts. Retrieval-Augmented Generation (RAG), which utilizes external knowledge, can improve the factual accuracy of these models but introduces two major challenges. First, limited retrieved contexts might not cover all necessary information, while excessive retrieval can introduce irrelevant and inaccurate references, interfering with the model’s generation. Second, in cases where the model originally responds correctly, applying RAG can lead to an over-reliance on retrieved contexts, resulting in incorrect answers. To address these issues, we propose RULE, which consists of two components. First, we introduce a provably effective strategy for controlling factuality risk through the calibrated selection of the number of retrieved contexts. Second, based on samples where over-reliance on retrieved contexts led to errors, we curate a preference dataset to fine-tune the model, balancing its dependence on inherent knowledge and retrieved contexts for generation. We demonstrate the effectiveness of RULE on three medical VQA datasets, achieving an average improvement of 20.8% in factual accuracy.
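
The first component, calibrated selection of the retrieval count, can be sketched as picking the smallest number of contexts whose measured factual-error rate on a calibration set stays under a target risk level. The error rates below are invented for illustration; the paper's actual strategy carries a provable risk guarantee.

```python
# Sketch of the calibration step: on a held-out set, measure the
# factual-error rate as a function of k (number of retrieved contexts)
# and pick the smallest k whose empirical risk is at most alpha.

def calibrate_k(error_rate_by_k, alpha):
    """Return the smallest k with empirical factual-error rate <= alpha."""
    for k in sorted(error_rate_by_k):
        if error_rate_by_k[k] <= alpha:
            return k
    return max(error_rate_by_k)       # fall back to the largest k tried

error_rate_by_k = {1: 0.31, 2: 0.24, 4: 0.18, 8: 0.21}  # made-up numbers
print(calibrate_k(error_rate_by_k, alpha=0.20))          # -> 4
```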
  5. State-of-the-art Text-to-SQL models rely on fine-tuning or few-shot prompting to help LLMs learn from training datasets containing mappings from natural language (NL) queries to SQL statements. Consequently, the quality of the dataset can greatly affect the accuracy of these Text-to-SQL models. Unlike other NL tasks, Text-to-SQL datasets are prone to errors despite extensive manual effort, owing to the subtle semantics of SQL. Our study found that a non-negligible portion (>30%) of the NL-to-SQL mappings in the popular Spider and BIRD datasets are incorrect. This paper aims to improve the quality of Text-to-SQL training datasets and thereby increase the accuracy of the resulting models. To do so, we propose a necessary correctness condition called execution consistency. For a given database instance, an NL-to-SQL mapping satisfies execution consistency if the execution result of the NL query matches that of the corresponding SQL. We develop SQLDriller to detect incorrect NL-to-SQL mappings based on execution consistency in a best-effort manner by crafting database instances that are likely to violate execution consistency. It generates multiple candidate SQL predictions that differ in their syntactic structure. Using a SQL equivalence checker, SQLDriller obtains counterexample database instances that distinguish non-equivalent candidate SQLs. It then checks the execution consistency of an NL-to-SQL mapping under this set of counterexamples. The evaluation shows that SQLDriller effectively detects and fixes incorrect mappings in Text-to-SQL datasets and improves model accuracy by up to 13.6%.
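
Execution consistency itself is easy to state as a check. A minimal sketch with an invented schema and queries, using SQLite: a mapping is flagged when the gold SQL and a syntactically different candidate disagree on a crafted database instance.

```python
# Execution consistency, stripped to its core: on a crafted database
# instance, two SQL renderings of the same NL query must return the
# same result. Schema, data, and queries are invented examples.

import sqlite3

def results(db, sql):
    return sorted(db.execute(sql).fetchall())

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE emp (name TEXT, dept TEXT, salary INT)")
# A counterexample-style instance: duplicate salaries expose DISTINCT bugs.
db.executemany("INSERT INTO emp VALUES (?,?,?)",
               [("a", "cs", 100), ("b", "cs", 100), ("c", "ee", 90)])

gold      = "SELECT DISTINCT salary FROM emp WHERE dept = 'cs'"
candidate = "SELECT salary FROM emp WHERE dept = 'cs'"   # misses DISTINCT

# The NL-to-SQL mapping is flagged when execution results diverge:
print("consistent:", results(db, gold) == results(db, candidate))  # False
```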