NSF PAR Search | NSF Public Access Repository

The Impact of Literal Sorting on Cardinality Constraint Encodings

https://doi.org/10.1609/aaai.v39i11.33232

Reeves, Joseph E; Filipe, João; Hsu, Min-Chien; Martins, Ruben; Heule, Marijn J H (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

The effectiveness of satisfiability solvers strongly depends on the quality of the encoding of a given problem into conjunctive normal form. Cardinality constraints are prevalent in numerous problems, prompting the development and study of various types of encoding. We present a novel approach to optimizing cardinality constraint encodings by exploring the impact of literal orderings within the constraints. By strategically placing related literals near each other, the encoding generates auxiliary variables in a hierarchical structure, enabling the solver to reason more abstractly about groups of related literals. Rather than relying on conventional metrics such as formula size or propagation strength, our method leverages structural properties of the formula to redefine the roles of auxiliary variables and enhance the solver's learning capabilities. The experimental evaluation on benchmarks from the maximum satisfiability competition demonstrates that literal orderings can be more influential than the choice of the encoding type. Our literal ordering technique improves solver performance across various encoding techniques, underscoring the robustness of our approach.
Free, publicly-accessible full text available April 11, 2026
BatFix: Repairing language model-based transpilation

https://doi.org/10.1145/3658668

Ramos, Daniel; Lynce, Inês; Manquinho, Vasco; Martins, Ruben; Le Goues, Claire (July 2024, ACM Transactions on Software Engineering and Methodology)

To keep up with changes in requirements, frameworks, and coding practices, software organizations might need to migrate code from one language to another. Source-to-source migration, or transpilation, is often a complex, manual process. Transpilation requires expertise in both the source and target language, making it highly laborious and costly. Language models for code generation and transpilation are becoming increasingly popular. However, despite capturing code structure well, code generated by language models is often spurious and contains subtle problems. We propose BatFix, a novel approach that augments language models for transpilation by leveraging program repair and synthesis to fix the code generated by these models. BatFix takes as input the original program, the target program generated by the machine translation model, and a set of test cases, and outputs a repaired program that passes all test cases. Experimental results show that our approach is agnostic to language models and programming languages. BatFix can locate bugs spanning multiple lines and synthesize patches for syntax and semantic bugs for programs migrated from Java to C++ and Python to C++ by multiple language models, including OpenAI’s Codex.
Full Text Available
Reverse-Engineering Congestion Control Algorithm Behavior

https://doi.org/10.1145/3646547.3688443

Ferreira, Margarida; Ware, Ranysha; Kothari, Yash; Lynce, Inês; Martins, Ruben; Narayan, Akshay; Sherry, Justine (November 2024, ACM)

The rise of proprietary and novel congestion control algorithms (CCAs) opens questions about the future of Internet utilization, latency, and fairness. However, fully analyzing how novel CCAs impact these properties requires understanding the inner workings of these algorithms. We thus aim to reverse-engineer deployed CCAs' behavior from collected packet traces to facilitate analyzing them. We present Abagnale, a program synthesis pipeline that helps users automate the reverse-engineering task. Using Abagnale, we discover simple expressions capturing the behavior of 9 of the 16 CCAs distributed with the Linux kernel and analyze 7 CCAs from a graduate networking course.
Free, publicly-accessible full text available November 4, 2025
Towards provably performant congestion control

Agarwal, Anup; Arun, Venkat; Ray, Devdeep; Martins, Ruben; Seshan, Srinivasan (April 2024, USENIX NSDI)

Full Text Available
Large Language Models for Test-Free Fault Localization

https://doi.org/10.1145/3597503.3623342

Yang, Aidan Z H; Le Goues, Claire; Martins, Ruben; Hellendoorn, Vincent (February 2024, ACM)

Full Text Available
Large Language Models for Test-Free Fault Localization

Yang, Aidan Z.H.; Le Goues, Claire; Martins, Ruben; Hellendoorn, Vincent J. (January 2024, Proceedings International Conference Software Engineering Education Practice)

Full Text Available
MELT: Mining Effective Lightweight Transformations from Pull Requests

https://doi.org/10.1109/ASE56229.2023.00117

Ramos, Daniel; Mitchell, Hailie; Lynce, Inês; Manquinho, Vasco; Martins, Ruben; Le Goues, Claire (September 2023, 38th IEEE/ACM International Conference on Automated Software Engineering (ASE))

Software developers often struggle to update APIs, leading to manual, time-consuming, and error-prone processes. We introduce Melt, a new approach that generates lightweight API migration rules directly from pull requests in popular library repositories. Our key insight is that pull requests merged into open-source libraries are a rich source of information sufficient to mine API migration rules. By leveraging code examples mined from the library source and automatically generated code examples based on the pull requests, we infer transformation rules in Comby, a language for structural code search and replace. Since inferred rules from single code examples may be too specific, we propose a generalization procedure to make the rules more applicable to client projects. Melt rules are syntax-driven, interpretable, and easily adaptable. Moreover, unlike previous work, our approach enables rule inference to seamlessly integrate into the library workflow, removing the need to wait for client code migrations. We evaluated Melt on pull requests from four popular libraries, successfully mining 461 migration rules from code examples in pull requests and 114 rules from auto-generated code examples. Our generalization procedure increases the number of matches for mined rules by 9×. We applied these rules to client projects and ran their tests, which led to an overall decrease in the number of warnings and fixed some test cases, demonstrating Melt's effectiveness in real-world scenarios.
Automating network heuristic design and analysis

https://doi.org/10.1145/3563766.3564085

Agarwal, Anup; Arun, Venkat; Ray, Devdeep; Martins, Ruben; Seshan, Srinivasan (November 2022, HotNets '22: Proceedings of the 21st ACM Workshop on Hot Topics in Networks)

Heuristics are ubiquitous in computer systems. Examples include congestion control, adaptive bit rate streaming, scheduling, load balancing, and caching. In some domains, theoretical proofs have provided clarity on the conditions where a heuristic is guaranteed to work well. This has not been possible in all domains because proving such guarantees can involve combinatorial reasoning, which makes it hard, cumbersome, and error-prone. In this paper, we argue that computers should help humans with the combinatorial part of reasoning. We model reasoning questions as ∃∀ formulas [1] and solve them using the counterexample guided inductive synthesis (CEGIS) framework. As preliminary evidence, we prototype CCmatic, a tool that semi-automatically synthesizes congestion control algorithms that are provably robust. It rediscovered a recent congestion control algorithm that provably achieves high utilization and bounded delay under a challenging network model. It also found previously unknown variants of the algorithm that achieve different throughput-delay trade-offs.
Full Text Available
MELT: Mining Effective Lightweight Transformations from Pull Requests

Ramos, Daniel; Mitchell, Hailie; Lynce, Inês; Manquinho, Vasco; Martins, Ruben; Le Goues, Claire (January 2023, IEEE/ACM International Conference on Automated Software Engineering)

Full Text Available
Patch Generation with Language Models: Feasibility and Scaling Behavior

Kolak, Sophia D.; Martins, Ruben; Le Goues, Claire; Hellendoorn, Vincent Josua (April 2022, Deep Learning for Code Workshop)

Large language models have shown a propensity for generating correct, multi-line programs from natural language prompts. Given past findings highlighting that bugs and patches can be distinguished by predictability according to simple language models, it is natural to ask if modern, large neural options lend themselves especially well to program repair without any calibration. We study this in the context of one-line bugs: we provide a series of models of varying scales (from 160M to 12B parameters) with the context preceding a buggy line in 72 Java and Python programs, and analyze the rank at which the correct patch (and original buggy line) is generated, if at all. Our results highlight a noticeable correlation of model size with test-passing accuracy and patch ranking quality, as well as several other findings related to the differences between the two languages and the propensity, especially for the largest models, to generate candidate patches that closely resemble (if not exactly match) the original developer patch.
Full Text Available
