NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Lightweight Diagramming for Lightweight Formal Methods: A Grounded Language Design

https://doi.org/10.4230/lipics.ecoop.2025.26

Prasad, Siddhartha; Greenman, Ben; Nelson, Tim; Krishnamurthi, Shriram (January 2025, Schloss Dagstuhl – Leibniz-Zentrum für Informatik)
Aldrich, Jonathan; Silva, Alexandra (Ed.)
Tools such as Alloy enable users to incrementally define, explore, verify, and diagnose specifications for complex systems. A critical component of these tools is a visualizer that lets users graphically explore generated models. As we show, however, a default visualizer that knows nothing about the domain can be unhelpful and can even actively violate presentational and cognitive principles. At the other extreme, full-blown custom visualization requires significant effort as well as knowledge that a tool user might not possess. Custom visualizations can also exhibit bad (even silent) failures. This paper charts a middle ground between the extremes of default and fully-customizable visualization. We capture essential domain information for lightweight diagramming, embodying this in a language. To identify key elements of lightweight diagrams, we ground the language design in both the cognitive science research on diagrams and in a corpus of 58 custom visualizations. We distill from these sources a small set of orthogonal primitives, and use the primitives to guide a diagramming language called Cope-and-Drag (CnD). We evaluate it on sample tasks, three user studies, and performance, and find that short CnD specifications consistently improve model comprehension over the Alloy default. CnD thus defines a new point in the design space of diagramming: a language that is lightweight, effective, and driven by sound principles.
more » « less
Full Text Available
Misconceptions in Finite-Trace and Infinite-Trace Linear Temporal Logic

Greenman, Ben; Prasad, Siddhartha; Di_Stasio, Antonio; Zhu, Shufang; De_Giacomo, Giuseppe; Krishnamurthi, Shriram; Montali, Marco; Nelson, Tim; Zizyte, Milda (September 2024, Springer)

Full Text Available
Privacy-Respecting Type Error Telemetry at Scale

https://doi.org/10.22152/programming-journal.org/2024/8/12

Greenman, Ben; Jeffrey, Alan; Krishnamurthi, Shriram; Shah, Mitesh (March 2024, The Art, Science, and Engineering of Programming)

Full Text Available
Conceptual Mutation Testing for Student Programming Misconceptions

https://doi.org/10.22152/programming-journal.org/2024/8/7

Prasad, Siddhartha; Greenman, Ben; Nelson, Tim; Krishnamurthi, Shriram (December 2023, The Art, Science, and Engineering of Programming)

Full Text Available
Generating Programs Trivially: Student Use of Large Language Models

https://doi.org/10.1145/3576882.3617921

Prasad, Siddhartha; Greenman, Ben; Nelson, Tim; Krishnamurthi, Shriram (December 2023, ACM)

Full Text Available
Misconceptions in Finite-Trace and Infinite-Trace Linear Temporal Logic

https://doi.org/10.1007/978-3-031-71162-6_30

Greenman, Ben; Prasad, Siddhartha; Di_Stasio, Antonio; Zhu, Shufang; De_Giacomo, Giuseppe; Krishnamurthi, Shriram; Montali, Marco; Nelson, Tim; Zizyte, Milda (September 2024, Springer Nature Switzerland)

Abstract With the growing use of temporal logics in areas ranging from robot planning to runtime verification, it is critical that users have a clear understanding of what a specification means. Toward this end, we have been developing a catalog of semantic errors and a suite of test instruments targeting various user-groups. The catalog is of interest to educators, to logic designers, to formula authors, and to tool builders, e.g., to identify mistakes. The test instruments are suitable for classroom teaching or self-study. This paper reports on five sets of survey data collected over a three-year span. We study misconceptions about finite-trace$$\textsc {ltl}_{f}$$ ${L T L}_{f}$ in threeltl-aware audiences, and misconceptions about standardltlin novices. We find several mistakes, even among experts. In addition, the data supports several categories of errors in both$$\textsc {ltl}_{f}$$ ${L T L}_{f}$ andltlthat have not been identified in prior work. These findings, based on data from actual users, offer insights into whatspecific waystemporal logics are tricky and provide a groundwork for future interventions.
more » « less
Full Text Available
Forge: A Tool and Language for Teaching Formal Methods

https://doi.org/10.1145/3649833

Nelson, Tim; Greenman, Ben; Prasad, Siddhartha; Dyer, Tristan; Bove, Ethan; Chen, Qianfan; Cutting, Charles; Del_Vecchio, Thomas; LeVine, Sidney; Rudner, Julianne; et al (April 2024, Proceedings of the ACM on Programming Languages)

This paper presents the design ofForge, a tool for teaching formal methods gradually. Forge is based on the widely-used Alloy language and analysis tool, but contains numerous improvements based on more than a decade of experience teaching Alloy to students. Although our focus has been on the classroom, many of the ideas in Forge likely also apply to training in industry. Forge offers aprogression of languagesthat improve the learning experience by only gradually increasing in expressive power. Forge supportscustom visualizationof its outputs, enabling the use of widely-understood domain-specific representations. Finally, Forge provides a variety oftesting featuresto ease the transition from programming to formal modeling. We present the motivation for and design of these aspects of Forge, and then provide a substantial evaluation based on multiple years of classroom use.
more » « less
Full Text Available
Typed–Untyped Interactions: A Comparative Analysis

https://doi.org/10.1145/3579833

Greenman, Ben; Dimoulas, Christos; Felleisen, Matthias (March 2023, ACM Transactions on Programming Languages and Systems)

The literature presents many strategies for enforcing the integrity of types when typed code interacts with untyped code. This article presents a uniform evaluation framework that characterizes the differences among some major existing semantics for typed–untyped interaction. Type system designers can use this framework to analyze the guarantees of their own dynamic semantics.
more » « less
Full Text Available
How Profilers Can Help Navigate Type Migration

https://doi.org/10.1145/3622817

Greenman, Ben; Felleisen, Matthias; Dimoulas, Christos (October 2023, Proceedings of the ACM on Programming Languages)

Sound migratory typing envisions a safe and smooth refactoring of untyped code bases to typed ones. However, the cost of enforcing safety with run-time checks is often prohibitively high, thus performance regressions are a likely occurrence. Additional types can often recover performance, but choosing the right components to type is difficult because of the exponential size of the migratory typing lattice. In principal though, migration could be guided by off-the-shelf profiling tools. To examine this hypothesis, this paper follows the rational programmer method and reports on the results of an experiment on tens of thousands of performance-debugging scenarios via seventeen strategies for turning profiler output into an actionable next step. The most effective strategy is the use of deep types to eliminate the most costly boundaries between typed and untyped components; this strategy succeeds in more than 50% of scenarios if two performance degradations are tolerable along the way.
more » « less
Little Tricky Logic: Misconceptions in the Understanding of LTL

https://doi.org/10.22152/programming-journal.org/2023/7/7

Greenman, Ben; Saarinen, Sam; Nelson, Tim; Krishnamurthi, Shriram (November 2022, The Art, Science, and Engineering of Programming)

Context Linear Temporal Logic (LTL) has been used widely in verification. Its importance and popularity have only grown with the revival of temporal logic synthesis, and with new uses of LTL in robotics and planning activities. All these uses demand that the user have a clear understanding of what an LTL specification means. Inquiry Despite the growing use of LTL, no studies have investigated the misconceptions users actually have in understanding LTL formulas. This paper addresses the gap with a first study of LTL misconceptions. Approach We study researchers’ and learners’ understanding of LTL in four rounds (three written surveys, one talk-aloud) spread across a two-year timeframe. Concretely, we decompose “understanding LTL” into three questions. A person reading a spec needs to understand what it is saying, so we study the mapping from LTL to English. A person writing a spec needs to go in the other direction, so we study English to LTL. However, misconceptions could arise from two sources: a misunderstanding of LTL’s syntax or of its underlying semantics. Therefore, we also study the relationship between formulas and specific traces. Knowledge We find several misconceptions that have consequences for learners, tool builders, and designers of new property languages. These findings are already resulting in changes to the Alloy modeling language. We also find that the English to LTL direction was the most common source of errors; unfortunately, this is the critical “authoring” direction in which a subtle mistake can lead to a faulty system. We contribute study instruments that are useful for training learners (whether academic or industrial) who are getting acquainted with LTL, and we provide a code book to assist in the analysis of responses to similar-style questions. Grounding Our findings are grounded in the responses to our survey rounds. Round 1 used Quizius to identify misconceptions among learners in a way that reduces the threat of expert blind spots. Rounds 2 and 3 confirm that both additional learners and researchers (who work in formal methods, robotics, and related fields) make similar errors. Round 4 adds deep support for our misconceptions via talk-aloud surveys. Importance This work provides useful answers to two critical but unexplored questions: in what ways is LTL tricky and what can be done about it? Our survey instruments can serve as a starting point for other studies.
more » « less
Full Text Available

« Prev Next »

Search for: All records