NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Inferring Visualization Intent from Conversation

Li, Haotian; et, al (October 2024, Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM ’24),)

Full Text Available
Dealing with Acronyms, Abbreviations, and Typos in Real-World Entity Matching

Wu, Joshua; et, al (August 2024, VLDB)

Full Text Available
Equivalence by Canonicalization for Synthesis-Backed Refactoring

https://doi.org/10.1145/3656453

Lubin, Justin; Ferguson, Jeremy; Ye, Kevin; Yim, Jacob; Chasins, Sarah_E (June 2024, Proceedings of the ACM on Programming Languages)

We present an enumerative program synthesis framework calledcomponent-based refactoringthat can refactor “direct” style code that does not use library components into equivalent “combinator” style code that does use library components. This framework introduces a sound but incomplete technique to check the equivalence of direct code and combinator code calledequivalence by canonicalizationthat does not rely on input-output examples or logical specifications. Moreover, our approach can repurpose existing compiler optimizations, leveraging decades of research from the programming languages community. We instantiated our new synthesis framework in two contexts: (i) higher-order functional combinators such asmapandfilterin the staticallytyped functional programming language Elm and (ii) high-performance numerical computing combinators provided by the NumPy library for Python. We implemented both instantiations in a tool calledCobblerand evaluated it on thousands of real programs to test the performance of the component-based refactoring framework in terms of execution time and output quality. Our work offers evidence that synthesis-backed refactoring can apply across a range of domains without specification beyond the input program.
more » « less
Low-Resourced Languages and Online Knowledge Repositories: A Need-Finding Study.

https://doi.org/10.1145/3613904.3642605

Nigatu, Hellina Hailu; Canny, John; Chasins, Sarah E (May 2024, ACM)

Full Text Available
How Domain Experts Use an Embedded DSL

https://doi.org/10.1145/3622851

Rennels, Lisa; Chasins, Sarah E (October 2023, Proceedings of the ACM on Programming Languages)

Programming tools are increasingly integral to research and analysis in myriad domains, including specialized areas with no formal relation to computer science. Embedded domain-specific languages (eDSLs) have the potential to serve these programmers while placing relatively light implementation burdens on language designers. However, barriers to eDSL use reduce their practical value and adoption. In this paper, we aim to deepen our understanding of how programmers use eDSLs and identify user needs to inform future eDSL designs. We performed a contextual inquiry (9 participants) with domain experts using Mimi, an eDSL for climate change economics modeling. A thematic analysis identified five key themes, including: the interaction between the eDSL and the host language has significant and sometimes unexpected impacts on eDSL user experience, and users preferentially engage with domain-specific communities and code templates rather than host language resources. The needs uncovered in our study offer design considerations for future eDSLs and suggest directions for future DSL usability research.
more » « less
Full Text Available
Co-Designing for Transparency: Lessons from Building a Document Organization Tool in the Criminal Justice Domain

https://doi.org/10.1145/3593013.3594093

Nigatu, Hellina Hailu; Pickoff-White, Lisa; Canny, John; Chasins, Sarah (June 2023, ACM)

Full Text Available
A Need-Finding Study with Users of Geospatial Data

https://doi.org/10.1145/3544548.3581370

Ziegler, Parker; Chasins, Sarah E. (April 2023, ACM)

Full Text Available
Exploring the Learnability of Program Synthesizers by Novice Programmers

https://doi.org/10.1145/3526113.3545659

Dhanya Jayagopal; Justin Lubin; Sarah Chasins (October 2022, user interface software and technology (UIST))

Full Text Available
Trial by File Formats: Exploring Public Defenders' Challenges Working with Novel Surveillance Data

https://doi.org/10.1145/3512914

Warren, Rachel B.; Salehi, Niloufar (March 2022, Proceedings of the ACM on Human-Computer Interaction)

In the United States, public defenders (lawyers assigned to people accused of crimes who cannot afford a private attorney) serve as an essential bulwark against wrongful arrest and incarceration for low-income and marginalized people. Public defenders have long been overworked and under-resourced. However, these issues have been compounded by increases in the volume and complexity of data in modern criminal cases. We explore the technology needs of public defenders through a series of semi-structured interviews with public defenders and those who work with them. We find that public defenders' ability to reason about novel surveillance data is woefully inadequate not only due to a lack of resources and knowledge, but also due to the structure of the criminal justice system, which gives prosecutors and police (in partnership with private companies) more control over the type of information used in criminal cases than defense attorneys. We find that public defenders may be able to create fairer situations for their clients with better tools for data interpretation and access. Therefore, we call on technologists to attend to the needs of public defenders and the people they represent when designing systems that collect data about people. Our findings illuminate constraints that technologists and privacy advocates should consider as they pursue solutions. In particular, our work complicates notions of individual privacy as the only value in protecting users' rights, and demonstrates the importance of data interpretation alongside data visibility. As data sources become more complex, control over the data cannot be separated from access to the experts and technology to make sense of that data. The growing surveillance data ecosystem may systematically oppress not only those who are most closely observed, but groups of people whose communities and advocates have been deprived of the storytelling power over their information.
more » « less
Full Text Available

Search for: All records