NSF PAR Search | NSF Public Access Repository

SatLM: Satisfiability-Aided Language Models Using Declarative Prompting

Ye, Xi; Chen, Qiaochu; Dillig, Isil; Durrett, Greg (December 2023, Advances in neural information processing systems)

Prior work has combined chain-of-thought prompting in large language models (LLMs) with programmatic representations to perform effective and transparent reasoning. While such an approach works well for tasks that only require forward reasoning (e.g., straightforward arithmetic), it is less effective for constraint-solving problems that require more sophisticated planning and search. In this paper, we propose a new satisfiability-aided language modeling (SatLM) approach for improving the reasoning capabilities of LLMs. We use an LLM to generate a declarative task specification rather than an imperative program and leverage an off-the-shelf automated theorem prover to derive the final answer. This approach has two key advantages. The declarative specification is closer to the problem description than the reasoning steps are, so the LLM can parse it out of the description more accurately. Furthermore, by offloading the actual reasoning task to an automated theorem prover, our approach can guarantee the correctness of the answer with respect to the parsed specification and avoid planning errors in the solving process. We evaluate SatLM on 8 different datasets and show that it consistently outperforms program-aided LMs in the imperative paradigm. In particular, SatLM outperforms program-aided LMs by 23% on a challenging subset of the GSM arithmetic reasoning dataset; SatLM also achieves a new state of the art on LSAT and BoardgameQA, surpassing previous models that are trained on the respective training sets.
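As a rough illustration of the declarative idea in the abstract above, the sketch below states a small word problem as constraints and leaves the search for an answer to a generic procedure. It is not the authors' implementation: the brute-force `solve` function is a hypothetical stand-in for an off-the-shelf solver, and the word problem, variable names, and domain are invented for illustration.

```python
from itertools import product

# Hypothetical word problem (not from the paper):
# "Alice and Bob have 10 apples in total. Alice has 4 more than Bob."
# Declarative specification: state what must hold, not how to compute it.
VARIABLES = ("alice", "bob")
CONSTRAINTS = [
    lambda v: v["alice"] + v["bob"] == 10,
    lambda v: v["alice"] == v["bob"] + 4,
]

def solve(variables, constraints, domain=range(0, 21)):
    """Brute-force stand-in for a real solver: return every assignment
    over `domain` that satisfies all constraints."""
    solutions = []
    for values in product(domain, repeat=len(variables)):
        assignment = dict(zip(variables, values))
        if all(check(assignment) for check in constraints):
            solutions.append(assignment)
    return solutions

solutions = solve(VARIABLES, CONSTRAINTS)
print(solutions)
```

The specification mirrors the two sentences of the problem one-to-one, and no solving order is written down; any answer returned is correct with respect to the constraints as stated.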
Data Extraction via Semantic Regular Expression Synthesis

https://doi.org/10.1145/3622863

Chen, Qiaochu; Banerjee, Arko; Demiralp, Çağatay; Durrett, Greg; Dillig, Işıl (October 2023, Proceedings of the ACM on Programming Languages)

Many data extraction tasks of practical relevance require not only syntactic pattern matching but also semantic reasoning about the content of the underlying text. While regular expressions are very well suited for tasks that require only syntactic pattern matching, they fall short for data extraction tasks that involve both a syntactic and semantic component. To address this issue, we introduce semantic regexes, a generalization of regular expressions that facilitates combined syntactic and semantic reasoning about textual data. We also propose a novel learning algorithm that can synthesize semantic regexes from a small number of positive and negative examples. Our proposed learning algorithm uses a combination of neural sketch generation and compositional type-directed synthesis for fast and effective generalization from a small number of examples. We have implemented these ideas in a new tool called Smore and evaluated it on representative data extraction tasks involving several textual datasets. Our evaluation shows that semantic regexes can better support complex data extraction tasks than standard regular expressions and that our learning algorithm significantly outperforms existing tools, including state-of-the-art neural networks and program synthesis tools.
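A minimal sketch of combining a syntactic pattern with a semantic check, in the spirit of the abstract above. It does not reproduce the semantic regex language or the Smore tool: the fixed `KNOWN_CITIES` set is a hypothetical stand-in for whatever semantic reasoning component is actually used, and all names are invented for illustration.

```python
import re

# Hypothetical stand-in for a semantic oracle: a fixed set of known city names.
KNOWN_CITIES = {"Austin", "Boston", "Seattle"}

def is_city(token):
    """Semantic check (illustrative only): is the token a known city?"""
    return token in KNOWN_CITIES

def extract_cities(text):
    """Syntactic step: find capitalized words. Semantic step: keep only cities."""
    candidates = re.findall(r"\b[A-Z][a-z]+\b", text)
    return [word for word in candidates if is_city(word)]

cities = extract_cities("Maria moved from Austin to Seattle in March.")
print(cities)
```

A plain regular expression cannot separate "Austin" from "Maria" or "March", since all three have the same shape; the semantic predicate is what filters them.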
Web question answering with neurosymbolic program synthesis

https://doi.org/10.1145/3453483.3454047

Chen, Qiaochu; Lamoreaux, Aaron; Wang, Xinyu; Durrett, Greg; Bastani, Osbert; Dillig, Isil (June 2021, ACM 2021)

Optimal Neural Program Synthesis from Multimodal Specifications

https://doi.org/10.18653/v1/2021.findings-emnlp.146

Ye, Xi; Chen, Qiaochu; Dillig, Isil; Durrett, Greg (January 2021, Findings of the Association for Computational Linguistics: EMNLP 2021)

Sketch-Driven Regular Expression Generation from Natural Language and Examples

https://doi.org/10.1162/tacl_a_00339

Ye, Xi; Chen, Qiaochu; Wang, Xinyu; Dillig, Isil; Durrett, Greg (December 2020, Transactions of the Association for Computational Linguistics)

Recent systems for converting natural language descriptions into regular expressions (regexes) have achieved some success, but typically deal with short, formulaic text and can only produce simple regexes. Real-world regexes are complex, hard to describe with brief sentences, and sometimes require examples to fully convey the user’s intent. We present a framework for regex synthesis in this setting where both natural language (NL) and examples are available. First, a semantic parser (either grammar-based or neural) maps the natural language description into an intermediate sketch, which is an incomplete regex containing holes to denote missing components. Then a program synthesizer searches over the regex space defined by the sketch and finds a regex that is consistent with the given string examples. Our semantic parser can be trained purely from weak supervision based on correctness of the synthesized regex, or it can leverage heuristically derived sketches. We evaluate on two prior datasets (Kushman and Barzilay 2013; Locascio et al. 2016) and a real-world dataset from Stack Overflow. Our system achieves state-of-the-art performance on the prior datasets and solves 57% of the real-world dataset, which existing neural systems completely fail on.
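The second stage described above, searching the space defined by a sketch for a regex consistent with the examples, can be sketched roughly as follows. This is a toy enumeration under stated assumptions, not the paper's synthesizer: the sketch format (`{0}` and `{1}` as holes), the candidate components, and the example strings are all hypothetical.

```python
import re
from itertools import product

# Hypothetical sketch: "{0}" and "{1}" are holes standing for missing components.
SKETCH = "{0}-{1}"
# Hypothetical candidate components the search may place in a hole.
COMPONENTS = [r"[a-z]+", r"[A-Z]+", r"\d+", r"\d{3}"]

POSITIVE = ["AB-123", "XYZ-007"]
NEGATIVE = ["ab-123", "AB-12", "AB-1234"]

def consistent(pattern, positive, negative):
    """True if the pattern fully matches every positive and no negative string."""
    regex = re.compile(pattern)
    return (all(regex.fullmatch(s) for s in positive)
            and not any(regex.fullmatch(s) for s in negative))

def synthesize(sketch, components, positive, negative, holes=2):
    """Enumerate hole fillings in order and return the first consistent regex."""
    for filling in product(components, repeat=holes):
        candidate = sketch.format(*filling)
        if consistent(candidate, positive, negative):
            return candidate
    return None

result = synthesize(SKETCH, COMPONENTS, POSITIVE, NEGATIVE)
print(result)
```

The negative examples do real work here: without "AB-12" and "AB-1234", the looser candidate with `\d+` in the second hole would be accepted first.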
Benchmarking Multimodal Regex Synthesis with Complex Structures

https://doi.org/10.18653/v1/2020.acl-main.541

Ye, Xi; Chen, Qiaochu; Dillig, Isil; Durrett, Greg (January 2020, Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics)

Existing datasets for regular expression (regex) generation from natural language are limited in complexity; compared to regex tasks that users post on StackOverflow, the regexes in these datasets are simple, and the language used to describe them is not diverse. We introduce StructuredRegex, a new regex synthesis dataset differing from prior ones in three aspects. First, to obtain structurally complex and realistic regexes, we generate the regexes using a probabilistic grammar with pre-defined macros observed from real-world StackOverflow posts. Second, to obtain linguistically diverse natural language descriptions, we show crowdworkers abstract depictions of the underlying regex and ask them to describe the pattern they see, rather than having them paraphrase synthetic language. Third, we augment each regex example with a collection of strings that are and are not matched by the ground truth regex, similar to how real users give examples. Our quantitative and qualitative analysis demonstrates the advantages of StructuredRegex over prior datasets. Further experimental results using various multimodal synthesis techniques highlight the challenge presented by our dataset, including non-local constraints and multi-modal inputs.
Type-directed synthesis of visualizations from natural language queries

https://doi.org/10.1145/3563307

Chen, Qiaochu; Pailoor, Shankara; Barnaby, Celeste; Criswell, Abby; Wang, Chenglong; Durrett, Greg; Dillig, Işıl (October 2022, Proceedings of the ACM on Programming Languages)

We propose a new technique based on program synthesis for automatically generating visualizations from natural language queries. Our method parses the natural language query into a refinement type specification using the intents-and-slots paradigm and leverages type-directed synthesis to generate a set of visualization programs that are most likely to meet the user's intent. Our refinement type system captures useful hints present in the natural language query and allows the synthesis algorithm to reject visualizations that violate well-established design guidelines for the input data set. We have implemented our ideas in a tool called Graphy and evaluated it on NLVCorpus, which consists of 3 popular datasets and over 700 real-world natural language queries. Our experiments show that Graphy significantly outperforms state-of-the-art natural language based visualization tools, including transformer and rule-based ones.
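The general idea of using types to prune a space of candidate visualizations can be sketched as below. This is only an illustration under stated assumptions, not Graphy or its refinement type system: simple type tags stand in for refinement types, and the table columns, chart templates, and `must_use` parameter are hypothetical.

```python
from itertools import permutations

# Hypothetical column types for an input table.
COLUMNS = {"country": "nominal", "year": "temporal", "gdp": "quantitative"}

# Hypothetical chart templates: each names the column type required per axis.
TEMPLATES = {
    "bar": ("nominal", "quantitative"),
    "line": ("temporal", "quantitative"),
    "scatter": ("quantitative", "quantitative"),
}

def synthesize(columns, templates, must_use=()):
    """Enumerate (chart, x, y) programs whose axis types match a template
    and which mention every column listed in `must_use`."""
    programs = []
    for chart, (x_type, y_type) in templates.items():
        for x, y in permutations(columns, 2):
            if columns[x] != x_type or columns[y] != y_type:
                continue  # ill-typed: reject without further checks
            if all(col in (x, y) for col in must_use):
                programs.append((chart, x, y))
    return programs

# A query such as "show gdp over the years" is assumed to yield these hints.
programs = synthesize(COLUMNS, TEMPLATES, must_use=("gdp", "year"))
print(programs)
```

Ill-typed candidates, such as a scatter plot over a nominal column, are discarded before any further checks, which is what keeps the enumeration small.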