NSF PAR Search | NSF Public Access Repository

LLM+AL: Bridging Large Language Models and Action Languages for Complex Reasoning about Actions

Ishay, Adam; Lee, Joohyung (February 2025, AAAI Press)

Large Language Models (LLMs) have made significant strides in various intelligent tasks but still struggle with complex action reasoning tasks that require systematic search. To address this limitation, we introduce a method that bridges the natural language understanding capability of LLMs with the symbolic reasoning capability of action languages, which are formal languages for reasoning about actions. Our approach, termed LLM+AL, leverages the LLM's strengths in semantic parsing and commonsense knowledge generation alongside the action language's expertise in automated reasoning based on encoded knowledge. We compare LLM+AL against state-of-the-art LLMs, including ChatGPT-4, Claude 3 Opus, Gemini Ultra 1.0, and o1-preview, using benchmarks for complex reasoning about actions. Our findings indicate that while all methods exhibit various errors, LLM+AL, with relatively simple human corrections, consistently leads to correct answers, whereas using LLMs alone does not yield improvements even after human intervention. LLM+AL also contributes to automated generation of action languages.
Free, publicly-accessible full text available February 4, 2026
Pathwise Explanation of ReLU Neural Networks

Lim, Seongwoo; Jo, Won; Lee, Joohyung; Choi, Jaesik (May 2024, PMLR (Proceedings of Machine Learning Research))

Neural networks have demonstrated a wide range of successes, but their “black box” nature raises concerns about transparency and reliability. Previous research on ReLU networks has sought to unwrap these networks into linear models based on activation states of all hidden units. In this paper, we introduce a novel approach that considers subsets of the hidden units involved in the decision-making path. This pathwise explanation provides a clearer and more consistent understanding of the relationship between the input and the decision-making process. Our method also offers flexibility in adjusting the range of explanations within the input, i.e., from an overall attribution input to particular components within the input. Furthermore, it allows for the decomposition of explanations for a given input for more detailed explanations. Our experiments demonstrate that the proposed method outperforms existing methods both quantitatively and qualitatively.
Full Text Available
Think before You Simulate: Symbolic Reasoning to Orchestrate Neural Computation for Counterfactual Question Answering

https://doi.org/10.1109/WACV57701.2024.00656

Ishay, Adam; Yang, Zhun; Lee, Joohyung; Kang, Ilgu; Lim, Dongjae (January 2024, IEEE)

Full Text Available
Leveraging Large Language Models to Generate Answer Set Programs

https://doi.org/10.24963/kr.2023/37

Ishay, Adam; Yang, Zhun; Lee, Joohyung (September 2023, International Joint Conferences on Artificial Intelligence Organization)

Large language models (LLMs), such as GPT-3 and GPT-4, have demonstrated exceptional performance in various natural language processing tasks and have shown the ability to solve certain reasoning problems. However, their reasoning capabilities are limited and relatively shallow, despite the application of various prompting techniques. In contrast, formal logic is adept at handling complex reasoning, but translating natural language descriptions into formal logic is a challenging task that non-experts struggle with. This paper proposes a neuro-symbolic method that combines the strengths of large language models and answer set programming. Specifically, we employ an LLM to transform natural language descriptions of logic puzzles into answer set programs, using carefully designed prompts that guide the conversion in a step-by-step manner. Surprisingly, with just a few in-context learning examples, LLMs can generate reasonably complex answer set programs. The majority of errors made are relatively simple and can be easily corrected by humans, thus enabling LLMs to effectively assist in the creation of answer set programs.
Full Text Available
Intuitive Access to Smartphone Settings Using Relevance Model Trained by Contrastive Learning

https://doi.org/10.1609/aaai.v37i13.26861

Kim, Joonyoung; Lee, Kangwook; Shin, Haebin; Lee, Hurnjoo; Kang, Sechun; Choi, Byunguk; Shin, Dong; Lee, Joohyung (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

As more features are added to smartphones, it becomes harder for users to find them. This is because the feature names are usually short and there are too many of them for users to remember the exact words. Users are more comfortable asking contextual queries that describe the features they are looking for, but the standard term frequency-based search cannot process such queries. This paper presents a novel retrieval system for mobile features that accepts intuitive and contextual search queries. We trained a relevance model via contrastive learning from a pre-trained language model to perceive the contextual relevance between a query embedding and indexed mobile features. Also, to make it run efficiently on-device using minimal resources, we applied knowledge distillation to compress the model without much loss in performance. To verify the feasibility of our method, we collected test queries and conducted comparative experiments with the currently deployed search baselines. The results show that our system outperforms the others on contextual sentence queries and even on usual keyword-based queries.
Full Text Available
Learning to solve constraint satisfaction problems with recurrent transformer

Yang, Zhun; Ishay, Adam; Lee, Joohyung (May 2023, International Conference on Learning Representations (ICLR))

Full Text Available
Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text

https://doi.org/10.18653/v1/2023.findings-acl.321

Yang, Zhun; Ishay, Adam; Lee, Joohyung (January 2023, Association for Computational Linguistics)

Full Text Available
Injecting Logical Constraints into Neural Networks via Straight-Through Estimators

Yang, Zhun; Lee, Joohyung; Park, Chiyoun (July 2022, International Conference on Machine Learning)

Injecting discrete logical constraints into neural network learning is one of the main challenges in neuro-symbolic AI. We find that a straight-through estimator, a method introduced to train binary neural networks, could effectively be applied to incorporate logical constraints into neural network learning. More specifically, we design a systematic way to represent discrete logical constraints as a loss function; minimizing this loss using gradient descent via a straight-through estimator updates the neural network's weights in a direction that makes the binarized outputs satisfy the logical constraints. The experimental results show that by leveraging GPUs and batch training, this method scales significantly better than existing neuro-symbolic methods that require heavy symbolic computation for computing gradients. Also, we demonstrate that our method applies to different types of neural networks, such as MLP, CNN, and GNN, making them learn with fewer or no labeled data by learning directly from known constraints.
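
The idea in the abstract above can be illustrated with a minimal sketch. It is not the paper's formulation: the single clause, the product-form loss, and every name here (`binarize`, `clause_loss`, `ste_grad`, `train`) are assumptions chosen for illustration. Two logits are thresholded to Boolean outputs, the clause "b[0] OR b[1]" is scored on those hard outputs, and the backward pass treats the threshold as the identity, which is the straight-through step.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def binarize(p):
    # Forward pass: hard threshold to {0, 1}. Its true gradient is zero almost everywhere.
    return 1.0 if p >= 0.5 else 0.0

def clause_loss(b):
    # Hypothetical loss for the clause (b[0] OR b[1]): 1 when both are false, else 0.
    return (1.0 - b[0]) * (1.0 - b[1])

def ste_grad(z):
    # Gradient of clause_loss with respect to the logits z.
    p = [sigmoid(v) for v in z]
    b = [binarize(v) for v in p]
    # Partial derivatives of the product-form loss with respect to each binarized output.
    dloss_db = [-(1.0 - b[1]), -(1.0 - b[0])]
    # Straight-through step: pretend d(binarize)/dp = 1, then chain through the sigmoid.
    return [g * pi * (1.0 - pi) for g, pi in zip(dloss_db, p)]

def train(z, lr=1.0, steps=50):
    # Plain gradient descent on the logits (stand-ins for network outputs).
    z = list(z)
    for _ in range(steps):
        grad = ste_grad(z)
        z = [v - lr * g for v, g in zip(z, grad)]
    return z

z_final = train([-1.0, -2.0])
b_final = [binarize(sigmoid(v)) for v in z_final]
```

Starting from logits whose binarized outputs violate the clause, the updates push the logits upward until one output flips to true, at which point the clause is satisfied. A real setup would apply the same backward rule to network weights through an autodiff framework rather than to bare logits.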
Full Text Available
Extending Answer Set Programs with Neural Networks

https://doi.org/10.4204/EPTCS.325.41

Yang, Zhun (September 2020, Electronic proceedings in theoretical computer science)

The integration of low-level perception with high-level reasoning is one of the oldest problems in Artificial Intelligence. Recently, several proposals were made to implement the reasoning process in complex neural network architectures. While these works aim at extending neural networks with the capability of reasoning, a natural question that we consider is: can we extend answer set programs with neural networks to allow complex and high-level reasoning on neural network outputs? As a preliminary result, we propose NeurASP – a simple extension of answer set programs by embracing neural networks where neural network outputs are treated as probability distributions over atomic facts in answer set programs. We show that NeurASP can not only improve the perception accuracy of a pre-trained neural network, but also help to train a neural network better by giving restrictions through logic rules. However, training with NeurASP would take much more time than pure neural network training due to the internal use of a symbolic reasoning engine. For future work, we plan to investigate the potential ways to solve the scalability issue of NeurASP. One potential way is to embed logic programs directly in neural networks. On this route, we plan to first design a SAT solver using neural networks, then extend such a solver to allow logic programs.
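
As a rough illustration of treating neural network outputs as probability distributions over atomic facts, the sketch below enumerates all joint assignments, keeps those that satisfy a logical constraint, and renormalizes. It is not the NeurASP implementation, which relies on an answer set solver; the brute-force enumeration, the independence assumption, and the names `posterior_given_constraint`, `digit1`, and `digit2` are all hypothetical.

```python
from itertools import product

def posterior_given_constraint(dists, constraint):
    """Return (posterior over satisfying assignments, probability of the constraint).

    dists: one dict per atom, mapping each candidate value to its probability
           (standing in for a classifier's softmax output); assumed independent.
    constraint: function taking a tuple of values and returning True or False.
    """
    weights = {}
    for combo in product(*[d.items() for d in dists]):
        values = tuple(v for v, _ in combo)
        if constraint(values):
            w = 1.0
            for _, p in combo:
                w *= p
            weights[values] = w
    total = sum(weights.values())
    if total == 0.0:
        raise ValueError("constraint has probability zero under the given distributions")
    return {vals: w / total for vals, w in weights.items()}, total

# Two hypothetical classifier outputs over the values {0, 1}.
digit1 = {0: 0.6, 1: 0.4}
digit2 = {0: 0.3, 1: 0.7}

# Logical knowledge: the two values must sum to 1.
posterior, prob = posterior_given_constraint(
    [digit1, digit2], lambda vals: sum(vals) == 1
)
best = max(posterior, key=posterior.get)
```

Here the constraint rules out the assignments (0, 0) and (1, 1), so the remaining probability mass is shared between (0, 1) and (1, 0). This mirrors, in a very reduced form, how logical rules can sharpen a classifier's predictions. Enumeration is exponential in the number of atoms, which is consistent with the scalability concern the abstract raises.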
Full Text Available
A Simple Extension of Answer Set Programs to Embrace Neural Networks (Extended Abstract)

https://doi.org/10.4204/EPTCS.325

Yang, Zhun; Ishay, Adam; Lee, Joohyung (September 2020, Electronic proceedings in theoretical computer science)
Ricca, Francesco et al. (Eds.)
The integration of low-level perception with high-level reasoning is one of the oldest problems in Artificial Intelligence. Today, the topic is revisited with the recent rise of deep neural networks. However, it is still not clear how complex and high-level reasoning, such as default reasoning, ontology reasoning, and causal reasoning, can be successfully computed by these approaches. The latter subject has been well-studied in the area of knowledge representation (KR), but many KR formalisms, including answer set programming (ASP), are logic-oriented and do not incorporate high-dimensional feature space as in deep learning, which limits the applicability of KR in many practical applications.
Full Text Available
