NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MKRAG: Medical Knowledge Retrieval Augmented Generation for Medical Question Answering

Shi, Yucheng; Xu, Shaochen; Yang, Tianze; Liu, Zhengliang; Liu, Tianming; Li, Xiang; Liu, Ninghao (May 2025, AMIA Annual Symposium)

Free, publicly-accessible full text available May 22, 2026
Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

Shi, Yucheng Shi; Li, Quanzheng; Sun, Jin; Li, Xiang; Liu, Ninghao (February 2025, The Thirteenth International Conference on Learning Representations)

Free, publicly-accessible full text available February 24, 2026
Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering

https://doi.org/10.1145/3627673.3679722

Shi, Yucheng; Tan, Qiaoyu; Wu, Xuansheng; Zhong, Shaochen; Zhou, Kaixiong; Liu, Ninghao (October 2024, ACM)

Full Text Available
DIRECT: Dual Interpretable Recommendation with Multi-aspect Word Attribution

https://doi.org/10.1145/3663483

Wu, Xuansheng; Wan, Hanqin; Tan, Qiaoyu; Yao, Wenlin; Liu, Ninghao (May 2024, ACM Transactions on Intelligent Systems and Technology)

Recommending products to users with intuitive explanations helps improve the system in transparency, persuasiveness, and satisfaction. Existing interpretation techniques include post-hoc methods and interpretable modeling. The former category could quantitatively analyze input contribution to model prediction but has limited interpretation faithfulness, while the latter could explain model internal mechanisms but may not directly attribute model predictions to input features. In this study, we propose a novelDualInterpretableRecommendation model called DIRECT, which integrates ideas of the two interpretation categories to inherit their advantages and avoid limitations. Specifically, DIRECT makes use of item descriptions as explainable evidence for recommendation. First, similar to the post-hoc interpretation, DIRECT could attribute the prediction of a user preference score to textual words of the item descriptions. The attribution of each word is related to its sentiment polarity and word importance, where a word is important if it corresponds to an item aspect that the user is interested in. Second, to improve the interpretability of embedding space, we propose to extract high-level concepts from embeddings, where each concept corresponds to an item aspect. To learn discriminative concepts, we employ a concept-bottleneck layer, and maximize the coding rate reduction on word-aspect embeddings by leveraging a word-word affinity graph extracted from a pre-trained language model. In this way, DIRECT simultaneously achieves faithful attribution and usable interpretation of embedding space. We also show that DIRECT achieves linear inference time complexity regarding the length of item reviews. We conduct experiments including ablation studies on five real-world datasets. Quantitative analysis, visualizations, and case studies verify the interpretability of DIRECT. Our code is available at:https://github.com/JacksonWuxs/DIRECT.
more » « less
Full Text Available
Automated Natural Language Explanation of Deep Visual Neurons with Large Models (Student Abstract)

https://doi.org/10.1609/aaai.v38i21.30537

Zhao, Chenxu; Qian, Wei; Shi, Yucheng; Huai, Mengdi; Liu, Ninghao (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

Interpreting deep neural networks through examining neurons offers distinct advantages when it comes to exploring the inner workings of Deep Neural Networks. Previous research has indicated that specific neurons within deep vision networks possess semantic meaning and play pivotal roles in model performance. Nonetheless, the current methods for generating neuron semantics heavily rely on human intervention, which hampers their scalability and applicability. To address this limitation, this paper proposes a novel post-hoc framework for generating semantic explanations of neurons with large foundation models, without requiring human intervention or prior knowledge. Experiments are conducted with both qualitative and quantitative analysis to verify the effectiveness of our proposed approach.
more » « less
Full Text Available
Black-box Backdoor Defense via Zero-shot Image Purification

Shi, Yucheng; Du, Mengnan; Wu, Xuansheng; Guan, Zihan; Sun, Jin; Liu, Ninghao (November 2023, Conference on Neural Information Processing Systems)

Full Text Available
GiGaMAE: Generalizable Graph Masked Autoencoder via Collaborative Latent Space Reconstruction

https://doi.org/10.1145/3583780.3614894

Shi, Yucheng; Dong, Yushun; Tan, Qiaoyu; Li, Jundong; Liu, Ninghao (October 2023, Proceedings of the 32nd ACM International Conference on Information and Knowledge Management)

Self-supervised learning with masked autoencoders has recently gained popularity for its ability to produce effective image or textual representations, which can be applied to various downstream tasks without retraining. However, we observe that the current masked autoencoder models lack good generalization ability on graph data. To tackle this issue, we propose a novel graph masked autoencoder framework called GiGaMAE. Different from existing masked autoencoders that learn node presentations by explicitly reconstructing the original graph components (e.g., features or edges), in this paper, we propose to collaboratively reconstruct informative and integrated latent embeddings. By considering embeddings encompassing graph topology and attribute information as reconstruction targets, our model could capture more generalized and comprehensive knowledge. Furthermore, we introduce a mutual information based reconstruction loss that enables the effective reconstruction of multiple targets. This learning objective allows us to differentiate between the exclusive knowledge learned from a single target and common knowledge shared by multiple targets. We evaluate our method on three downstream tasks with seven datasets as benchmarks. Extensive experiments demonstrate the superiority of GiGaMAE against state-of-the-art baselines. We hope our results will shed light on the design of foundation models on graph-structured data. Our code is available at: https://github.com/sycny/GiGaMAE.
more » « less
Full Text Available
Attacking Neural Networks with Neural Networks: Towards Deep Synchronization for Backdoor Attacks

https://doi.org/10.1145/3583780.3614784

Guan, Zihan; Sun, Lichao; Du, Mengnan; Liu, Ninghao (October 2023, ACM)

Full Text Available
ENGAGE: Explanation Guided Data Augmentation for Graph Representation Learning

https://doi.org/10.1007/978-3-031-43418-1_7

Shi, Yucheng; Zhou, Kaixiong; Liu, Ninghao (September 2023, Springer Nature Switzerland)

Full Text Available
Interpreting Unfairness in Graph Neural Networks via Training Node Attribution

https://doi.org/10.1609/aaai.v37i6.25905

Dong, Yushun; Wang, Song; Ma, Jing; Liu, Ninghao; Li, Jundong (June 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

Graph Neural Networks (GNNs) have emerged as the leading paradigm for solving graph analytical problems in various real-world applications. Nevertheless, GNNs could potentially render biased predictions towards certain demographic subgroups. Understanding how the bias in predictions arises is critical, as it guides the design of GNN debiasing mechanisms. However, most existing works overwhelmingly focus on GNN debiasing, but fall short on explaining how such bias is induced. In this paper, we study a novel problem of interpreting GNN unfairness through attributing it to the influence of training nodes. Specifically, we propose a novel strategy named Probabilistic Distribution Disparity (PDD) to measure the bias exhibited in GNNs, and develop an algorithm to efficiently estimate the influence of each training node on such bias. We verify the validity of PDD and the effectiveness of influence estimation through experiments on real-world datasets. Finally, we also demonstrate how the proposed framework could be used for debiasing GNNs. Open-source code can be found at https://github.com/yushundong/BIND.
more » « less
Full Text Available

Search for: All records