NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction

https://doi.org/10.1145/3711896.3737011

Abdullahi, Tassallah; Gemou, Ioanna; Nayak, Nihal V; Murtaza, Ghulam; Bach, Stephen H; Eickhoff, Carsten; Singh, Ritambhara (August 2025, KDD '25: Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Biomedical knowledge graphs (KGs) encode rich, structured information critical for drug discovery tasks, but extracting meaningful insights from large-scale KGs remains challenging due to their complex structure. Existing biomedical subgraph retrieval methods are tailored for graph neural networks (GNNs), limiting compatibility with other paradigms, including large language models (LLMs). We introduce K-Paths, a model-agnostic retrieval framework that extracts structured, diverse, and biologically meaningful multi-hop paths from dense biomedical KGs. These paths enable prediction of unobserved drug-drug and drug-disease interactions, including those involving entities not seen during training, thus supporting inductive reasoning. K-Paths is training-free and employs a diversity-aware adaptation of Yen's algorithm to extract the K shortest loopless paths between entities in a query, prioritizing biologically relevant and relationally diverse connections. These paths serve as concise, interpretable reasoning chains that can be directly integrated with LLMs or GNNs to improve generalization, accuracy, and enable explainable inference. Experiments on benchmark datasets show that K-Paths improves zero-shot reasoning across state-of-the-art LLMs. For instance, Tx-Gemma 27B improves by 19.8 and 4.0 F1 points on interaction severity prediction and drug repurposing tasks, respectively. Llama 70B achieves gains of 8.5 and 6.2 points on the same tasks. K-Paths also boosts the training efficiency of EmerGNN, a state-of-the-art GNN, by reducing the KG size by 90% while maintaining predictive performance. Beyond efficiency, K-Paths bridges the gap between KGs and LLMs, enabling scalable and explainable LLM-augmented scientific discovery. We release our code and the retrieved paths as a benchmark for inductive reasoning.
more » « less
Free, publicly-accessible full text available August 3, 2026
Does CLIP Bind Concepts? Probing Compositionality in Large Image Models

Lewis, Martha; Nayak, Nihal V; Yu, Peilin; Yu, Qinan; Merullo, Jack; Bach, Stephen H; Pavlick, Ellie (March 2024, Findings of the Association for Computational Linguistics: EACL 2024)

Large-scale neural network models combining text and images have made incredible progress in recent years. However, it remains an open question to what extent such models encode compositional representations of the concepts over which they operate, such as correctly identifying red cube by reasoning over the constituents red and cube. In this work, we focus on the ability of a large pretrained vision and language model (CLIP) to encode compositional concepts and to bind variables in a structure-sensitive way (e.g., differentiating cube behind sphere from sphere behind cube). To inspect the performance of CLIP, we compare several architectures from research on compositional distributional semantics models (CDSMs), a line of research that attempts to implement traditional compositional linguistic structures within embedding spaces. We benchmark them on three synthetic datasets– singleobject, two-object, and relational– designed to test concept binding. We find that CLIP can compose concepts in a single-object setting, but in situations where concept binding is needed, performance drops dramatically. At the same time, CDSMs also perform poorly, with best performance at chance level.
more » « less
Full Text Available
Fairness via Explanation Quality: Evaluating Disparities in the Quality of Post hoc Explanations

https://doi.org/10.1145/3514094.3534159

Dai, Jessica; Upadhyay, Sohini; Aivodji, Ulrich; Bach, Stephen H.; Lakkaraju, Himabindu (January 2022, AAAI/ACM Conference on AI, Ethics, and Society)

Full Text Available
Adversarial Multi Class Learning under Weak Supervision with Performance Guarantees

Mazzetto, Alessio; Cousins, Cyrus; Sam, Dylan; Bach, Stephen; Upfal, Eli (July 2021, International Conference on Machine Learning (ICML))
Zhang, Tong (Ed.)
We develop a rigorous approach for using a set of arbitrarily correlated weak supervision sources in order to solve a multiclass classification task when only a very small set of labeled data is available. Our learning algorithm provably converges to a model that has minimum empirical risk with respect to an adversarial choice over feasible labelings for a set of unlabeled data, where the feasibility of a labeling is computed through constraints defined by rigorously estimated statistics of the weak supervision sources. We show theoretical guarantees for this approach that depend on the information provided by the weak supervision sources. Notably, this method does not require the weak supervision sources to have the same labeling space as the multiclass classification task. We demonstrate the effectiveness of our approach with experiments on various image classification tasks.
more » « less
Full Text Available
Semi-Supervised Aggregation of Dependent Weak Supervision Sources With Performance Guarantees

Mazzetto, Alessio; Sam, Dylan; Park, Andrew; Upfal, Eli; Bach, Stephen (April 2021, International Conference on Artificial Intelligence and Statistics, 13-15)
Arindam, Banerjee; Kenji, Fukumizu (Ed.)
We develop a novel method that provides theoretical guarantees for learning from weak labelers without the (mostly unrealistic) assumption that the errors of the weak labelers are independent or come from a particular family of distributions. We show a rigorous technique for efficiently selecting small subsets of the labelers so that a majority vote from such subsets has a provably low error rate. We explore several extensions of this method and provide experimental results over a range of labeled data set sizes on 45 image classification tasks. Our performance-guaranteed methods consistently match the best performing alternative, which varies based on problem difficulty. On tasks with accurate weak labelers, our methods are on average 3 percentage points more accurate than the state-of-the-art adversarial method. On tasks with inaccurate weak labelers, our methods are on average 15 percentage points more accurate than the semi-supervised Dawid-Skene model (which assumes independence).
more » « less
Full Text Available

Search for: All records