NSF PAR Search | NSF Public Access Repository

Geometry Informed Tokenization of Molecules for Language Model Generation

Li, Xiner; Wang, Limei; Luo, Youzhi; Edwards, Carl; Gui, Shurui; Lin, Yuchao; Ji, Heng; Ji, Shuiwang (July 2025, ICML)

Free, publicly-accessible full text available July 23, 2026
MolCap-Arena: A Comprehensive Captioning Benchmark on Language-Enhanced Molecular Property Prediction

Edwards, Carl; Lu, Ziqing; Hajiramezanali, Ehsan; Biancalani, Tommaso; Ji, Heng; Scalia, Gabriele (May 2025, ICLR/OpenReview)

Free, publicly-accessible full text available May 18, 2026
Automating Intervention Discovery from Scientific Literature: A Progressive Ontology Prompting and Dual-LLM Framework

https://doi.org/10.24963/ijcai.2025/1078

Hu, Yuting; Liu, Dancheng; Wang, Qingyun; Yu, Charles; Xu, Chenhui; Zheng, Qingxiao; Ji, Heng; Xiong, Jinjun (September 2025, International Joint Conferences on Artificial Intelligence Organization)

Identifying effective interventions from the scientific literature is challenging due to the high volume of publications, specialized terminology, and inconsistent reporting formats, making manual curation laborious and prone to oversight. To address this challenge, this paper proposes a novel framework leveraging large language models (LLMs), which integrates a progressive ontology prompting (POP) algorithm with a dual-agent system, named LLM-Duo. On the one hand, the POP algorithm conducts a prioritized breadth-first search (BFS) across a predefined ontology, generating structured prompt templates and action sequences to guide the automatic annotation process. On the other hand, the LLM-Duo system features two specialized LLM agents, an explorer and an evaluator, working collaboratively and adversarially to continuously refine annotation quality. We showcase the real-world applicability of our framework through a case study focused on speech-language intervention discovery. Experimental results show that our approach surpasses advanced baselines, achieving more accurate and comprehensive annotations through a fully automated process. Our approach successfully identified 2,421 interventions from a corpus of 64,177 research articles in the speech-language pathology domain, culminating in the creation of a publicly accessible intervention knowledge base with great potential to benefit the speech-language pathology community.
Free, publicly-accessible full text available September 1, 2026
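The abstract above describes POP as a prioritized breadth-first search over a predefined ontology that yields structured prompt templates. A minimal sketch of that traversal idea follows. The abstract does not give the paper's ontology, priority scheme, or template wording, so the toy ontology, the `priority` scores, the prompt text, and the function name `prioritized_bfs_prompts` are all hypothetical; this is not the authors' implementation.

```python
import heapq
from itertools import count


def prioritized_bfs_prompts(ontology, priority, root):
    """Visit ontology nodes level by level; within a level, higher priority first.

    ontology: dict mapping a node to its list of child nodes (hypothetical format).
    priority: dict mapping a node to a numeric score (hypothetical; default 0).
    Returns a list of (node, prompt) pairs in visit order.
    """
    seq = count()  # insertion counter, used only to break ties deterministically
    # Heap entries sort by depth first (breadth-first), then by descending priority.
    frontier = [(0, 0, next(seq), root)]
    seen = {root}
    visited = []
    while frontier:
        depth, _, _, node = heapq.heappop(frontier)
        # Placeholder template; the real templates are not given in the abstract.
        prompt = f"From the article, extract the '{node}' of the intervention."
        visited.append((node, prompt))
        for child in ontology.get(node, []):
            if child not in seen:
                seen.add(child)
                entry = (depth + 1, -priority.get(child, 0), next(seq), child)
                heapq.heappush(frontier, entry)
    return visited


# Toy ontology, assumed for illustration only.
toy_ontology = {
    "intervention": ["target", "participants", "outcome"],
    "target": ["skill"],
    "outcome": ["measure"],
}
toy_priority = {"outcome": 3, "target": 2, "participants": 1}

steps = prioritized_bfs_prompts(toy_ontology, toy_priority, "intervention")
for node, prompt in steps:
    print(node, "->", prompt)
```

Ordering the heap by depth before priority keeps the traversal breadth-first, so every concept at one ontology level is prompted before any deeper one. In a dual-agent setup such as the one the abstract describes, each generated prompt would presumably be passed to the explorer and its answer checked by the evaluator; that loop is omitted here.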
Scientific Opinion Summarization: Paper Meta-review Generation Dataset, Methods, and Evaluation

https://doi.org/10.1007/978-981-97-9536-9_2

Zeng, Qi; Sidhu, Mankeerat; Blume, Ansel; Chan, Hou Pong; Wang, Lu; Ji, Heng (January 2025, Springer Nature Singapore)

Full Text Available
Large Language Models on Graphs: A Comprehensive Survey

https://doi.org/10.1109/TKDE.2024.3469578

Jin, Bowen; Liu, Gang; Han, Chi; Jiang, Meng; Ji, Heng; Han, Jiawei (December 2024, IEEE Transactions on Knowledge and Data Engineering)

Full Text Available
SMART: Self-Aware Agent for Tool Overuse Mitigation

https://doi.org/10.18653/v1/2025.findings-acl.239

Qian, Cheng; Acikgoz, Emre Can; Wang, Hongru; Chen, Xiusi; Sil, Avirup; Hakkani-Tür, Dilek; Tur, Gokhan; Ji, Heng (January 2025, Association for Computational Linguistics)

Full Text Available
GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices

https://doi.org/10.1145/3627673.3680103

Nguyen, Thao; Torres-Flores, Tiara; Hwang, Changhyun; Edwards, Carl; Diao, Ying; Ji, Heng (October 2024, ACM)

Full Text Available
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

https://doi.org/10.18653/v1/2025.acl-long.421

Zhu, Kunlun; Du, Hongyi; Hong, Zhaochen; Yang, Xiaocheng; Guo, Shuyi; Wang, Zhe; Wang, Zhenhailong; Qian, Cheng; Tang, Robert; Ji, Heng; et al. (January 2025, Association for Computational Linguistics)

Full Text Available
The Law of Knowledge Overshadowing: Towards Understanding, Predicting and Preventing LLM Hallucination

https://doi.org/10.18653/v1/2025.findings-acl.1199

Zhang, Yuji; Li, Sha; Qian, Cheng; Liu, Jiateng; Yu, Pengfei; Han, Chi; Fung, Yi R; McKeown, Kathleen; Zhai, ChengXiang; Li, Manling; et al. (January 2025, Association for Computational Linguistics)

Full Text Available
Fragment and Geometry Aware Tokenization of Molecules for Structure-Based Drug Design Using Language Models

Fu, Cong; Li, Xiner; Olson, Blake; Ji, Heng; Ji, Shuiwang (August 2024, ICLR/OpenReview)

Full Text Available
