NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Context Selection and Rewriting for Video-based Educational Question Generation

Yu, Mengxia; Nguyen, Bang; Zino, Olivia; Jiang, Meng (April 2025, arxiv: 2504.19406v2)

Free, publicly-accessible full text available April 25, 2026
MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems

Zhu, Zifeng; Jia, Mengzhao; Zhang, Zhihan; Li, Lang; Jiang, Meng (April 2025, Association for Computational Linguistics)

Multimodal Large Language Models (MLLMs) have demonstrated impressive abilities across various tasks, including visual question answering and chart comprehension, yet existing benchmarks for chart-related tasks fall short in capturing the complexity of real-world multi-chart scenarios. Current benchmarks primarily focus on single-chart tasks, neglecting the multi-hop reasoning required to extract and integrate information from multiple charts, which is essential in practical applications. To fill this gap, we introduce MultiChartQA, a benchmark that evaluates MLLMs’ capabilities in four key areas: direct question answering, parallel question answering, comparative reasoning, and sequential reasoning. Our evaluation of a wide range of MLLMs reveals significant performance gaps compared to humans. These results highlight the challenges in multi-chart comprehension and the potential of MultiChartQA to drive advancements in this field. Our code and data are available at https://github.com/Zivenzhu/Multi-chart-QA.
more » « less
Free, publicly-accessible full text available April 27, 2026
Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench

Liu, Zheyuan; Dou, Guangyao; Jia, Mengzhao; Tan, Zhaoxuan; Zeng, Qingkai; Yuan, Yongle; Jiang, Meng (April 2025, Association for Computational Linguistics)

Generative models such as Large Language Models (LLM) and Multimodal Large Language models (MLLMs) trained on massive web corpora can memorize and disclose individuals’ confidential and private data, raising legal and ethical concerns. While many previous works have addressed this issue in LLM via machine unlearning, it remains largely unexplored for MLLMs. To tackle this challenge, we introduce Multimodal Large Language Model Unlearning Benchmark (MLLMU-Bench), a novel benchmark aimed at advancing the understanding of multimodal machine unlearning. MLLMU-Bench consists of 500 fictitious profiles and 153 profiles for public celebrities, each profile feature over 14 customized question-answer pairs, evaluated from both multimodal (image+text) and unimodal (text) perspectives. The benchmark is divided into four sets to assess unlearning algorithms in terms of efficacy, generalizability, and model utility. Finally, we provide baseline results using existing generative model unlearning algorithms. Surprisingly, our experiments show that unimodal unlearning algorithms excel in generation tasks, while multimodal unlearning approaches perform better in classification with multimodal inputs.
more » « less
Free, publicly-accessible full text available April 27, 2026
QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation

https://doi.org/10.18653/v1/2025.acl-long.1268

Nguyen, Bang; Du, Tingting; Yu, Mengxia; Angrave, Lawrence; Jiang, Meng (January 2025, Association for Computational Linguistics)

Free, publicly-accessible full text available January 1, 2026
MultiChartQA: Benchmarking Vision-Language Models on Multi-Chart Problems

https://doi.org/10.18653/v1/2025.naacl-long.566

Zhu, Zifeng; Jia, Mengzhao; Zhang, Zhihan; Li, Lang; Jiang, Meng (January 2025, Association for Computational Linguistics)

Free, publicly-accessible full text available January 1, 2026
Learning Attribute as Explicit Relation for Sequential Recommendation

Liu, Gang; Yang, Fan; Jiao, Yang; Garakani, Alireza Bagheri; Tong, Tian; Gao, Yan; Jiang, Meng (February 2025, Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Free, publicly-accessible full text available February 1, 2026
IHEval: Evaluating Language Models on Following the Instruction Hierarchy

Zhang, Zhihan; Li, Shiyang; Zhang, Zixuan; Liu, Xin; Jiang, Haoming; Tang, Xianfeng; Gao, Yifan; Li, Zheng; Wang, Haodong; Tan, Zhaoxuan; et al (April 2025, Association for Computational Linguistics)
Chiruzzo, Luis; Ritter, Alan; Wang, Lu (Ed.)
The instruction hierarchy, which establishes a priority order from system messages to user messages, conversation history, and tool outputs, is essential for ensuring consistent and safe behavior in language models (LMs). Despite its importance, this topic receives limited attention, and there is a lack of comprehensive benchmarks for evaluating models’ ability to follow the instruction hierarchy. We bridge this gap by introducing IHEval, a novel benchmark comprising 3,538 examples across nine tasks, covering cases where instructions in different priorities either align or conflict. Our evaluation of popular LMs highlights their struggle to recognize instruction priorities. All evaluated models experience a sharp performance decline when facing conflicting instructions, compared to their original instruction-following performance. Moreover, the most competitive open-source model only achieves 48% accuracy in resolving such conflicts. Our results underscore the need for targeted optimization in the future development of LMs.
more » « less
Free, publicly-accessible full text available April 27, 2026
Large Language Models on Graphs: A Comprehensive Survey

https://doi.org/10.1109/TKDE.2024.3469578

Jin, Bowen; Liu, Gang; Han, Chi; Jiang, Meng; Ji, Heng; Han, Jiawei (December 2024, IEEE Transactions on Knowledge and Data Engineering)

Free, publicly-accessible full text available December 1, 2025
Transcend the boundaries: Machine learning for designing polymeric membrane materials for gas separation

https://doi.org/10.1063/5.0205433

Xu, Jiaxin; Suleiman, Agboola; Liu, Gang; Zhang, Renzheng; Jiang, Meng; Guo, Ruilan; Luo, Tengfei (December 2024, Chemical Physics Reviews)

Polymeric membranes have become essential for energy-efficient gas separations such as natural gas sweetening, hydrogen separation, and carbon dioxide capture. Polymeric membranes face challenges like permeability-selectivity tradeoffs, plasticization, and physical aging, limiting their broader applicability. Machine learning (ML) techniques are increasingly used to address these challenges. This review covers current ML applications in polymeric gas separation membrane design, focusing on three key components: polymer data, representation methods, and ML algorithms. Exploring diverse polymer datasets related to gas separation, encompassing experimental, computational, and synthetic data, forms the foundation of ML applications. Various polymer representation methods are discussed, ranging from traditional descriptors and fingerprints to deep learning-based embeddings. Furthermore, we examine diverse ML algorithms applied to gas separation polymers. It provides insights into fundamental concepts such as supervised and unsupervised learning, emphasizing their applications in the context of polymer membranes. The review also extends to advanced ML techniques, including data-centric and model-centric methods, aimed at addressing challenges unique to polymer membranes, focusing on accurate screening and inverse design.
more » « less
Free, publicly-accessible full text available December 1, 2025
Chain-of-Layer: Iteratively Prompting Large Language Models for Taxonomy Induction from Limited Examples

https://doi.org/10.1145/3627673.3679608

Zeng, Qingkai; Bai, Yuyang; Tan, Zhaoxuan; Feng, Shangbin; Liang, Zhenwen; Zhang, Zhihan; Jiang, Meng (October 2024, ACM)

Full Text Available

« Prev Next »

Search for: All records