NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Separate the Wheat from the Chaff: Winnowing Down Divergent Views in Retrieval Augmented Generation

https://doi.org/10.18653/v1/2025.emnlp-main.587

Wang, Song; Chen, Zihan; Wang, Peng; Wei, Zhepei; Tan, Zhen; Meng, Yu; Shen, Cong; Li, Jundong (November 2025, Association for Computational Linguistics)

Free, publicly-accessible full text available November 4, 2026
TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching

Meng, Yu; Fan, C (July 2025, Proceedings of the 42nd International Conference on Machine Learning)

Free, publicly-accessible full text available July 13, 2026
LLM Alignment as Retriever Optimization: An Information Retrieval Perspective

Jin, Bowen; Yoon, Jinsung; Qin, Zhen; Wang, Ziqi; Xiong, Wei; Meng, Yu; Han, Jiawei; Arik, Sercan (July 2025, International Machine Learning Society (IMLS))

Free, publicly-accessible full text available July 23, 2026
Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction

https://doi.org/10.18653/v1/2024.emnlp-main.747

Zhou, Sizhe; Meng, Yu; Jin, Bowen; Han, Jiawei (January 2024, Association for Computational Linguistics)

Full Text Available
Pretrained Language Representations for Text Understanding: A Weakly-Supervised Perspective

https://doi.org/10.1145/3580305.3599569

Meng, Yu; Huang, Jiaxin; Zhang, Yu; Zhang, Yunyi; Han, Jiawei (August 2023, ACM)

Full Text Available
Tuning language models as training data generators for augmentation-enhanced few-shot learning

Meng, Yu; Michalski, Martin; Huang, Jiaxin; Zhang, Yu; Abdelzaher, Tarek; Han, Jiawei (July 2023, ACM)

Full Text Available
Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks

Jin, Bowen; Zhang, Yu; Meng, Yu; Han, Jiawei (May 2023, The Eleventh International Conference on Learning Representations, {ICLR} 2023)

Edges in many real-world social/information networks are associated with rich text information (e.g., user-user communications or user-product reviews). However, mainstream network representation learning models focus on propagating and aggregating node attributes, lacking specific designs to utilize text semantics on edges. While there exist edge-aware graph neural networks, they directly initialize edge attributes as a feature vector, which cannot fully capture the contextualized text semantics of edges. In this paper, we propose Edgeformers, a framework built upon graph-enhanced Transformers, to perform edge and node representation learning by modeling texts on edges in a contextualized way. Specifically, in edge representation learning, we inject network information into each Transformer layer when encoding edge texts; in node representation learning, we aggregate edge representations through an attention mechanism within each node’s ego-graph. On five public datasets from three different domains, Edgeformers consistently outperform state-of-the-art baselines in edge classification and link prediction, demonstrating the efficacy in learning edge and node representations, respectively.
more » « less
Full Text Available
SCStory: Self-supervised and Continual Online Story Discovery

https://doi.org/10.1145/3543507.3583507

Yoon, Susik; Meng, Yu; Lee, Dongha; Han, Jiawei (April 2023, ACM)
Proc. 2023 The Web Conf. (Ed.)
We present a framework SCStory for online story discovery, that helps people digest rapidly published news article streams in realtime without human annotations. To organize news article streams into stories, existing approaches directly encode the articles and cluster them based on representation similarity. However, these methods yield noisy and inaccurate story discovery results because the generic article embeddings do not effectively reflect the storyindicative semantics in an article and cannot adapt to the rapidly evolving news article streams. SCStory employs self-supervised and continual learning with a novel idea of story-indicative adaptive modeling of news article streams. With a lightweight hierarchical embedding module that first learns sentence representations and then article representations, SCStory identifies story-relevant information of news articles and uses them to discover stories. The embedding module is continuously updated to adapt to evolving news streams with a contrastive learning objective, backed up by two unique techniques, confidence-aware memory replay and prioritized-augmentation, employed for label absence and data scarcity problems. Thorough experiments on real and the latest news data sets demonstrate that SCStory outperforms existing state-of-the-art algorithms for unsupervised online story discovery.
more » « less
Weakly Supervised Multi-Label Classification of Full-Text Scientific Papers

https://doi.org/10.1145/3580305.3599544

Zhang, Yu; Jin, Bowen; Chen, Xiusi; Shen, Yanzhen; Zhang, Yunyi; Meng, Yu; Han, Jiawei (August 2023, ACM)
Proc. 2023 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (Ed.)
Instead of relying on human-annotated training samples to build a classifier, weakly supervised scientific paper classification aims to classify papers only using category descriptions (e.g., category names, category-indicative keywords). Existing studies on weakly supervised paper classification are less concerned with two challenges: (1) Papers should be classified into not only coarse-grained research topics but also fine-grained themes, and potentially into multiple themes, given a large and fine-grained label space; and (2) full text should be utilized to complement the paper title and abstract for classification. Moreover, instead of viewing the entire paper as a long linear sequence, one should exploit the structural information such as citation links across papers and the hierarchy of sections and paragraphs in each paper. To tackle these challenges, in this study, we propose FuTex, a framework that uses the cross-paper network structure and the in-paper hierarchy structure to classify full-text scientific papers under weak supervision. A network-aware contrastive fine-tuning module and a hierarchyaware aggregation module are designed to leverage the two types of structural signals, respectively. Experiments on two benchmark datasets demonstrate that FuTex significantly outperforms competitive baselines and is on par with fully supervised classifiers that use 1,000 to 60,000 ground-truth training samples.
more » « less
Full Text Available
Patton: Language Model Pretraining on Text-Rich Networks

https://doi.org/10.18653/v1/2023.acl-long.387

Jin, Bowen; Zhang, Wentao; Zhang, Yu; Meng, Yu; Zhang, Xinyang; Zhu, Qi; Han, Jiawei (July 2023, Association for Computational Linguistics)

A real-world text corpus sometimes comprises not only text documents, but also semantic links between them (e.g., academic papers in a bibliographic network are linked by citations and co-authorships). Text documents and semantic connections form a text-rich network, which empowers a wide range of downstream tasks such as classification and retrieval. However, pretraining methods for such structures are still lacking, making it difficult to build one generic model that can be adapted to various tasks on text-rich networks. Current pretraining objectives, such as masked language modeling, purely model texts and do not take inter-document structure information into consideration. To this end, we propose our PretrAining on TexT-Rich NetwOrk framework PATTON. PATTON1 includes two pretraining strategies: network-contextualized masked language modeling and masked node prediction, to capture the inherent dependency between textual attributes and network structure. We conduct experiments on four downstream tasks in five datasets from both academic and e-commerce domains, where PATTON outperforms baselines significantly and consistently.
more » « less
Full Text Available

« Prev Next »

Search for: All records