NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

MAPLE: Many-Shot Adaptive Pseudo-Labeling for In-Context Learning

Chen, Zihan; Wang, Song; Tan, Zhen; Li, Jundong; Shen, Cong (July 2025, International Conference on Machine Learning (ICML))

Free, publicly-accessible full text available July 31, 2026
Genesis: A Compiler for Hamiltonian Simulation on Hybrid CV-DV Quantum Computers

https://doi.org/10.1145/3695053.3731065

Chen, Zihan; Li, Jiakang; Guo, Minghao; Chen, Henry; Li, Zirui; Bierman, Joel; Huang, Yipeng; Zhou, Huiyang; Liu, Yuan; Zhang, Eddy Z (June 2025, ACM)

Free, publicly-accessible full text available June 20, 2026
Virtual Nodes Can Help: Tackling Distribution Shifts in Federated Graph Learning

https://doi.org/10.1609/aaai.v39i16.33830

Fu, Xingbo; Chen, Zihan; He, Yinhan; Wang, Song; Zhang, Binchi; Chen, Chen; Li, Jundong (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)

Federated Graph Learning (FGL) enables multiple clients to jointly train powerful graph learning models, e.g., Graph Neural Networks (GNNs), without sharing their local graph data for graph-related downstream tasks, such as graph property prediction. In the real world, however, the graph data can suffer from significant distribution shifts across clients as the clients may collect their graph data for different purposes. In particular, graph properties are usually associated with invariant label-relevant substructures (i.e., subgraphs) across clients, while label-irrelevant substructures can appear in a client-specific manner. The issue of distribution shifts of graph data hinders the efficiency of GNN training and leads to serious performance degradation in FGL. To tackle the aforementioned issue, we propose a novel FGL framework entitled FedVN that eliminates distribution shifts through client-specific graph augmentation strategies with multiple learnable Virtual Nodes (VNs). Specifically, FedVN lets the clients jointly learn a set of shared VNs while training a global GNN model. To eliminate distribution shifts, each client trains a personalized edge generator that determines how the VNs connect local graphs in a client-specific manner. Furthermore, we provide theoretical analyses indicating that FedVN can eliminate distribution shifts of graph data across clients. Comprehensive experiments on four datasets under five settings demonstrate the superiority of our proposed FedVN over nine baselines.
more » « less
Free, publicly-accessible full text available April 11, 2026
Mixture of Demonstrations for In-Context Learning

Wang, Song; Chen, Zihan; Shi, Chengshuai; Shen, Cong; Li, Jundong (December 2024, Annual Conference on Neural Information Processing Systems)

In-Context Learning (ICL) empowers Large Language Models (LLMs) to tackle various tasks by providing input-output examples as additional inputs, referred to as demonstrations. Nevertheless, the performance of ICL could be easily impacted by the quality of selected demonstrations. Existing efforts generally learn a retriever model to score each demonstration for selecting suitable demonstrations, however, the effect is suboptimal due to the large search space and the noise from unhelpful demonstrations. In this study, we introduce MoD, which partitions the demonstration pool into groups, each governed by an expert to reduce search space. We further design an expert-wise training strategy to alleviate the impact of unhelpful demonstrations when optimizing the retriever model. During inference, experts collaboratively retrieve demonstrations for the input query to enhance the ICL performance. We validate MoD via experiments across a range of NLP datasets and tasks, demonstrating its state-of-the-art performance and shedding new light on the future design of retrieval methods for ICL.
more » « less
Free, publicly-accessible full text available December 10, 2025
FastGAS: Fast Graph-based Annotation Selection for In-Context Learning

Chen, Zihan; Wang, Song; Shen, Cong; Li, Jundong (August 2024, Findings of the Association for Computational Linguistics ACL 2024)

In-context learning (ICL) empowers large language models (LLMs) to tackle new tasks by using a series of training instances as prompts. Since generating the prompts needs to sample from a vast pool of instances and annotate them (e.g., add labels in classification task), existing methods have proposed to select a subset of unlabeled examples for annotation, thus enhancing the quality of prompts and concurrently mitigating annotation costs. However, these methods often require a long time to select instances due to their complexity, hindering their practical viability. To address this limitation, we propose a graph-based selection method, FastGAS, designed to efficiently identify high-quality instances while minimizing computational overhead. Initially, we construct a data similarity graph based on instance similarities. Subsequently, employing a graph partitioning algorithm, we partition the graph into pieces. Within each piece (i.e., subgraph), we adopt a greedy approach to pick the most representative nodes. By aggregating nodes from diverse pieces and annotating the corresponding instances, we identify a set of diverse and representative instances for ICL. Compared to prior approaches, our method not only exhibits superior performance on different tasks but also significantly reduces selection time. In addition, we demonstrate the efficacy of our approach in LLMs of larger sizes.
more » « less
Full Text Available
Federated Graph Learning with Structure Proxy Alignment

https://doi.org/10.1145/3637528.3671717

Fu, Xingbo; Chen, Zihan; Zhang, Binchi; Chen, Chen; Li, Jundong (August 2024, Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Federated Graph Learning (FGL) aims to learn graph learning models over graph data distributed in multiple data owners, which has been applied in various applications such as social recommendation and financial fraud detection. Inherited from generic Federated Learning (FL), FGL similarly has the data heterogeneity issue where the label distribution may vary significantly for distributed graph data across clients. For instance, a client can have the majority of nodes from a class, while another client may have only a few nodes from the same class. This issue results in divergent local objectives and impairs FGL convergence for node-level tasks, especially for node classification. Moreover, FGL also encounters a unique challenge for the node classification task: the nodes from a minority class in a client are more likely to have biased neighboring information, which prevents FGL from learning expressive node embeddings with Graph Neural Networks (GNNs). To grapple with the challenge, we propose FedSpray, a novel FGL framework that learns local class-wise structure proxies in the latent space and aligns them to obtain global structure proxies in the server. Our goal is to obtain the aligned structure proxies that can serve as reliable, unbiased neighboring information for node classification. To achieve this, FedSpray trains a global feature-structure encoder and generates unbiased soft targets with structure proxies to regularize local training of GNN models in a personalized way. We conduct extensive experiments over four datasets, and experiment results validate the superiority of FedSpray compared with other baselines. Our code is available at https://github.com/xbfu/FedSpray.
more » « less
Full Text Available
Verification of Machine Unlearning is Fragile

Zhang, Binchi; Chen, Zihan; Shen, Cong; Li, Jundong (July 2024, 2024 International Conference on Machine Learning)

As privacy concerns escalate in the realm of machine learning, data owners now have the option to utilize machine unlearning to remove their data from machine learning models, following recent legislation. To enhance transparency in machine unlearning and avoid potential dishonesty by model providers, various verification strategies have been proposed. These strategies enable data owners to ascertain whether their target data has been effectively unlearned from the model. However, our understanding of the safety issues of machine unlearning verification remains nascent. In this paper, we explore the novel research question of whether model providers can circumvent verification strategies while retaining the information of data supposedly unlearned. Our investigation leads to a pessimistic answer: \textit{the verification of machine unlearning is fragile}. Specifically, we categorize the current verification strategies regarding potential dishonesty among model providers into two types. Subsequently, we introduce two novel adversarial unlearning processes capable of circumventing both types. We validate the efficacy of our methods through theoretical analysis and empirical experiments using real-world datasets. This study highlights the vulnerabilities and limitations in machine unlearning verification, paving the way for further research into the safety of machine unlearning.
more » « less
Full Text Available
Personalized Federated Learning with Attention-Based Client Selection

https://doi.org/10.1109/ICASSP48485.2024.10447362

Chen, Zihan; Li, Jundong; Shen, Cong (April 2024, IEEE)

Personalized Federated Learning (PFL) relies on collective data knowledge to build customized models. However, non-IID data between clients poses significant challenges, as collaborating with clients who have diverse data distributions can harm local model performance, especially with limited training data. To address this issue, we propose FedACS, a new PFL algorithm with an Attention-based Client Selection mechanism. FedACS integrates an attention mechanism to enhance collaboration among clients with similar data distributions and mitigate the data scarcity issue. It prioritizes and allocates resources based on data similarity. We further establish the theoretical convergence behavior of FedACS. Experiments on CIFAR10 and FMNIST validate FedACS’s superiority, showcasing its potential to advance personalized federated learning. By tackling non-IID data challenges and data scarcity, FedACS offers promising advances in personalized federated learning.
more » « less
Full Text Available
Arctic Maritime Cyclone Distribution and Trends in the ERA5 Reanalysis

https://doi.org/10.1175/JAMC-D-21-0016.1

Chen, Zihan; Lynch, Amanda H. (April 2022, Journal of Applied Meteorology and Climatology)

Abstract We present a tracking algorithm for synoptic to meso- α -scale Arctic cyclones that differentiates between cold- and warm-core systems. The algorithm is applied to the ERA5 reanalysis north of 60°N from 1950 to 2019. In this dataset, over one-half of the cyclones that meet minimum intensity and duration thresholds can be classified as cold-core systems. Systems that undergo transition, typically from cold to warm core, make up 27.2% of cyclones and are longer lived. The relatively infrequent warm-core cyclones are more intense and are most common in winter. The Arctic-wide occurrence of maritime cyclones has increased from 1979 to 2019 when compared with the period from 1950 to 1978, but the trends have high interannual variability. This shift has ramifications for transportation, fisheries, and extractive industries, as well as impacts on communities across the Arctic.
more » « less
Full Text Available
FPGA-Based Velocity Estimation for Control of Robots with Low-Resolution Encoders

Wu, Jie Ying; Chen, Zihan; Deguet, Anton; Kazanzides, Peter (October 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))

Robot control algorithms often rely on measurements of robot joint velocities, which can be estimated by measuring the time between encoder edges. When encoder edges occur infrequently, such as at low velocities and/or with low resolution encoders, this measurement delay may affect the stability of closed-loop control. This is evident in both the joint position control and Cartesian impedance control of the da Vinci Research Kit (dVRK), which contains several low-resolution encoders. We present a hardware-based method that gives more frequent velocity updates and is not affected by common encoder imperfections such as non-uniform duty cycles and quadrature phase error. The proposed method measures the time between consecutive edges of the same type but, unlike prior methods, is implemented for the rising and falling edges of both channels. Additionally, it estimates acceleration to enable software compensation of the measurement delay. The method is shown to improve Cartesian impedance control of the dVRK.
more » « less
Full Text Available

« Prev Next »

Search for: All records