NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Debiasing Federated Learning with Correlated Client Participation

Sun, Zhenyu; Zhang, Ziyang; Xu, Zheng; Joshi, Gauri; Sharma, Pranay; Wei, Ermin (April 2025, Proceedings of the International Conference on Learning Representations)

Free, publicly-accessible full text available April 28, 2026
Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models

https://doi.org/10.18653/v1/2024.emnlp-main.717

Cho, Yae Jee; Liu, Luyang; Xu, Zheng; Fahrezi, Aldi; Joshi, Gauri (November 2024, Association for Computational Linguistics)

Free, publicly-accessible full text available November 30, 2025
Towards Robust Fidelity for Evaluating Explainability of Graph Neural Networks

Xu, Zheng; Shirani, Farhad; Wang, Tianchun; Cheng, Wei; Chen, Zhuomin; Chen, Haifeng; Wei, Hua; Luo, Dongsheng (March 2024, ICLR)

Graph Neural Networks (GNNs) are neural models that leverage the dependency structure in graphical data via message passing among the graph nodes. GNNs have emerged as pivotal architectures in analyzing graph-structured data, and their expansive application in sensitive domains requires a comprehensive understanding of their decision-making processes — necessitating a framework for GNN explainability. An explanation function for GNNs takes a pre-trained GNN along with a graph as input, to produce a ‘sufficient statistic’ subgraph with respect to the graph label. A main challenge in studying GNN explainability is to provide fidelity measures that evaluate the performance of these explanation functions. This paper studies this foundational challenge, spotlighting the inherent limitations of prevailing fidelity metrics, including Fid+, Fid−, and Fid∆. Specifically, a formal, information-theoretic definition of explainability is introduced and it is shown that existing metrics often fail to align with this definition across various statistical scenarios. The reason is due to potential distribution shifts when subgraphs are removed in computing these fidelity measures. Subsequently, a robust class of fidelity measures are introduced, and it is shown analytically that they are resilient to distribution shift issues and are applicable in a wide range of scenarios. Extensive empirical analysis on both synthetic and real datasets are provided to illustrate that the proposed metrics are more coherent with gold standard metrics. The source code is available at https://trustai4s-lab.github.io/fidelity.
more » « less
Full Text Available
On the Convergence of Federated Averaging with Cyclic Client Participation

Cho, Yae Jee; Sharma, Pranay Sharma; Joshi, Gauri; Xu, Zheng; Kale, Satyen; Zhang, Tong (July 2023, International Conference on Machine Learning (ICML))

Federated Averaging (FedAvg) and its variants are the most popular optimization algorithms in federated learning (FL). Previous convergence analyses of FedAvg either assume full client participation or partial client participation where the clients can be uniformly sampled. However, in practical cross-device FL systems, only a subset of clients that satisfy local criteria such as battery status, network connectivity, and maximum participation frequency requirements (to ensure privacy) are available for training at a given time. As a result, client availability follows a natural cyclic pattern. We provide (to our knowledge) the first theoretical framework to analyze the convergence of FedAvg with cyclic client participation with several different client optimizers such as GD, SGD, and shuffled SGD. Our analysis discovers that cyclic client participation can achieve a faster asymptotic convergence rate than vanilla FedAvg with uniform client participation under suitable conditions, providing valuable insights into the design of client sampling protocols.
more » « less
Full Text Available
Spatially Composition-graded Monolayer WSe2xTe2−2x Nanosheets

Kai Xu, Zheng Hao (December 2021, 52th IEEE Semiconductor Interface Specialists Conference)

Alloying in two-dimensional (2D) transition metal dichalcogenides (TMD) has allowed bandgap engineering and phase transformation, which provide more flexibility and functionality for electronic and photonic devices. To date, many ternary TMD alloys with homogenous compositions have been synthesized. However, realization of bandgap modulation spatially within a single TMD nanosheet remains largely unexplored. In this work, we demonstrate the synthesis of spatially composition-graded WSe2xTe2-2x flakes using an in situ chemical vapor deposition method. The photoluminescence and Raman spectra line-scanning characterization indicate a spatially graded bandgap, which increases from 1.46 eV (center) to 1.61 eV (edge) within one monolayer flake. Furthermore, the electronic devices based on this spatially graded material exhibit tunable transfer characteristics.
more » « less
Full Text Available
The application of machine-learning and Raman spectroscopy for the rapid detection of edible oils type and adulteration

https://doi.org/10.1016/j.foodchem.2021.131471

Zhao, Hefei; Zhan, Yinglun; Xu, Zheng; John Nduwamungu, Joshua; Zhou, Yuzhen; Powers, Robert; Xu, Changmou (March 2022, Food Chemistry)

Full Text Available
Privacy-preserving Travel Time Prediction with Uncertainty Using GPS Trace Data

https://doi.org/10.1109/TMC.2021.3074865

Liu, Fang; Wang, Dong; Xu, Zheng-Quan (April 2021, IEEE Transactions on Mobile Computing)

The rapid growth of GPS technology and mobile devices has led to a massive accumulation of location data, bringing considerable benefits to individuals and society. One of the major usages of such data is travel time prediction, a typical service provided by GPS navigation devices and apps. Meanwhile, the constant collection and analysis of the individual location data also pose unprecedented privacy threats. We leverage the notion of geo-indistinguishability, an extension of differential privacy to the location privacy setting, and propose a procedure for privacy-preserving travel time prediction without collecting actual individual GPS trace data. We propose new concepts to examine the impact of the geo-indistinguishability sanitization on the usefulness of GPS traces and provide analytical and experimental utility analysis for privacy-preserving travel time prediction. We also propose new metrics to measure the adversary error in learning individual GPS traces from the collected sanitized data. Our experiment results suggest that the proposed procedure provides travel time analysis with satisfactory accuracy at reasonably small privacy costs.
more » « less
Full Text Available
O-pH: Optical pH Monitor to Measure Oral Biofilm Acidity and Assist in Enamel Health Monitoring

https://doi.org/10.1109/TBME.2022.3153659

Sharma, Manuja; Lee, Lauren K.; Carson, Matthew D.; Park, David S.; An, Se W.; Bovenkamp, Micah G.; Cayetano, Jess J.; Berude, Ian A; Xu, Zheng; Sadr, Alireza; et al (February 2022, IEEE Transactions on Biomedical Engineering)

Full Text Available
The Impact of Neural Network Overparameterization on Gradient Confusion and Stochastic Gradient Descent

Sankararaman, Karthik A; De, Soham; Xu, Zheng; Huang, Ronny; Goldstein, Tom (March 2020, International Conference on Machine Learning)

This paper studies how neural network architecture affects the speed of training. We introduce a simple concept called gradient confusion to help formally analyze this. When gradient confusion is high, stochastic gradients produced by different data samples may be negatively correlated, slowing down convergence. But when gradient confusion is low, data samples interact harmoniously, and training proceeds quickly. Through theoretical and experimental results, we demonstrate how the neural network architecture affects gradient confusion, and thus the efficiency of training. Our results show that, for popular initialization techniques, increasing the width of neural networks leads to lower gradient confusion, and thus faster model training. On the other hand, increasing the depth of neural networks has the opposite effect. Our results indicate that alternate initialization techniques or networks using both batch normalization and skip connections help reduce the training burden of very deep networks.
more » « less
Full Text Available
Universal Adversarial Training

https://doi.org/10.1609/aaai.v34i04.6017

Shafahi, Ali; Najibi, Mahyar; Xu, Zheng; Dickerson, John P; Davis, Larry S; Goldstein, Tom (April 2020, Proceedings of the AAAI Conference on Artificial Intelligence)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records