Search for: All records

Award ID contains: 2203412

Total Resources: 10

Resource Type
Conference Paper: 5
Conference Proceeding: 0
Dataset: 0
Journal Article: 5
Workshop Report: 0

Availability
Full Text / Resource Available: 8
Citation Only: 2

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap

Wang, Hang ; Lin, Sen ; Zhang, Junshan ( July 2023 , 2023 International Conference on Machine Learning)
Barbara Engelhardt, Emma Brunskill (Ed.)
Free, publicly-accessible full text available July 23, 2024
HiFlash: Communication-Efficient Hierarchical Federated Learning With Adaptive Staleness Control and Heterogeneity-Aware Client-Edge Association

https://doi.org/10.1109/TPDS.2023.3238049

Wu, Qiong ; Chen, Xu ; Ouyang, Tao ; Zhou, Zhi ; Zhang, Xiaoxi ; Yang, Shusen ; Zhang, Junshan ( May 2023 , IEEE Transactions on Parallel and Distributed Systems)

Free, publicly-accessible full text available May 1, 2024
Guest Editorial Communication-Efficient Distributed Learning Over Networks

https://doi.org/10.1109/JSAC.2023.3241848

Cao, Xuanyu ; Başar, Tamer ; Diggavi, Suhas ; Eldar, Yonina C. ; Letaief, Khaled B. ; Poor, H. Vincent ; Zhang, Junshan ( April 2023 , IEEE Journal on Selected Areas in Communications)

Full Text Available
Communication-Efficient Distributed Learning: An Overview

https://doi.org/10.1109/JSAC.2023.3242710

Cao, Xuanyu ; Başar, Tamer ; Diggavi, Suhas ; Eldar, Yonina C. ; Letaief, Khaled B. ; Poor, H. Vincent ; Zhang, Junshan ( April 2023 , IEEE Journal on Selected Areas in Communications)

Full Text Available
FedHome: Cloud-Edge Based Personalized Federated Learning for In-Home Health Monitoring

https://doi.org/10.1109/TMC.2020.3045266

Wu, Qiong ; Chen, Xu ; Zhou, Zhi ; Zhang, Junshan ( August 2022 , IEEE Transactions on Mobile Computing)

Full Text Available
TRGP: Trust Region Gradient Projection for Continual Learning

Lin, Sen ; Yang, Li ; Fan, Deliang ; Zhang, Junshan ( April 2022 , The Tenth International Conference on Learning Representations)

Catastrophic forgetting is one of the major challenges in continual learning. To address this issue, some existing methods put restrictive constraints on the optimization space of the new task for minimizing the interference to old tasks. However, this may lead to unsatisfactory performance for the new task, especially when the new task is strongly correlated with old tasks. To tackle this challenge, we propose Trust Region Gradient Projection (TRGP) for continual learning to facilitate the forward knowledge transfer based on an efficient characterization of task correlation. Particularly, we introduce a notion of 'trust region' to select the most related old tasks for the new task in a layer-wise and single-shot manner, using the norm of gradient projection onto the subspace spanned by task inputs. Then, a scaled weight projection is proposed to cleverly reuse the frozen weights of the selected old tasks in the trust region through a layer-wise scaling matrix. By jointly optimizing the scaling matrices and the model, where the model is updated along the directions orthogonal to the subspaces of old tasks, TRGP can effectively prompt knowledge transfer without forgetting. Extensive experiments show that our approach achieves significant improvement over related state-of-the-art methods.
Full Text Available
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback

Wang, Hang ; Lin, Sen ; Zhang, Junshan ( December 2021 , Advances in Neural Information Processing Systems 34 (NeurIPS 2021))

The ensemble method is a promising way to mitigate the overestimation issue in Q-learning, where multiple function approximators are used to estimate the action values. It is known that the estimation bias hinges heavily on the ensemble size (i.e., the number of Q-function approximators used in the target), and that determining the 'right' ensemble size is highly nontrivial, because of the time-varying nature of the function approximation errors during the learning process. To tackle this challenge, we first derive an upper bound and a lower bound on the estimation bias, based on which the ensemble size is adapted to drive the bias to be nearly zero, thereby coping with the impact of the time-varying approximation errors accordingly. Motivated by the theoretical findings, we advocate that the ensemble method can be combined with Model Identification Adaptive Control (MIAC) for effective ensemble size adaptation. Specifically, we devise Adaptive Ensemble Q-learning (AdaEQ), a generalized ensemble method with two key steps: (a) approximation error characterization which serves as the feedback for flexibly controlling the ensemble size, and (b) ensemble size adaptation tailored towards minimizing the estimation bias. Extensive experiments show that AdaEQ improves learning performance over existing methods on the MuJoCo benchmark.
Full Text Available
MetaGater: Fast Learning of Conditional Channel Gated Networks via Federated Meta-Learning

https://doi.org/10.1109/MASS52906.2021.00031

Lin, Sen ; Yang, Li ; He, Zhezhi ; Fan, Deliang ; Zhang, Junshan ( October 2021 , 2021 IEEE 18th International Conference on Mobile Ad Hoc and Smart Systems (MASS))

There has recently been an increasing interest in computationally-efficient learning methods for resource-constrained applications, e.g., pruning, quantization and channel gating. In this work, we advocate a holistic approach to jointly train the backbone network and the channel gating which can speed up subnet selection for a new task at the resource-limited node. In particular, we develop a federated meta-learning algorithm to jointly train good meta-initializations for both the backbone networks and gating modules, by leveraging the model similarity across learning tasks on different nodes. In this way, the learnt meta-gating module effectively captures the important filters of a good meta-backbone network, and a task-specific conditional channel gated network can be quickly adapted from the meta-initializations using data samples of the new task. The convergence of the proposed federated meta-learning algorithm is established under mild conditions. Experimental results corroborate the effectiveness of our method in comparison to related work.
Full Text Available
Distributed Q-Learning with State Tracking for Multi-agent Networked Control

Wang, Hang ; Lin, Sen ; Jafarkhani, Hamid ; Zhang, Junshan ( May 2021 , AAMAS '21: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems)

This paper studies distributed Q-learning for the Linear Quadratic Regulator (LQR) in a multi-agent network. Existing results often assume that agents can observe the global system state, which may be infeasible in large-scale systems due to privacy concerns or communication constraints. In this work, we consider a setting with unknown system models and no centralized coordinator. We devise a state tracking (ST) based Q-learning algorithm to design optimal controllers for agents. Specifically, we assume that agents maintain local estimates of the global state based on their local information and communications with neighbors. At each step, every agent updates its local estimate of the global state, based on which it solves an approximate Q-factor locally through policy iteration. Assuming a decaying injected excitation noise during the policy evaluation, we prove that the local estimate converges to the true global state, and establish the convergence of the proposed distributed ST-based Q-learning algorithm. The experimental studies corroborate our theoretical results by showing that our proposed method achieves performance comparable to the centralized case.
Full Text Available
Impact of Social Learning on Privacy-Preserving Data Collection

https://doi.org/10.1109/JSAIT.2021.3053545

Akbay, Abdullah Basar ; Wang, Weina ; Zhang, Junshan ( March 2021 , IEEE Journal on Selected Areas in Information Theory)

Full Text Available