NSF PAR Search | NSF Public Access Repository


Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression

Slivkins, Aleksandrs; Zhou, Xingyu; Sankararaman, Karthik; Foster, Dylan (December 2024, Journal of machine learning research)

Full Text Available
Locally Private and Robust Multi-Armed Bandits

Zhou, Xingyu; Zhang, Wei (November 2024, 38th Conference on Neural Information Processing Systems (NeurIPS 2024))

Full Text Available
On Differentially Private Federated Linear Contextual Bandits

Zhou, Xingyu; Chowdhury, Sayak R (May 2024, The Twelfth International Conference on Learning Representations (ICLR 2024))

Full Text Available
(Private) Kernelized Bandits with Distributed Biased Feedback

https://doi.org/10.1145/3579318

Li, Fengjiao; Zhou, Xingyu; Ji, Bo (March 2023, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

In this paper, we study kernelized bandits with distributed biased feedback. This problem is motivated by several real-world applications (such as dynamic pricing, cellular network configuration, and policy making), where users from a large population contribute to the reward of the action chosen by a central entity, but it is difficult to collect feedback from all users. Instead, only biased feedback (due to user heterogeneity) from a subset of users may be available. In addition to such partial biased feedback, we are also faced with two practical challenges due to communication cost and computation complexity. To tackle these challenges, we carefully design a new distributed phase-then-batch-based elimination (DPBE) algorithm, which samples users in phases for collecting feedback to reduce the bias and employs maximum variance reduction to select actions in batches within each phase. By properly choosing the phase length, the batch size, and the confidence width used for eliminating suboptimal actions, we show that DPBE achieves a sublinear regret of $\tilde{O}(T^{1-\alpha/2} + \sqrt{\gamma_T T})$, where $\alpha \in (0,1)$ is the user-sampling parameter one can tune. Moreover, DPBE can significantly reduce both communication cost and computation complexity in distributed kernelized bandits, compared to some variants of the state-of-the-art algorithms (originally developed for standard kernelized bandits). Furthermore, by incorporating various differential privacy models (including the central, local, and shuffle models), we generalize DPBE to provide privacy guarantees for users participating in the distributed learning process. Finally, we conduct extensive simulations to validate our theoretical results and evaluate the empirical performance.
Interference Constrained Beam Alignment for Time-Varying Channels via Kernelized Bandits

https://doi.org/10.23919/WiOpt56218.2022.9930591

Deng, Yuntian; Zhou, Xingyu; Ghosh, Arnob; Gupta, Abhishek; Shroff, Ness B. (September 2022, 2022 20th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt))

Full Text Available
Differentially Private Linear Bandits with Partial Distributed Feedback

https://doi.org/10.23919/WiOpt56218.2022.9930524

Li, Fengjiao; Zhou, Xingyu; Ji, Bo (September 2022, 2022 20th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt))

Full Text Available
On Kernelized Multi-Armed Bandits with Constraints

Zhou, Xingyu; Ji, Bo (January 2022, NeurIPS 2022)

Full Text Available
