Search for: All records

Creators/Authors contains: "Yang, Jing"

« Prev Next »

Total Resources

98

Resource Type
Conference Paper

21

Conference Proceeding

2

Dataset

0

Journal Article

75

Workshop Report

0

Availability
Full Text / Resource Available

87

Citation Only

11

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Provably efficient UCB-type algorithms for learning predictive state representation

Huang, Ruiquan ; Liang, Yingbin ; Yang, Jing. ( May 2024 , Proc. International Conference on Learning Representations (ICLR))
Provably efficient UCB-type algorithms for learning predictive state representation

Huang, Ruiquan ; Liang, Yingbin ; Yang, Jing. ( May 2024 , International Conference on Learning Representations (ICLR),)

Free, publicly-accessible full text available May 7, 2025
Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets

Yang, Kun ; Shen, Cong ; Yang, Jing ; Yeh, Shu-ping ; Sydir, Jerry ( April 2024 , Conference record Asilomar Conference on Signals Systems Computers)

Free, publicly-accessible full text available April 1, 2025
Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets

Yang, Kun ; Shen, Cong ; Yang, Jing ; Yeh, Shu-ping ; Sydir, Jerry ( March 2024 , Conference record Asilomar Conference on Signals Systems Computers)

Free, publicly-accessible full text available March 1, 2025
Differential superoxide production in phosphorylated neuronal nitric oxide synthase mu and alpha variants

https://doi.org/10.1016/j.jinorgbio.2023.112454

Gyawali, Yadav Prasad ; Jiang, Ting ; Yang, Jing ; Zheng, Huayu ; Liu, Rui ; Zhang, Haikun ; Feng, Changjian ( February 2024 , Journal of Inorganic Biochemistry)

Free, publicly-accessible full text available February 1, 2025
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs

Cheng, Yuan ; Yang, Jing ; Liang, Yingbin ( December 2023 , Proc. Conference on Neural Information Processing Systems (NeurIPS 2023))

Free, publicly-accessible full text available December 10, 2024
Random Orthogonalization for Federated Learning in Massive MIMO Systems

https://doi.org/10.1109/TWC.2023.3302335

Wei, Xizixiang ; Shen, Cong ; Yang, Jing ; Vincent Poor, H. ( August 2023 , IEEE Transactions on Wireless Communications)

Free, publicly-accessible full text available August 30, 2024
Reward Teaching for Federated Multi-armed Bandits

https://doi.org/10.1109/ISIT54713.2023.10206444

Shi, Chengshuai ; Xiong, Wei ; Shen, Cong ; Yang, Jing ( June 2023 , 2023 IEEE International Symposium on Information Theory (ISIT))

Free, publicly-accessible full text available June 25, 2024
Non-stationary Reinforcement Learning under General Function Approximation

Feng, Songtao and ; Yin, Ming ; Huang, Ruiquan ; Wang, Yu-Xiang ; Yang, Jing ; Liang, Yingbin ( July 2023 , Proceedings of Machine Learning Research)
Krause, Andreas and (Ed.)
General function approximation is a powerful tool to handle large state and action spaces in a broad range of reinforcement learning (RL) scenarios. However, theoretical understanding of non-stationary MDPs with general function approximation is still limited. In this paper, we make the first such an attempt. We first propose a new complexity metric called dynamic Bellman Eluder (DBE) dimension for non-stationary MDPs, which subsumes majority of existing tractable RL problems in static MDPs as well as non-stationary MDPs. Based on the proposed complexity metric, we propose a novel confidence-set based model-free algorithm called SW-OPEA, which features a sliding window mechanism and a new confidence set design for non-stationary MDPs. We then establish an upper bound on the dynamic regret for the proposed algorithm, and show that SW-OPEA is provably efficient as long as the variation budget is not significantly large. We further demonstrate via examples of non-stationary linear and tabular MDPs that our algorithm performs better in small variation budget scenario than the existing UCB-type algorithms. To the best of our knowledge, this is the first dynamic regret analysis in non-stationary MDPs with general function approximation.
more » « less
Probing the effects of hydrogen on the materials used for large-scale transport of hydrogen through multi-scale simulations

https://doi.org/10.1016/j.rser.2023.113353

Cheng, Guang ; Wang, Xiaoli ; Chen, Kaiyuan ; Zhang, Yang ; Venkatesh, T.A. ; Wang, Xiaolin ; Li, Zunzhao ; Yang, Jing ( August 2023 , Renewable and Sustainable Energy Reviews)

Free, publicly-accessible full text available August 1, 2024

« Prev Next »