NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Autonomous Generative Feature Replay for Non-Exemplar Class-Incremental Learning

https://doi.org/10.1109/ICASSP48485.2024.10448256

Zhang, Yinjie; Shao, Ming; Shi, Wenlong; Xia, Haifeng; Xia, Siyu (April 2024, ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP))

Full Text Available
Few-shot Shape Recognition by Learning Deep Shape-aware Features

https://doi.org/10.1109/WACV57701.2024.00186

Shi, Wenlong; Lu, Changsheng; Shao, Ming; Zhang, Yinjie; Xia, Siyu; Koniusz, Piotr (January 2024, 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV))

Full Text Available
Critic-over-Actor-Critic Modeling: Finding Optimal Strategy in ICU Environments

https://doi.org/10.1109/BigData55660.2022.10021125

Ryan, Riazat; Shao, Ming (December 2022, 2022 IEEE International Conference on Big Data (Big Data))

Reinforcement learning (RL) is mechanized to learn from experience. It solves the problem in sequential decisions by optimizing reward-punishment through experimentation of the distinct actions in an environment. Unlike supervised learning models, RL lacks static input-output mappings and the objective of minimization of a vector error. However, to find out an optimal strategy, it is crucial to learn both continuous feedback from training data and the offline rules of the experiences with no explicit dependence on online samples. In this paper, we present a study of a multi-agent RL framework which involves a Critic in semi-offline mode criticizing over an online Actor-Critic network, namely, Critic-over-Actor-Critic (CoAC) model, in finding optimal treatment plan of ICU patients as well as optimal strategy in a combative battle game. For further validation, we also examine the model in the adversarial assignment.
more » « less
Full Text Available

Search for: All records