NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms

Bukharin, Alexander; Li, Yan; Yu, Yue; Zhang, Qingru; Chen, Zhehui; Zuo, Simiao; Zhang, Chao; Zhang, Songan; Zhao, Tuo. (December 2023, Conference on Neural Information Processing Systems)

Full Text Available
SMURF-THP: score matching-based uncertainty quantification for transformer Hawkes process

Li, Zichong; Xu, Yanbo; Zuo, Simiao; Jiang, Haoming; Zhang, Chao; Zhao, Tuo; Zha, Hongyuan (September 2023, International Conference on Machine Learning)

Full Text Available
Machine Learning Force Fields with Data Cost Aware Training

Bukharin, Alexander; Liu, Tianyi; Wang, Shengjie; Zuo, Simiao; Gao, Weihao; Yan, Wen; Zhao, Tuo. (July 2023, International Conference on Machine Learning)

Full Text Available
Context-Aware Query Rewriting for Improving Users’ Search Experience on E-commerce Websites

https://doi.org/10.18653/v1/2023.acl-industry.59

Zuo, Simiao; Yin, Qingyu; Jiang, Haoming; Xi, Shaohui; Yin, Bing; Zhang, Chao; Zhao, Tuo (January 2023, Association for Computational Linguistics)

Full Text Available
MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation

https://doi.org/10.18653/v1/2022.naacl-main.116

Zuo, Simiao; Zhang, Qingru; Liang, Chen; He, Pengcheng; Zhao, Tuo; Chen, Weizhu (January 2022, Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)

Full Text Available
Self-Training with Differentiable Teacher

https://doi.org/10.18653/v1/2022.findings-naacl.70

Zuo, Simiao; Yu, Yue; Liang, Chen; Jiang, Haoming; Er, Siawpeng; Zhang, Chao; Zhao, Tuo; Zha, Hongyuan (January 2022, Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies)

Full Text Available
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization

https://doi.org/10.18653/v1/2021.acl-long.510

Liang, Chen; Zuo, Simiao; Chen, Minshuo; Jiang, Haoming; Liu, Xiaodong; He, Pengcheng; Zhao, Tuo; Chen, Weizhu (July 2021, Annual Meeting of the Association for Computational Linguistics)

Full Text Available
A Hypergradient Approach to Robust Regression without Correspondence

Xie, Yujia; Mao, Yixiu; Zuo, Simiao; Xu, Hongteng; Ye, Xiaojing; Zhao, Tuo; Zha, Hongyuan. (April 2021, International Conference on Learning Representations)

Full Text Available
A Hypergradient Approach to Robust Regression without Correspondence

Xie, Yujia; Mao, Yixiu; Zuo, Simiao; Xu, Hongteng; Ye, Xiaojing; Zhao, Tuo; Zha, Hongyuan (April 2021, International Conference on Learning Representations)
null (Ed.)
We consider a regression problem, where the correspondence between the input and output data is not available. Such shuffled data are commonly observed in many real world problems. Take flow cytometry as an example: the measuring instruments are unable to preserve the correspondence between the samples and the measurements. Due to the combinatorial nature of the problem, most of the existing methods are only applicable when the sample size is small, and are limited to linear regression models. To overcome such bottlenecks, we propose a new computational framework --- ROBOT --- for the shuffled regression problem, which is applicable to large data and complex models. Specifically, we propose to formulate regression without correspondence as a continuous optimization problem. Then by exploiting the interaction between the regression model and the data correspondence, we propose to develop a hypergradient approach based on differentiable programming techniques. Such a hypergradient approach essentially views the data correspondence as an operator of the regression model, and therefore it allows us to find a better descent direction for the model parameters by differentiating through the data correspondence. ROBOT is quite general, and can be further extended to an inexact correspondence setting, where the input and output data are not necessarily exactly aligned. Thorough numerical experiments show that ROBOT achieves better performance than existing methods in both linear and nonlinear regression tasks, including real-world applications such as flow cytometry and multi-object tracking.
more » « less
Full Text Available
A Hypergradient Approach to Robust Regression without Correspondence

Xie, Yujia; Mao, Yixiu; Zuo, Simiao; Xu, Hongteng; Ye, Xiaojing; Zhao, Tuo; Zha, Hongyuan (January 2021, International Conference on Learning Representations)

Full Text Available

« Prev Next »

Search for: All records