NSF PAR Search | NSF Public Access Repository

Actor-Critic Alignment for Offline-to-Online Reinforcement Learning

Yu, Zishun; Zhang, Xinhua (July 2023, International Conference on Machine Learning (ICML))

Full Text Available
Certifying Robust Graph Classification under Orthogonal Gromov-Wasserstein Threats

Hongwei Jin; Zishun Yu; Xinhua Zhang (December 2022, Advances in neural information processing systems)

Full Text Available
Moment Distributionally Robust Tree Structured Prediction

Li, Yeshu; Saeed, Danyal; Zhang, Xinhua; Ziebart, Brian D; Gimpel, Kevin (December 2022, Advances in neural information processing systems)

Full Text Available
Orthogonal Gromov-Wasserstein Discrepancy with Efficient Lower Bound

Hongwei Jin; Zishun Yu; Xinhua Zhang (August 2022, Uncertainty in artificial intelligence)

Full Text Available
Warping Layer: Representation Learning for Label Structures in Weakly Supervised Learning

Yingyi Ma; Xinhua Zhang (March 2022, Proceedings of the International Workshop on Artificial Intelligence and Statistics)

Full Text Available
Distributionally Robust Imitation Learning

Bashiri, Mohammad Ali; Ziebart, Brian; Zhang, Xinhua (December 2021, Advances in neural information processing systems)

We consider the imitation learning problem of learning a policy in a Markov Decision Process (MDP) setting where the reward function is not given, but demonstrations from experts are available. Although the goal of imitation learning is to learn a policy that produces behaviors nearly as good as the experts’ for a desired task, assumptions of consistent optimality for demonstrated behaviors are often violated in practice. Finding a policy that is distributionally robust against noisy demonstrations based on an adversarial construction potentially solves this problem by avoiding optimistic generalizations of the demonstrated data. This paper studies Distributionally Robust Imitation Learning (DRoIL) and establishes a close connection between DRoIL and Maximum Entropy Inverse Reinforcement Learning. We show that DRoIL can be seen as a framework that maximizes a generalized concept of entropy. We develop a novel approach to transform the objective function into a convex optimization problem over a polynomial number of variables for a class of loss functions that are additive over state and action spaces. Our approach lets us optimize both stationary and non-stationary policies and, unlike prevalent previous methods, it does not require repeatedly solving an inner reinforcement learning problem. We experimentally show the significant benefits of DRoIL’s new optimization method on synthetic data and a highway driving environment.
Full Text Available
Distributionally Robust Imitation Learning

Mohammad Ali Bashiri; Brian D. Ziebart; Xinhua Zhang (December 2021, Advances in neural information processing systems)

Full Text Available
Generalised Lipschitz Regularisation Equals Distributional Robustness

Cranko, Zac; Shi, Zhan; Zhang, Xinhua; Nock, Richard; Kornblith, Simon (January 2021, International Conference on Machine Learning (ICML))
Full Text Available
Implicit Task-Driven Probability Discrepancy Measure for Unsupervised Domain Adaptation

Mao Li; Kaiqi Jiang; Xinhua Zhang (January 2021, Advances in neural information processing systems)

Full Text Available
Convex Representation Learning for Generalized Invariance in Semi-Inner-Product Space

Ma, Yingyi; Ganapathiraman, Vignesh; Yu, Yaoliang; Zhang, Xinhua (July 2020, International Conference on Machine Learning (ICML))

Invariance (defined in a general sense) has been one of the most effective priors for representation learning. Direct factorization of parametric models is feasible only for a small range of invariances, while regularization approaches, despite improved generality, lead to nonconvex optimization. In this work, we develop a convex representation learning algorithm for a variety of generalized invariances that can be modeled as semi-norms. Novel Euclidean embeddings are introduced for kernel representers in a semi-inner-product space, and approximation bounds are established. This allows invariant representations to be learned efficiently and effectively as confirmed in our experiments, along with accurate predictions.
Full Text Available
