Objective: The objective is to estimate the relative contributions of nonresponse, coverage, and measurement biases in survey estimates of voting. Methods: We survey 3,000 Boston-area households sampled from an address-based frame matched, when possible, to telephone numbers. A two-phase sampling design was used to follow up nonrespondents from phone interviews with personal interviews. All cases were then linked to voting records. Results: Nonresponse, coverage, and measurement biased survey estimates at varying stages of the study design. Coverage error linked to missing telephone numbers biased estimates that excluded nonphone households. Overall estimates including nonphone households and nonrespondent interviews carry 25 percent relative bias, attributable in equal parts to measurement and nonresponse. Conclusion: Bias in voting measures is not limited to measurement bias. Researchers should also assess the potential for nonresponse and coverage biases.
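As a concrete illustration of the arithmetic behind these results, the following minimal sketch decomposes a relative-bias figure across design stages. All numbers are hypothetical placeholders chosen only to mirror the reported 25 percent split, and `relative_bias` is an illustrative helper, not part of the study.

```python
# Minimal sketch of a relative-bias decomposition for a survey estimate
# validated against administrative records. All numbers below are
# hypothetical placeholders, not the study's data.

def relative_bias(estimate: float, truth: float) -> float:
    """Relative bias of a survey estimate against a record-based benchmark."""
    return (estimate - truth) / truth

# Hypothetical benchmark turnout from linked voting records.
record_turnout = 0.60

# Hypothetical estimates at successive design stages.
est_with_nonphone = 0.75   # full sample incl. nonphone households and follow-ups
validated_reports = 0.675  # respondents' own reports scored against records

total_bias = relative_bias(est_with_nonphone, record_turnout)    # 0.25
measurement_bias = relative_bias(validated_reports, record_turnout)  # 0.125
nonresponse_bias = total_bias - measurement_bias                     # 0.125

print(f"total {total_bias:.3f}, measurement {measurement_bias:.3f}, "
      f"nonresponse {nonresponse_bias:.3f}")
```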
Direct Loss Minimization for Sparse Gaussian Processes
The paper provides a thorough investigation of Direct Loss Minimization (DLM), which optimizes the posterior to minimize predictive loss, in sparse Gaussian processes. For the conjugate case, we consider DLM for log-loss and DLM for square loss, showing a significant performance improvement in both cases. The application of DLM in non-conjugate cases is more complex because the logarithm of the expectation in the log-loss DLM objective is often intractable, and simple sampling leads to biased estimates of gradients. The paper makes two technical contributions to address this. First, a new method using product sampling is proposed, which gives unbiased estimates of gradients (uPS) for the objective function. Second, a theoretical analysis of biased Monte Carlo estimates (bMC) shows that stochastic gradient descent converges despite the biased gradients. Experiments demonstrate the empirical success of DLM. A comparison of the sampling methods shows that, while uPS is potentially more sample-efficient, bMC provides a better tradeoff in terms of convergence time and computational efficiency.
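To make the bMC issue concrete, here is a minimal numpy sketch (not the paper's code) of why plugging a Monte Carlo average inside the logarithm gives a biased estimate of -log E_q[p(y|f)]: by Jensen's inequality the estimate is biased upward, and the bias shrinks as the sample count grows. The Gaussian toy model and all numbers are illustrative assumptions.

```python
# Toy demonstration of the biased Monte Carlo (bMC) estimate of the
# log-loss DLM objective term -log E_q[p(y|f)].
import numpy as np

rng = np.random.default_rng(0)

def bmc_estimate(m, s, y, n_samples):
    """Biased MC estimate of -log E_q[p(y|f)] with q = N(m, s^2)
    and a Gaussian likelihood p(y|f) = N(y; f, 1)."""
    f = m + s * rng.standard_normal(n_samples)      # reparameterized draws from q
    lik = np.exp(-0.5 * (y - f) ** 2) / np.sqrt(2 * np.pi)
    return -np.log(lik.mean())                      # log of an average: biased upward

m, s, y = 0.0, 1.0, 0.5
# Closed form for this conjugate toy model: E_q[N(y; f, 1)] = N(y; m, 1 + s^2).
exact = 0.5 * np.log(2 * np.pi * (1 + s**2)) + 0.5 * (y - m) ** 2 / (1 + s**2)

for n_samples in (1, 10, 1000):
    est = np.mean([bmc_estimate(m, s, y, n_samples) for _ in range(2000)])
    print(f"S={n_samples:5d}  mean bMC objective {est:.4f}  exact {exact:.4f}")
```

Running this shows the average bMC objective sitting above the exact value for small sample counts and approaching it as S grows, which is the regime the paper's convergence analysis addresses.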
- Award ID(s):
- 1906694
- PAR ID:
- 10231699
- Date Published:
- Journal Name:
- Proceedings of Machine Learning Research
- Volume:
- 130
- ISSN:
- 2640-3498
- Page Range / eLocation ID:
- 2566-2574
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Abstract: Individual body size distributions (ISD) within communities are remarkably consistent across habitats and spatiotemporal scales and can be represented by size spectra, which are described by a power law. The focus of size spectra analysis is to estimate the exponent (λ) of the power law. A common application of size spectra studies is to detect anthropogenic pressures.

Many methods have been proposed for estimating λ, most of which involve binning the data, counting the abundance within bins, and then fitting an ordinary least squares regression in log-log space. However, recent work has shown that binning procedures return biased estimates of λ compared to procedures that directly estimate λ using maximum likelihood estimation (MLE). While it is clear that MLE produces less biased estimates of site-specific λ's, it is less clear how this bias affects the ability to test for changes in λ across space and time, a common question in the ecological literature.

Here, we used simulation to compare the ability of two normalised binning methods (equal logarithmic and log2 bins) and MLE to (1) recapture known values of λ, and (2) recapture parameters in a linear regression measuring the change in λ across a hypothetical environmental gradient. We also compared the methods using two previously published body size datasets across a natural temperature gradient and an anthropogenic pollution gradient.

Maximum likelihood methods always performed better than common binning methods, which demonstrated consistent bias depending on the simulated values of λ. This bias carried over to the regressions, which were more accurate when λ was estimated using MLE compared to the binning procedures. Additionally, the variance in estimates using MLE methods is markedly reduced when compared to binning methods.

The error induced by binning methods can be of similar magnitude to the variation previously published in experimental and observational studies, bringing into question the effect sizes of previously published results. However, while the methods produced different regression slope estimates, they were in qualitative agreement on the sign of those slopes (i.e. all negative or all positive). Our results provide further support for the direct estimation of λ and its relative variation across environmental gradients using MLE over the more common methods of binning.
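For readers unfamiliar with the two approaches, the following is a minimal sketch contrasting the closed-form MLE for λ with a normalised log2-binning + OLS fit. The data are simulated, and the simple unbounded power-law MLE is used for convenience; the paper's analyses use more careful bounded forms.

```python
# Contrast MLE and normalised log2 binning for a size-spectrum exponent.
import numpy as np

rng = np.random.default_rng(1)

def sample_power_law(n, lam, xmin=1.0):
    """Draw body sizes from f(x) ~ x**lam on [xmin, inf), lam < -1,
    by inverse-CDF sampling."""
    u = rng.uniform(size=n)
    return xmin * (1 - u) ** (1.0 / (lam + 1))

def mle_lambda(x, xmin=1.0):
    """Closed-form MLE for the exponent of an unbounded power law."""
    return -1.0 - len(x) / np.log(x / xmin).sum()

def binned_lambda(x, xmin=1.0):
    """Normalised log2 binning followed by OLS in log-log space."""
    edges = 2.0 ** np.arange(0, np.ceil(np.log2(x.max() / xmin)) + 1) * xmin
    counts, _ = np.histogram(x, bins=edges)
    widths = np.diff(edges)
    mids = np.sqrt(edges[:-1] * edges[1:])   # geometric bin midpoints
    keep = counts > 0
    slope, _ = np.polyfit(np.log(mids[keep]),
                          np.log(counts[keep] / widths[keep]), 1)
    return slope

x = sample_power_law(5000, lam=-2.0)
print(f"MLE lambda    {mle_lambda(x):+.3f}")
print(f"binned lambda {binned_lambda(x):+.3f}")
```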
-
In this paper, a novel way to compute derivative-based global sensitivity measures is presented. Conjugate Unscented Transform (CUT) is used to evaluate the multidimensional definite integrals which lead to the sensitivity measures. The method is compared with Monte Carlo estimates as well as the screening method of Morris. It is shown that using CUT provides a much more accurate estimate of sensitivity measures as compared to Monte Carlo (with far lower computational cost) as well as the Morris method (with similar computational cost). Illustrations on three test functions are presented as evidence.
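As context, derivative-based global sensitivity measures (DGSM) are typically of the form ν_i = E[(∂f/∂x_i)²]. The sketch below estimates them by plain Monte Carlo on the Ishigami test function, an assumed example; the CUT construction itself, which replaces this sampling with a small deterministic set of cubature points, is not reproduced here.

```python
# Plain Monte Carlo estimate of DGSM nu_i = E[(df/dx_i)^2] for the
# Ishigami test function f(x) = sin(x1) + 7 sin(x2)^2 + 0.1 x3^4 sin(x1).
import numpy as np

rng = np.random.default_rng(2)

def f_grad(x):
    """Analytic gradient of the Ishigami function."""
    x1, x2, x3 = x[..., 0], x[..., 1], x[..., 2]
    g1 = np.cos(x1) * (1 + 0.1 * x3**4)
    g2 = 14 * np.sin(x2) * np.cos(x2)
    g3 = 0.4 * x3**3 * np.sin(x1)
    return np.stack([g1, g2, g3], axis=-1)

# Inputs uniform on [-pi, pi]^3, the standard domain for this test function.
x = rng.uniform(-np.pi, np.pi, size=(100_000, 3))
nu = (f_grad(x) ** 2).mean(axis=0)   # MC estimate of E[(df/dx_i)^2]
print("DGSM estimates:", np.round(nu, 3))
```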
-
ABSTRACT: We explore the radial variation of star formation histories (SFHs) in dwarf galaxies simulated with Feedback In Realistic Environments (FIRE) physics. The sample contains 26 field dwarf galaxies with Mstar = 10⁵–10⁹ M⊙. We find age gradients are common in our dwarfs, with older stars dominant at large radii. The strength of the gradient correlates with overall galaxy age such that earlier star formation produces a more pronounced gradient. The relation between formation time and strength of the gradient is driven by both mergers and star formation feedback. Mergers can both steepen and flatten the age gradient depending on the timing of the merger and the SFHs of the merging galaxies. In galaxies without significant mergers, feedback pushes stars to the outskirts. The strength of the age gradient is then determined by the subsequent evolution of the galaxy. Galaxies with weak age gradients grow steadily to z = 0, meaning that young stars form at radii similar to those to which older stars have been heated. In contrast, galaxies with strong age gradients tend to maintain a constant half-mass radius over time. If real galaxies have age gradients as we predict, stellar population studies that rely on sampling a limited fraction of a galaxy can give a biased view of its global SFH. Central fields can be biased young by gigayears, while outer fields are biased old. Fields positioned near the 2D half-light radius will provide the least biased measure of a dwarf galaxy's global SFH.
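As an illustration of what such an age gradient looks like operationally, here is a minimal sketch that measures a mass-weighted mean stellar age profile from star-particle arrays. The data are synthetic placeholders, not FIRE outputs, and the positive-gradient trend is built into the mock by construction.

```python
# Mass-weighted mean stellar age in radial bins from mock star particles.
import numpy as np

rng = np.random.default_rng(3)

# Mock star particles: radii (kpc), ages (Gyr), masses (Msun), with older
# stars placed preferentially at larger radii to mimic an age gradient.
n = 50_000
r = rng.exponential(scale=1.5, size=n)
age = np.clip(2.0 + 2.5 * r + rng.normal(0, 1, n), 0, 13.8)
mass = np.full(n, 1e3)

# Mass-weighted mean age per radial bin.
edges = np.linspace(0, 10, 21)
idx = np.digitize(r, edges) - 1
profile = [np.average(age[idx == i], weights=mass[idx == i])
           for i in range(len(edges) - 1) if np.any(idx == i)]
print(np.round(profile, 2))   # ages rising with radius => an age gradient
```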
-
De Vico Fallani, Fabrizio (Ed.) The exponential family random graph modeling (ERGM) framework provides a highly flexible approach for the statistical analysis of networks (i.e., graphs). As ERGMs with dyadic dependence involve normalizing factors that are extremely costly to compute, practical strategies for ERGM inference generally employ a variety of approximations or other workarounds. Markov chain Monte Carlo maximum likelihood (MCMC MLE) provides a powerful tool to approximate the maximum likelihood estimator (MLE) of ERGM parameters, and is generally feasible for typical models on single networks with as many as a few thousand nodes. MCMC-based algorithms for Bayesian analysis are more expensive, and high-quality answers are challenging to obtain on large graphs. For both strategies, extension to the pooled case, in which we observe multiple networks from a common generative process, adds further computational cost, with both time and memory scaling linearly in the number of graphs. This becomes prohibitive for large networks, or cases in which large numbers of graph observations are available. Here, we exploit some basic properties of the discrete exponential families to develop an approach for ERGM inference in the pooled case that (where applicable) allows an arbitrarily large number of graph observations to be fit at no additional computational cost beyond preprocessing the data itself. Moreover, a variant of our approach can also be used to perform Bayesian inference under conjugate priors, again with no additional computational cost in the estimation phase. The latter can be employed either for single graph observations, or for observations from graph sets. As we show, the conjugate prior is easily specified, and is well-suited to applications such as regularization. Simulation studies show that the pooled method leads to estimates with good frequentist properties, and posterior estimates under the conjugate prior are well-behaved. We demonstrate the usefulness of our approach with applications to pooled analysis of brain functional connectivity networks and to replicated X-ray crystal structures of hen egg-white lysozyme.
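The pooling trick rests on a basic exponential-family identity: the pooled likelihood depends on the data only through the summed sufficient statistics, so after one preprocessing pass over the graphs the fit costs the same as for a single observation. The sketch below demonstrates this with the simplest possible "ERGM", a single edge-count statistic (a Bernoulli graph) whose normalizing constant is tractable; it illustrates the property and is not the paper's software.

```python
# Pooled MLE for a one-statistic ERGM (Bernoulli graph): the likelihood
# over N graphs depends only on the summed edge count.
import numpy as np
from scipy.optimize import minimize_scalar

n_nodes = 30
n_dyads = n_nodes * (n_nodes - 1) // 2

# Mock observations: edge counts of N graphs from a common process.
rng = np.random.default_rng(4)
true_theta = -1.0
p = 1 / (1 + np.exp(-true_theta))
edge_counts = rng.binomial(n_dyads, p, size=500)   # N = 500 graphs

# Preprocessing: all the data the pooled likelihood ever needs.
s_total, N = edge_counts.sum(), len(edge_counts)

def neg_pooled_loglik(theta):
    # Per-graph log normalizer: log Z(theta) = n_dyads * log(1 + e^theta).
    return -(theta * s_total - N * n_dyads * np.log1p(np.exp(theta)))

theta_hat = minimize_scalar(neg_pooled_loglik, bounds=(-5, 5),
                            method="bounded").x
print(f"pooled MLE theta = {theta_hat:.3f} (true {true_theta})")
```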