

Title: Low-Precision Arithmetic for Fast Gaussian Processes
Low-precision arithmetic has had a transformative effect on the training of neural networks, reducing computation, memory and energy requirements. However, despite its promise, low-precision arithmetic has received little attention for Gaussian processes (GPs), largely because GPs require sophisticated linear algebra routines that are unstable in low-precision. We study the different failure modes that can occur when training GPs in half precision. To circumvent these failure modes, we propose a multi-faceted approach involving conjugate gradients with re-orthogonalization, mixed precision, and preconditioning. Our approach significantly improves the numerical stability and practical performance of conjugate gradients in low-precision over a wide range of settings, enabling GPs to train on 1.8 million data points in 10 hours on a single GPU, without any sparse approximations.
Award ID(s): 1922658
NSF-PAR ID: 10350918
Journal Name: UAI 2022
Format(s): Medium: X
Sponsoring Org: National Science Foundation
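
The abstract's recipe combines three stabilizers for half-precision conjugate gradients: re-orthogonalization of residuals, mixed precision, and preconditioning. Below is a minimal sketch of those ingredients, assuming a symmetric positive-definite kernel matrix exposed as a float16 matrix-vector product; the names and structure are illustrative, not the authors' actual (GPyTorch-based) implementation.

```python
# Sketch: preconditioned CG with residual re-orthogonalization, where only the
# expensive O(n^2) matvec runs in float16 and all scalars/iterates stay float32.
import torch

def cg_mixed_precision(matvec, b, precond=lambda r: r, max_iters=100, tol=1e-4):
    """Solve A x = b. `matvec` is assumed to apply A in float16;
    inner products and iterates are kept in float32."""
    x = torch.zeros_like(b, dtype=torch.float32)
    r = b.to(torch.float32)            # initial residual (x0 = 0)
    z = precond(r)
    p = z.clone()
    R = [r / r.norm()]                 # unit residuals kept for re-orthogonalization
    rz = r @ z
    for _ in range(max_iters):
        Ap = matvec(p.to(torch.float16)).to(torch.float32)  # fp16 matvec only
        alpha = rz / (p @ Ap)
        x = x + alpha * p
        r = r - alpha * Ap
        # Gram-Schmidt: remove components along all previous residuals, which
        # counters the loss of orthogonality that half precision accelerates.
        for q in R:
            r = r - (q @ r) * q
        if r.norm() < tol:
            break
        R.append(r / r.norm())
        z = precond(r)
        rz_new = r @ z
        beta = rz_new / rz
        p = z + beta * p
        rz = rz_new
    return x
```

Keeping iterates and inner products in float32 while only the matrix-vector product runs in float16 is where the memory and speed savings come from; the re-orthogonalization store grows with the iteration count, which is one reason preconditioning (fewer iterations) matters in this regime.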
More Like this
  1. Gliomas have become the most common cancerous brain tumors, but manual diagnosis from 3D MRIs is time-consuming and can be inconsistent across radiotherapists, creating a pressing demand for automatic segmentation of brain tumors. State-of-the-art approaches employ FCNs to automatically segment MRI scans. In particular, 3D U-Net has achieved notable performance and motivated a series of subsequent works. However, their significant size and heavy computation have impeded their practical deployment. Although there is a body of literature on compressing CNNs with low-precision representations, existing methods either focus on storage reduction without computational improvement or cause severe performance degradation. In this article, we propose a CNN training algorithm that approximates weights and activations using non-negative integers along with trained affine mapping functions. Moreover, our approach allows the dot-product operations to be performed in an integer-arithmetic manner and defers the floating-point decoding and encoding phases until the end of layers. Experimental results on BraTS 2018 show that our trained affine mapping approach achieves near full-precision Dice accuracy with 8-bit weights and activations. In addition, we achieve Dice accuracy within 0.005 and 0.01 of the full-precision counterparts when using 4-bit and 2-bit precisions, respectively.
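
A toy sketch of the affine-mapping idea the abstract describes, with hypothetical helper names (this is not the paper's code): quantize to non-negative integers via q = round(x/s) + z, accumulate the dot product entirely in integer arithmetic, and defer the floating-point decode to a single correction at the end of the layer.

```python
# Illustrative affine quantization and integer-only dot product.
import numpy as np

def affine_quantize(x, num_bits=8):
    # Map floats to non-negative integers: x ~ scale * (q - zero_point).
    qmax = 2 ** num_bits - 1
    scale = (x.max() - x.min()) / qmax
    zero_point = np.round(-x.min() / scale)
    q = np.clip(np.round(x / scale) + zero_point, 0, qmax).astype(np.int32)
    return q, scale, zero_point

def int_dot_decode(qw, qa, sw, zw, sa, za):
    # Integer accumulation; affine offsets are corrected afterwards, so the
    # floating-point multiply happens once per layer, not per element.
    n = qw.size
    acc = int(qw.astype(np.int64) @ qa.astype(np.int64))
    corr = acc - zw * qa.sum() - za * qw.sum() + n * zw * za
    return sw * sa * corr

w, a = np.random.randn(256), np.random.rand(256)
qw, sw, zw = affine_quantize(w)
qa, sa, za = affine_quantize(a)
print(float(w @ a), int_dot_decode(qw, qa, sw, zw, sa, za))  # approximately equal
```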
  2. The connectivity of modern vehicles allows a large amount of sensor data to be monitored and analyzed during normal operation, and in recent years there has been growing interest in using this data for predictive maintenance. In this paper, a multi-label transfer learning approach is proposed: 14 different pretrained convolutional neural networks are retrained with engine simulation data to predict the failure conditions of a selected set of engine components. The retrained classifier networks are designed so that concurrent failure modes of the exhaust gas recirculation, compressor, intercooler, and fuel injectors of a four-cylinder diesel engine can be identified. Time-series simulation data of various failure conditions, including performance degradation, are generated to retrain the classifier networks to predict which components are failing at any given time. Test results show strong overall classification performance, with normalized mean average precision between 0.6 and 0.65 for most of the retrained networks. To the best of the authors' knowledge, this work represents the first attempt to characterize such time-series data using a multi-label deep learning approach.
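
The general recipe this abstract describes can be sketched as follows (illustrative, not the paper's exact architecture or data pipeline): take a pretrained torchvision CNN, replace its classifier head with a multi-label output, and train with binary cross-entropy so several failure modes can be predicted concurrently. The rendering of time-series windows as image-like tensors is an assumption here, stubbed with a dummy batch.

```python
# Multi-label transfer learning head on a pretrained CNN.
import torch
import torch.nn as nn
from torchvision import models

NUM_FAILURE_MODES = 4  # e.g., EGR, compressor, intercooler, fuel injectors

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
model.fc = nn.Linear(model.fc.in_features, NUM_FAILURE_MODES)  # multi-label head

criterion = nn.BCEWithLogitsLoss()  # one independent binary decision per component
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

# Dummy batch standing in for time-series windows rendered as images.
x = torch.randn(8, 3, 224, 224)
y = torch.randint(0, 2, (8, NUM_FAILURE_MODES)).float()  # multi-hot failure labels

optimizer.zero_grad()
loss = criterion(model(x), y)
loss.backward()
optimizer.step()
```

The key departure from ordinary transfer learning is the loss: sigmoid-per-class with binary cross-entropy, rather than softmax, lets multiple components be flagged as failing at the same time.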
  3. Geopolymers (GPs) are emerging, low‐density ceramic materials that are simple to manufacture, with high elastic modulus and strength but low toughness. Fiber reinforcements have been used to achieve varied ductile behaviors, but little is known about adding GPs to polymeric frame structures. Drawing inspiration from the nanostructure of bone, this paper investigates an interpenetrating, co‐continuous composite consisting of a GP as the stiff but brittle phase and a 3D‐printed polymer (PA12 White) as the soft, deformable phase. The composite's mechanical properties and failure modes were studied experimentally using uniaxial compression and four‐point bending tests. The co‐continuous network constrained brittle cracking within the GP and reduced strain localization in the polymer. For the specific geometries examined, the composite had higher strength (56.11 ± 2.12 MPa) and elastic modulus (6.08 ± 1.37 GPa) than the 3D‐printed polymer, and higher toughness (5.98 ± 0.24 MJ/mm3) than the GP. A study of shape effects showed that cubic structures had higher elastic modulus and strength than rectangular prism structures, at the expense of lower toughness. A study of scale effects indicated that increasing the number of periodic unit cells while maintaining consistent bulk dimensions increased strength and toughness, without statistically significant changes in elastic modulus. This paper thus presents an experimental realization of a novel, bio‐inspired, interpenetrating GP–polymer composite design offering improved strength and toughness, and provides insight into shape and size effects on the mechanical properties of this new composite.
  4. In-memory-computing (IMC) SRAM architectures have gained significant attention because they achieve high energy efficiency when computing convolutional neural network (CNN) models [1]. Recent works investigated analog-mixed-signal (AMS) hardware for high area and energy efficiency [2], [3]. However, AMS hardware output is well known to be susceptible to process, voltage, and temperature (PVT) variations, limiting the computing precision and ultimately the inference accuracy of a CNN. We reconfirmed, through simulation of a capacitor-based IMC SRAM macro computing a 256D binary dot product, that AMS computing hardware has a significant root-mean-square error (RMSE) of 22.5% across the worst-case voltage and temperature (Fig. 16.1.1 top left) and 3-sigma process variations (Fig. 16.1.1 top right). On the other hand, an IMC SRAM macro can be implemented with robust digital logic [4], which virtually eliminates the variability issue (Fig. 16.1.1 top). However, digital circuits require more devices than their AMS counterparts (e.g., 28 transistors for a mirror full adder [FA]). As a result, a recent digital IMC SRAM shows lower area efficiency (6368F2/b, 22nm, 4b/4b weight/activation) [5] than the AMS counterpart (1170F2/b, 65nm, 1b/1b) [3]. In light of this, we adopt approximate arithmetic hardware to improve area and power efficiency and present two digital IMC macros (DIMC) with different levels of approximation (Fig. 16.1.1 bottom left). We also propose an approximation-aware training algorithm and a number format to minimize the inference accuracy degradation induced by approximate hardware (Fig. 16.1.1 bottom right). We prototyped a 28nm test chip: for a 1b/1b CNN model on CIFAR-10 across a 0.5-to-1.1V supply, the DIMC with double-approximate hardware (DIMC-D) achieves 2569F2/b, 932-2219TOPS/W, 475-20032GOPS, and 86.96% accuracy, while for a 4b/1b CNN model, the DIMC with single-approximate hardware (DIMC-S) achieves 3814F2/b and 458-990TOPS/W…
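
A toy numerical illustration of the analog-versus-digital trade-off this abstract describes, under the simplifying assumption that the quoted 22.5% RMSE can be modeled as additive Gaussian noise on the analog dot-product output relative to its full-scale magnitude (real PVT effects are more structured than this):

```python
# Exact (digital-style) vs. noisy (AMS-style) 256D binary dot products.
import numpy as np

rng = np.random.default_rng(0)
D, trials, rmse = 256, 10000, 0.225
w = rng.choice([-1, 1], size=(trials, D))
a = rng.choice([-1, 1], size=(trials, D))

exact = np.einsum('td,td->t', w, a)        # digital IMC: bit-exact result
# One plausible reading of the 22.5% figure: noise std = rmse * full-scale D.
noisy = exact + rng.normal(0, rmse * D, size=trials)

print("mean |error| in dot-product units:", np.abs(noisy - exact).mean())
```

Even under this crude model, the error is large relative to the output range, which is consistent with the abstract's motivation for moving to robust (if larger) digital logic and recovering area through approximate arithmetic.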
  5. Deformable Convolutional Networks (DCNs) have been proposed as a powerful tool for boosting the representation power of Convolutional Neural Networks (CNNs) in computer vision tasks via adaptive sampling of the input feature map. Much like vision transformers, DCNs use a more flexible inductive bias than standard CNNs and have been shown to improve the performance of particular models. For example, drop-in DCN layers were shown to increase the AP score of Mask R-CNN by 10.6 points while introducing only 1% additional parameters and FLOPs, improving on the state-of-the-art model at the time of publication. However, despite evidence that more DCN layers placed earlier in the network can further improve performance, this trend has not continued with further scaling of deformations in CNNs, unlike for vision transformers. Benchmarking experiments show that a realistically sized DCN layer (64H×64W, 64 in/out channels) incurs a 4× slowdown on a GPU platform, discouraging more ubiquitous use of deformations in CNNs. These slowdowns are caused by the irregular, input-dependent access patterns of the bilinear interpolation operator, which has disproportionately low arithmetic intensity (AI) compared to the rest of the DCN. To address this slowdown and enable the expanded use of DCNs in CNNs, we propose DefT, a series of workload-aware optimizations for DCN kernels. DefT identifies performance bottlenecks in DCNs and fuses specific operators observed to limit DCN AI. Our approach also uses statistical information about DCN workloads to adapt workload tiling to the DCN layer dimensions, minimizing costly out-of-bounds input accesses. Experimental results show that DefT mitigates up to half of the DCN slowdown relative to the current state-of-the-art PyTorch implementation, translating to a layerwise speedup of up to 134% and a 46% reduction in normalized training time on a fully DCN-enabled ResNet model.
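
The bilinear-interpolation gather that this abstract identifies as the bottleneck can be sketched directly (illustrative PyTorch, not DefT's fused kernels): each output value taps four input pixels at data-dependent locations, so the operator does few FLOPs per irregular memory access, which is exactly the low arithmetic intensity described above.

```python
# Bilinear sampling at data-dependent coordinates, as in DCN sampling.
import torch

def bilinear_sample(feat, y, x):
    """feat: (H, W) tensor; y, x: float sampling coordinates (same shape)."""
    H, W = feat.shape
    y0 = y.floor().long().clamp(0, H - 2)
    x0 = x.floor().long().clamp(0, W - 2)
    y1, x1 = y0 + 1, x0 + 1
    wy, wx = y - y0.float(), x - x0.float()
    # Four irregular, input-dependent gathers per output value:
    top = (1 - wx) * feat[y0, x0] + wx * feat[y0, x1]
    bot = (1 - wx) * feat[y1, x0] + wx * feat[y1, x1]
    return (1 - wy) * top + wy * bot

feat = torch.randn(64, 64)
ys = torch.rand(16) * 63   # offset-shifted sampling locations from the network
xs = torch.rand(16) * 63
print(bilinear_sample(feat, ys, xs).shape)  # 16 sampled values
```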