Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

Bu, Y; Arminian, G; Toni, L.; Wornell, G. W.; Rodrigues, M. R.

Citation Details

We provide an information-theoretic analy- sis of the generalization ability of Gibbs- based transfer learning algorithms by focus- ing on two popular empirical risk minimiza- tion (ERM) approaches for transfer learning, α-weighted-ERM and two-stage-ERM. Our key result is an exact characterization of the generalization behavior using the conditional symmetrized Kullback-Leibler (KL) informa- tion between the output hypothesis and the target training samples given the source train- ing samples. Our results can also be applied to provide novel distribution-free generaliza- tion error upper bounds on these two afore- mentioned Gibbs algorithms. Our approach is versatile, as it also characterizes the gener- alization errors and excess risks of these two Gibbs algorithms in the asymptotic regime, where they converge to the α-weighted-ERM and two-stage-ERM, respectively. Based on our theoretical results, we show that the ben- efits of transfer learning can be viewed as a bias-variance trade-off, with the bias induced by the source distribution and the variance induced by the lack of target samples. We believe this viewpoint can guide the choice of transfer learning algorithms in practice. more »

Award ID(s):: 1717610

PAR ID:: 10378621

Author(s) / Creator(s):: Bu, Y; Arminian, G; Toni, L.; Wornell, G. W.; Rodrigues, M. R.

Date Published:: 2022-03-01

Journal Name:: Proceedings of Machine Learning Research

Volume:: 151

ISSN:: 2640-3498

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this