Pretraining and the lasso

Craig, Erin (ORCID:0000000248936671); Pilanci, Mert; Le_Menestrel, Thomas; Narasimhan, Balasubramanian; Rivas, Manuel_A (ORCID:0000000314579925); Gullaksen, Stein-Erik; Dehghannasiri, Roozbeh (ORCID:0000000174133437); Salzman, Julia; Taylor, Jonathan; Tibshirani, Robert

doi:10.1093/jrsssb/qkaf050

Citation Details

Pretraining and the lasso

Abstract Pre-training is a powerful paradigm in machine learning to pass information across models. For example, suppose one has a modest-sized dataset of images of cats and dogs and plans to fit a deep neural network to classify them. With pre-training, we start with a neural network trained on a large corpus of images of not just cats and dogs but hundreds of classes. We fix all network weights except the top layer(s) and fine tune on our dataset. This often results in dramatically better performance than training solely on our dataset. Here, we ask: ‘Can pre-training help the lasso?’. We propose a framework where the lasso is fit on a large dataset and then fine-tuned on a smaller dataset. The latter can be a subset of the original, or have a different but related outcome. This framework has a wide variety of applications, including stratified and multi-response models. In the stratified model setting, lasso pre-training first estimates coefficients common to all groups, then estimates group-specific coefficients during fine-tuning. Under appropriate assumptions, support recovery of the common coefficients is superior to the usual lasso trained on individual groups. This separate identification of common and individual coefficients also aids scientific understanding. more »

Award ID(s):: 2113389

PAR ID:: 10627618

Author(s) / Creator(s):: Craig, Erin; Pilanci, Mert; Le_Menestrel, Thomas; Narasimhan, Balasubramanian; Rivas, Manuel_A; Gullaksen, Stein-Erik; Dehghannasiri, Roozbeh; Salzman, Julia; Taylor, Jonathan; Tibshirani, Robert

Publisher / Repository:: Oxford University Press

Date Published:: 2025-08-08

Journal Name:: Journal of the Royal Statistical Society Series B: Statistical Methodology

ISSN:: 1369-7412

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1093/jrsssb/qkaf050

More Like this