TapWeight: Reweighting Pretraining Objectives for Task-Adaptive Pretraining

Zhang, Ruiyi; Somayajula, Sai Ashish; Xie, Pengtao

Citation Details

This content will become publicly available on June 11, 2026

TapWeight: Reweighting Pretraining Objectives for Task-Adaptive Pretraining

Large-scale general domain pretraining followed by downstream-specific finetuning has become a predominant paradigm in machine learning. However, discrepancies between the pretraining and target domains can still lead to performance degradation in certain cases, underscoring the need for task-adaptive continued pretraining (TAP). TAP methods typically involve continued pretraining on task-specific unlabeled datasets or introducing additional unsupervised learning objectives to enhance model capabilities. While many TAP methods perform continued pretraining with multiple pretraining objectives, they often determine the tradeoff parameters between objectives manually, resulting in suboptimal outcomes and higher computational costs. In this paper, we propose TapWeight, a task-adaptive pretraining framework which automatically determines the optimal importance of each pretraining objective based on downstream feedback. TapWeight reweights each pretraining objective by solving a multi-level optimization problem. We applied TapWeight to both molecular property prediction and natural language processing tasks, significantly surpassing baseline methods. Experimental results validate the effectiveness and generalizability of TapWeight. more »

Award ID(s):: 2339216 2405974

PAR ID:: 10618436

Author(s) / Creator(s):: Zhang, Ruiyi; Somayajula, Sai Ashish; Xie, Pengtao

Publisher / Repository:: Transactions on Machine Learning Research

Date Published:: 2025-06-11

Journal Name:: Transactions on machine learning research

ISSN:: 2835-8856

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
This content will become publicly available on June 11, 2026
Journal Article:
The DOI is not currently available.

More Like this