BirdieDNA: Reward-Based Pre-Training for Genomic Sequence Modeling

Blouir, Samuel; Circi, Defne; Moldwin, Asher; Shehu, Amarda

Citation Details

Transformer-based language models have shown promise in genomics but face challenges unique to DNA, such as sequence lengths spanning hundreds of millions of base pairs and subtle long-range dependencies. Although next-token prediction remains the predominant pre-training objective (inherited from NLP), recent research suggests that multi-objective frameworks can better capture complex structure. In this work, we explore whether the Birdie framework, a reinforcement learning-based, mixture-of-objectives pre-training strategy, can similarly benefit genomic foundation models. We compare a slightly modified Birdie approach with a purely autoregressive, next token prediction baseline on standard Nucleotide Transformer benchmarks. Our results show performance gains in the DNA domain, indicating that mixture-of-objectives training could be a promising alternative to next token prediction only pre-training for genomic sequence modeling. more »

Award ID(s):: 2310113

PAR ID:: 10612834

Author(s) / Creator(s):: Blouir, Samuel; Circi, Defne; Moldwin, Asher; Shehu, Amarda

Publisher / Repository:: ICLR MLGenX Workshop

Date Published:: 2024-04-26

Format(s):: Medium: X

Location:: Singapore

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this