DeepMinimizer: A Differentiable Framework for Optimizing Sequence-Specific Minimizer Schemes

Hoang, M.; Zheng, H.; Kingsford, C.

doi:10.1007/978-3-031-04749-7_4

Minimizers are k-mer sampling schemes designed to generate sketches for large sequences that preserve sufficiently long matches between sequences. Despite their widespread application, learning an effective minimizer scheme with optimal sketch size is still an open question. Most work in this direction focuses on designing schemes that work well in expectation over random sequences, which have limited applicability to many practical tools. On the other hand, several methods have been proposed to construct minimizer schemes for a specific target sequence. These methods, however, require greedy approximations to solve an intractable discrete optimization problem on the permutation space of k-mer orderings. To address this challenge, we propose: (a) a reformulation of the combinatorial solution space using a deep neural network re-parameterization; and (b) a fully differentiable approximation of the discrete objective. We demonstrate that our framework, DeepMinimizer, discovers minimizer schemes that significantly outperform state-of-the-art constructions on genomic sequences.
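For readers unfamiliar with the term, the following is a minimal sketch of the standard minimizer definition that the abstract builds on, not of the DeepMinimizer method itself. It assumes the usual formulation in which every window of w consecutive k-mers contributes its smallest k-mer under some ordering. The function names, the parameters `k` and `w`, the default lexicographic ordering, the leftmost tie-break, and the density normalization (selected k-mers divided by total k-mers) are all illustrative assumptions and do not come from the paper.

```python
def minimizer_positions(seq, k, w, order=None):
    """Start positions of k-mers selected by a (w, k) minimizer scheme.

    `order` maps a k-mer to a sortable key. The default (lexicographic)
    is an assumption made for illustration only.
    """
    if order is None:
        order = lambda kmer: kmer
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    selected = set()
    # Slide a window over every run of w consecutive k-mers.
    for start in range(len(kmers) - w + 1):
        window = range(start, start + w)
        # Pick the smallest k-mer in the window; ties go to the leftmost.
        best = min(window, key=lambda i: (order(kmers[i]), i))
        selected.add(best)
    return sorted(selected)


def density(seq, k, w, order=None):
    """Fraction of k-mers selected: an assumed proxy for sketch size."""
    n_kmers = len(seq) - k + 1
    return len(minimizer_positions(seq, k, w, order)) / n_kmers


if __name__ == "__main__":
    toy = "ACGTACGT"  # hypothetical toy sequence
    print(minimizer_positions(toy, k=2, w=3))
    print(density(toy, k=2, w=3))
```

Passing a different `order` function changes which k-mers are selected and therefore the density. Searching over such orderings for one specific sequence is the discrete optimization problem the abstract describes.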

Award ID(s): 1937540

PAR ID: 10328097

Author(s) / Creator(s): Hoang, M.; Zheng, H.; Kingsford, C.

Editor(s): Pe'er, I.

Date Published: 2022-01-01

Journal Name: RECOMB 2022: Research in Computational Molecular Biology

Volume: 13278

Format(s): Medium: X

Sponsoring Org: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript 1.0
Conference Paper: https://doi.org/10.1007/978-3-031-04749-7_4
