Deciphering RNA splicing logic with interpretable machine learning

Liao, Susan E; Sudarshan, Mukund; Regev, Oded

doi:10.1073/pnas.2221165120

Citation Details

Deciphering RNA splicing logic with interpretable machine learning

Machine learning methods, particularly neural networks trained on large datasets, are transforming how scientists approach scientific discovery and experimental design. However, current state-of-the-art neural networks are limited by their uninterpretability: Despite their excellent accuracy, they cannot describe how they arrived at their predictions. Here, using an “interpretable-by-design” approach, we present a neural network model that provides insights into RNA splicing, a fundamental process in the transfer of genomic information into functional biochemical products. Although we designed our model to emphasize interpretability, its predictive accuracy is on par with state-of-the-art models. To demonstrate the model’s interpretability, we introduce a visualization that, for any given exon, allows us to trace and quantify the entire decision process from input sequence to output splicing prediction. Importantly, the model revealed uncharacterized components of the splicing logic, which we experimentally validated. This study highlights how interpretable machine learning can advance scientific discovery. more »

Award ID(s):: 2226731

PAR ID:: 10527711

Author(s) / Creator(s):: Liao, Susan E; Sudarshan, Mukund; Regev, Oded

Publisher / Repository:: National Academy of Sciences

Date Published:: 2023-10-10

Journal Name:: Proceedings of the National Academy of Sciences

Volume:: 120

Issue:: 41

ISSN:: 0027-8424

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
https://doi.org/10.1073/pnas.2221165120

More Like this