Direct prediction of intrinsically disordered protein conformational properties from sequence

Lotthammer, Jeffrey_M (ORCID:0000000250227006); Ginell, Garrett_M (ORCID:0000000165115480); Griffith, Daniel (ORCID:0000000296339601); Emenecker, Ryan_J; Holehouse, Alex_S (ORCID:0000000241555729)

doi:10.1038/s41592-023-02159-5

Citation Details

Direct prediction of intrinsically disordered protein conformational properties from sequence

Abstract Intrinsically disordered regions (IDRs) are ubiquitous across all domains of life and play a range of functional roles. While folded domains are generally well described by a stable three-dimensional structure, IDRs exist in a collection of interconverting states known as an ensemble. This structural heterogeneity means that IDRs are largely absent from the Protein Data Bank, contributing to a lack of computational approaches to predict ensemble conformational properties from sequence. Here we combine rational sequence design, large-scale molecular simulations and deep learning to develop ALBATROSS, a deep-learning model for predicting ensemble dimensions of IDRs, including the radius of gyration, end-to-end distance, polymer-scaling exponent and ensemble asphericity, directly from sequences at a proteome-wide scale. ALBATROSS is lightweight, easy to use and accessible as both a locally installable software package and a point-and-click-style interface via Google Colab notebooks. We first demonstrate the applicability of our predictors by examining the generalizability of sequence–ensemble relationships in IDRs. Then, we leverage the high-throughput nature of ALBATROSS to characterize the sequence-specific biophysical behavior of IDRs within and between proteomes. more »

Award ID(s):: 2128068 2419923 2213983

PAR ID:: 10488787

Author(s) / Creator(s):: Lotthammer, Jeffrey_M; Ginell, Garrett_M; Griffith, Daniel; Emenecker, Ryan_J; Holehouse, Alex_S

Publisher / Repository:: Nature Publishing Group

Date Published:: 2024-01-31

Journal Name:: Nature Methods

Volume:: 21

Issue:: 3

ISSN:: 1548-7091

Format(s):: Medium: X Size: p. 465-476

Size(s):: p. 465-476

Sponsoring Org:: National Science Foundation

Journal Article:
https://doi.org/10.1038/s41592-023-02159-5

More Like this